基于多粒度认知的命名实体识别方法
作者:
作者单位:

四川大学计算机学院

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家重点基础研究发展计划


Named entity recognition method based on multi-granularity cognition
Author:
Affiliation:

College of Computer Science,Sichuan University

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在数据匮乏的领域,命名实体识别效果受限于欠拟合的字词特征表达,引入常规的多任务学习方法可以有所改善,但需要额外的标注成本.针对这一问题,提出了一种基于多粒度认知的命名 实体识别方法,在不产生额外标注成本的前提下,增强字特征信息,提高命名实体识别效果.该方法从多粒度认知理论出发,以BiLSTM和CRF为基础模型,将字粒度下的命名实体识别任务与句 子全局粒度下的实体数量预测任务相联合,共同优化字嵌入表达.三个不同类型的数据集上的多组实验表明,引入多粒度认知的方法有效地提升了命名实体识别效果.

    Abstract:

    In the field of data scarcity, the performance of named entity recognition is limited by the expression of underfitting word features. The named entity recognition effect can be improved by introducing conventional multitask learning methods, but additional labeling costs are required. Aiming at addressing this problem, we propose a new named entity recognition method based on multigranularity cognition, which can enhance the character feature information and improve the performance of named entity recognition without incurring additional tagging costs. In order to optimize the expression of word embedding, in this approach, we start from the multi granularity cognition theory and use BiLSTM and CRF as the basic model, the task of named entity recognition under word granularity is combined with the task of entity number prediction under sentence global granularity. Multiple experiments on three different types of data sets show that the method of introducing multigranularity cognition method can effectively improve the performance of named entity recognition.

    参考文献
    相似文献
    引证文献
引用本文

引用本文格式: 李攀锋,陈樱珏,钟泠韵,林锋. 基于多粒度认知的命名实体识别方法[J]. 四川大学学报: 自然科学版, 2022, 59: 022004.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-06-19
  • 最后修改日期:2021-08-17
  • 录用日期:2021-09-08
  • 在线发布日期: 2022-04-01
  • 出版日期: