Abstract
In the field of education, named entity recognition is applied in tasks such as automatic question generation and intelligent question answering. Traditional Chinese named entity recognition models must modify the network structure to incorporate character and word information, which increases the network's complexity. Moreover, data in the education domain require highly precise identification of entity boundaries; traditional methods do not incorporate position information and therefore recognize entity boundaries poorly. To address these problems, this paper uses an improved vector representation layer that fuses character, word, and position information, which better delimits entity boundaries and improves the accuracy of entity recognition; BiGRU and CRF serve as the model's sequence modeling layer and tagging layer, respectively, for Chinese named entity recognition. Experiments on the Resume dataset and an education dataset (Edu) achieve F1 scores of 95.20% and 95.08%, respectively. The results show that, compared with baseline models, the proposed method improves both training speed and entity recognition accuracy.
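The fusion step described above can be sketched as a simple concatenation of character, word, and position embeddings into one token vector. This is a minimal illustration, not the paper's implementation: the embedding tables, their sizes, and the use of plain concatenation are all assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical embedding tables; vocabulary sizes and dimensions are
# chosen for illustration only and do not come from the paper.
CHAR_DIM, WORD_DIM, POS_DIM = 50, 50, 20
char_table = rng.normal(size=(1000, CHAR_DIM))  # character embeddings
word_table = rng.normal(size=(5000, WORD_DIM))  # word embeddings
pos_table = rng.normal(size=(512, POS_DIM))     # position embeddings

def svr_vector(char_id: int, word_id: int, position: int) -> np.ndarray:
    """Fuse character, word, and position information by concatenation,
    one plausible reading of a 'simple vector representation' (SVR)."""
    return np.concatenate([
        char_table[char_id],   # which character this token is
        word_table[word_id],   # which word the character belongs to
        pos_table[position],   # where the character sits in the sentence
    ])

# One token: character id 42, inside word id 7, at sentence position 3.
v = svr_vector(42, 7, 3)
print(v.shape)  # (120,) = 50 + 50 + 20
```

Because the fusion happens entirely in the representation layer, the downstream BiGRU and CRF layers can stay unchanged, which is the complexity advantage the abstract claims over models that alter the network structure itself.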
Keywords
Chinese NER /
BiGRU-CRF /
simple vector representation layer (SVR)
Funding
Joint Funds of the National Natural Science Foundation of China (U1811261); the Fundamental Research Funds for the Central Universities (N2116019)