Tibetan Entity Relation Extraction Based on Joint Model
XIA Tianci1,2, SUN Yuan1,2
1.School of Information Engineering, Minzu University of China, Beijing 100081, China; 2.Minority Languages Branch, National Language Resource and Monitoring Research Center, Minzu University of China, Beijing 100081, China
Abstract:Extracting the entities and the relationship between them from unstructured texts is a challenging issue. This paper applies the joint model in Tibetan to perform the entity identification and relation extraction at the same time. An end-to-end sequence labelling framework of BiLSTM is adopted, and the POS information is integrated to enhance the performance. It is also demonstrated that the character-level processing method is more effective in Tibetan than the word-level processing. The experimental results show that the method improves the accuracy by 30%~40%, compared the SVM and LR.
[1] Alexandre Passos,Vineet Kumar,Andrew McCallum. Lexicon infused phrase embeddings for named entity resolution[C]//Proceedings of International Conference on Computational Linguistics,2014: 78-86. [2] Gang Luo,et al. Joint entity recognition and disambiguation[C]//Proceedings of Conference on Empirical Methods in Natural Language Processing,2015: 879-888. [3] Jason PC Chiu,Eric Nichols. Named entity recognition with bidirectional lstm-cnns[C]//Processings of Transactions of the Association for Computational Linguistics,2015. [4] Zhiheng Huang,Wei Xu,Kai Yu. Bidirectional lstm-crf models for sequence tagging.arXiv preprint arXiv:1508.01991,2015. [5] Guillaume Lample,et al.Neural architectures for named entity recognition[C]//Proceedings of the NAACL international conference,2016. [6] Bryan Rink,et al.Utd: Classifying semantic relations by combining lexical and semantic resources[C]//Proceedings of the 5th International Workshop on Semantic Evaluation,2010: 256-259. [7] Nanda Kambhatla. Combining lexical,syntactic,and semantic features with maximum entropy models for extracting relations[C]//Proceedings of the 43th ACL International Conference,2004: 22. [8] Kun Xu,et al. Semantic relation classificationvia convolutional neural networks with simple negative sampling[C]//Proceedings of the EMNLP,2015a. [9] Suncong Zheng,et al.A neural network framework for relation extraction: Learning entity semantic and relation pattern[J].KnowledgeBased Systems,2016(114):12-23. [10] Xiang Ren,et al.Cotype: Joint extraction of typed entities and relations with knowledge bases[C]//Proceedings of the 26th WWW International Conference,2017. [11] Bishan Yang,Claire Cardie. Joint inference for fine-grained opinion extraction[C]//Proceedings of the 51rd Annual Meeting of the Association for Computational Linguistics,2013:1640-1649. [12] Sameer Singh,et al.Joint inference of entities,relations,and coreference[C]//Proceedings of the 2013 Workshop on Automated Knowledge Base Construction.ACM,2013: 1-6. [13] Makoto Miwa,Mohit Bansal. End-to-end relation extraction using lstms on sequences and tree structures[C]//Proceedings of the 54rd Annual Meeting of the Association for Computational Linguistics,2016. [14] Suncong Zheng,et al.Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme[J].arXiv preprint,2017,arXiv:1706.05075. [15] 金明,杨欢欢,单广荣.藏语命名实体识别研究[J].西北民族大学学报(自然科学版),2010(3):49-52. [16] 罗智勇,宋柔,朱小杰.藏族人名汉译名识别研究[J].情报学报,2009(3):475-480. [17] 华却才让,等.基于感知机模型藏文命名实体识别[J].计算机工程与应用,2014,50(15):172-176. [18] 刘飞飞,王志娟.基于层次特征的藏文人名识别研究[J/OL].计算机应用研究,2018(09):1-7[2018-03-22]. [19] 龙从军,周学文.藏语名词语义关系研究[C].全国少数民族青年自然语言处理学术研讨会,2008. [20] 马宁,等.面向互联网的藏文实体关系模板获取技术研究[J].中央民族大学学报(自然科学版),2015,24(01):35-39. [21] 何鸿业,郑瑾,张祖平.基于词性结合的卷积神经网络文本情感分析[J/OL].计算机工程:1-7[2018-03-13].