ZHANG Tao, LIU Kang, ZHAO Jun. A Graph-based Similarity Measure between Wikipedia Concepts and Its Application in Entity Linking System[J]. Journal of Chinese Information Processing, 2015, 29(2): 58-67.
A Graph-based Similarity Measure between Wikipedia Concepts and Its Application in Entity Linking System
ZHANG Tao, LIU Kang, ZHAO Jun
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Abstract: Entity linking is the task of mapping entity mentions in a document to their corresponding entities in a knowledge base (KB). In this paper, we briefly introduce the traditional entity linking system and point out its key problem: measuring the semantic similarity between the content of an entity mention and the document of a candidate entity. We then propose a novel semantic relatedness measure between Wikipedia concepts based on the graph structure of Wikipedia. Using this measure, we present a novel learning-to-rank framework that leverages the rich semantic information derived from Wikipedia to address the entity linking task. Experimental results show that the performance of the system is comparable to state-of-the-art results.