Ontology and Dependency Syntax Based Word Semantic Relation Annotation and Its Evaluation
XIONG Jing1, ZHI Liping1, YUAN Dong2
1. School of Computer and Information Engineering, Anyang Normal University, Anyang, Henan 455000, China; 2. State Key Laboratory of high-performance server and storage technology, Jinan, Shandong 250101,China
摘要为弥补传统的语义标注方法在词语或句子成分之间关系描述方面的不足,该文提出了一种基于本体和依存句法的非结构化文本语义关系标注算法。算法以句子为单位,综合POS(Part of Speech)、语义辞典、语言学特征等因素对句子中词汇的语义关系进行识别,利用词语间的依存关系对词语进行语义组合,从而实现词汇语义关系标注。结合语义标注过程中的语义匹配度、语义丰富度等特征,设计了评价算法,用以衡量标注结果的正确性。实验结果表明,该标注算法能获得较高的准确率,在大规模语料下效果尤为显著。
Abstract：In bridge the gap between words and syntactic components in current semantic annotation, a semantic annotation method based on ontology and dependency syntax for unstructured text is proposed. Applied in the sentence level, this method employs the features including POS, semantic dictionary, and other linguistic features, and determines the the lexical semantic relations by the dependency structure between them.. Meanwhile, an evaluation metric combing features like semantic similarity and semantic richness are designed, which is essentially the confidence of the method itself. Experimental results show that the semantic tagging algorithm can reach high accuracy especially on large-scale corpus.