词义知识表示主要依赖属性描述或分类描述,这两种方式各有所长,但不同表示之间相互转换的可行性与现实状况还未被关注。在属性描述的基础上,该文引入序关系的思想,提出基于特征序列的概念与方法,以此来模拟、分析概念涵义从一般到特殊的渐次生成过程,发掘尚未显性化的中间概念,自动构建出一个语义分类体系。以HowNet(2000版)数据为例,实验表明该方法可以生成一个性质优良、覆盖完全的新的语义分类体系,并反映此前的属性描述在语言知识工程实践中不易察觉的一些问题。
Abstract
Feature description and taxonomic description are two basic knowledge representations widely employed in lexical semantics. However, the the transformation between them remains an open issue with well discussion. In this paper, we applies the notion of ordering relationship into the feature description, and automatically derive a taxonomy from general to specific concepts, in which the previous undefined intermediate concepts are revealed. Experiments on HowNet (2000) show that a semantic taxonomy, with a fine-defined inheritance and a full coverage of all concepts, can be automatically generated by this approach. Further analysis of the output also indicates some underlined defects in the feature description for natural language knowledge engineering.
关键词
词义知识 /
属性描述 /
分类描述 /
序关系 /
特征序列 /
语义分类体系
{{custom_keyword}} /
Key words
lexical semantics /
feature description /
taxonomic description /
ordering relation /
feature sequences /
semantic taxonomy
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] T Gruber. Toward Principles for the Design of Ontologies Used for Knowledge Sharing[J]. International Journal of Human-Computer Studies, 1995, 43 (5-6): 907-928.
[2] 邓志鸿, 唐世渭, 张铭,等. Ontology研究综述[J]. 北京大学学报(自然科学报), 2002, 38(9): 728-730.
[3] 董振东. 知网(2000版)[DB/OL], http://www.keenage.com.
[4] G A Miller, B Richard, F Christiane, et al. Introduction to WordNet: An On-line Lexical Database[R]. Five Papers on WordNet, CSL Report 43, Cognitive Science Laboratory, Princeton University, 1993.
[5] P W Wong, P Fung. Nouns in Wordnet and HowNet: An Analysis and Comparison of Semantic Relations[C]//Proceedings of GWC′02, India, 2002: 319-322.
[6] 卢鹏, 孙明勇, 陆汝占. 基于知网的词汇语义自动分类系统[J]. 计算机仿真, 2004, 21(2): 127-133.
[7] H P Chen, L He, B Chen. Research and Implementation of Ontology Automatic Construction Based on Relational Database[C]//Proceedings of International Conference on Computer Science and Software Engineering-CSSE, 2008: 1078-1081.
[8] N Liu, G Y Li, Y F Zhang. Research on Domain Ontology Semi-automatic Construction Model towards Chinese Text[C]//Proceedings of International Convention on Information and Communication Technology, Electronics and Microelectronics-MIPRO, 2010.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家重点基础研究发展计划资助项目(2014CB340504);国家社科基金重大项目(12&ZD119)。
{{custom_fund}}