全昌勤,何婷婷,姬东鸿,刘辉. 从搭配知识获取最优种子的词义消歧方法[J]. 中文信息学报, 2005, 19(1): 31-36.
QUAN Chang-qin , HE Ting-ting , JIDong-hong , LIU Hui. Chinese WSD Based on Selecting the Best Seeds from Collocations. , 2005, 19(1): 31-36.
Chinese WSD Based on Selecting the Best Seeds from Collocations
QUAN Chang-qin , HE Ting-ting , JIDong-hong , LIU Hui
1.Department of Computer Science and Technology Central China Normal University , Wuhan , Hubei 430079 , China ;2.Institute for Infocomm Research , Heng Mui Keng Terrace , 21 , 119613 , Singapore ;3.Department of Biology , Central China Normal University , Wuhan , Hubei 430079 , China
Abstract:The key problemof word sense disambiguation based on statistic model lies in how to acquiring the word sense indicators automatically. Although it is feasible to acquire a large number of collocations by learning examples , it is hard to select good seeds manually to increase new collocations effectively. The method of selecting the best seeds by machine learning is provided in this paper to solve this problem. The best seeds are used to augment more new word sense indicators ; finally disambiguate polysemous words with the acquired indicators. The average accuracy is 8717 % for 8 polysemous words by this method.
[1 ] Nancy I de , Jean Veronis. Introduction to the Special Issue on Word Sense Disambiguation :The State of the Art [J ] .Computational Linguistics. 1998 , 1 - 42. [2 ] Yarowsky D. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods[A] . In : Proceedings of 33rd Annual Meeting of ACL[C] , Cambridge , Massachusetts , USA , 1995 , 181 - 188. [3 ] HAO Trang Dang , Ching - yi Chia. Simple Features for Chinese Word Sense Disambiguation[A] . In : Proceedings of COLING- 2002 [C] , Philadelphia , USA , 2002 , 88 - 941 [4 ] 郑杰,茅于杭,董清富. 基于语境的语义排歧方法[J ] . 中文信息学报,14 (5) :1 - 7. [5 ] 李涓子,黄昌宁,杨尔弘. 一种自组织的汉语词义排歧方法[J ] . 中文信息学报,13 (3) :1 - 8. [6 ] Lesk , Michael , Automatic Sense Disambiguation : How to tell a Pine Cone from and Ice Cream Cone , Proceeding of the 1986 SIGDOC Conference , Association for Computing Machinery , New York , 1986. [7 ] 姚天顺. 自然语言理解———一种让机器懂得人类语言的研究[M] . 北京:清华大学出版社(第2 版) ,2002.