康司辰; 刘扬. 基于语义构词的汉语词语语义相似度计算[J]. 中文信息学报, 2017, 31(1): 94-101.
KANG Sichen; LIU Yang. Semantic Word-formation Based Chinese Word Similarity Computing[J]. Journal of Chinese Information Processing, 2017, 31(1): 94-101.
Semantic Word-formation Based Chinese Word Similarity Computing
KANG Sichen1,3, LIU Yang2,3
1. Department of Chinese Language and Literature, Peking University, Beijing 100871, China; 2. Institute of Computational Linguistics, Peking University, Beijing 100871, China; 3. Key Laboratory of Computational Linguistics (Ministry of Education), Peking University, Beijing 100871, China
Abstract: Chinese word similarity computing plays an important role in Chinese information processing. Based on the notion of character orientation, Chinese semantic word-formation knowledge, including word POS, word-formation patterns, and morphemic concepts, is employed to compute Chinese word similarity. This lexical knowledge representation is simple, intuitive, and easy to extend, and the model is straightforward, using as few features and parameters as possible. Experimental results show that the approach is promising on typical sampled word pairs. In addition, the similarity values are more in line with human cognition and are reasonably distributed over the global data.