1.School of Chinese Language and Literature, Ludong University, Yantai, Shandong 264025, China;
2.Key Laboratory of Computational Linguistics at Peking University, Ministry of Education, Beijing 100871, China;
3.Collaborative Innovation Center for Language Ability, Xuzhou, Jiangsu 221009, China
Abstract:Part-of-speech annotation has attracted extensive attention from the areas including Chinese information processing, Chinese grammar study and Chinese lexicographer. Multiple part-of-speech systems have been proposed and there are significant differences between these systems. So far, little research has been done to systematically compare different large-scale part-of-speech annotations. Based on the part-of-speech annotation results in Dictionary of Contemporary Chinese and Grammatical Knowledge-Base Dictionary, this paper proposes a mapping algorithm, which can detect part-of-speech differences in two dictionaries automatically. Further, we analyze the differences and conclude in two perspectives. 1) about 83.5% of the part-of-speech annotation results is identical. and 2) all the differences can be attributed to three effects: part-of-speech shifting, different part-of-speech annotation standards and different senses.