王 敬;杨丽姣;蒋宏飞;苏靖杰;付静玲. 汉语二语教学领域词义标注语料库的研究及构建[J]. 中文信息学报, 2017, 31(1): 221-229.
WANG Jing; YANG Lijiao; JIANG Hongfei; SU Jingjie; FU Jingling. A Word Sense Annotated Corpus for Teaching Chinese as Second Language. , 2017, 31(1): 221-229.
汉语二语教学领域词义标注语料库的研究及构建
王 敬,杨丽姣,蒋宏飞,苏靖杰,付静玲
北京师范大学 中文信息处理研究所,北京 100875
A Word Sense Annotated Corpus for Teaching Chinese as Second Language
WANG Jing, YANG Lijiao, JIANG Hongfei, SU Jingjie, FU Jingling
Institute of Chinese Information Processing, Beijing Normal University, Beijing 100875, China
Abstract:In field of teaching Chinese as a second language, the teaching of word is very important, in which polysemous word is a challenging issue. After a survey of 3 classical vocabularies in this field, this paper selects 1 181 polysemous words. Then an annotation specification is designed, with a reference to Modern Chinese Dictionary (Edition 6). Tagging the 1 181 words appeared in 197 popular Chinese textbooks yields a corpus with word senense annotation over 3.5 million characters. A quantitative study on the 1 811 polysemous words is also made, with an analysis of the distribution of total 4 323 word senses.