LIU Yingying, LUO Senlin, FENG Yang, HAN Lei, CHEN Gong, WANG Qian. BFS-CTC: A Chinese Corpus of Sentential Semantic Structure[J]. Journal of Chinese Information Processing, 2013, 27(1): 72-81.
BFS-CTC: A Chinese Corpus of Sentential Semantic Structure
LIU Yingying, LUO Senlin, FENG Yang, HAN Lei, CHEN Gong, WANG Qian
Lab of Information Security & Countermeasures Technology, School of Information & Electronics, Beijing Institute of Technology, Beijing 100081, China
Abstract: Sentential semantic structure analysis is an important issue in Chinese semantic analysis. Based on Modern Chinese Semantics, this paper establishes a hierarchical model of Chinese sentential semantic structure, defines the annotation standard and the tagset, and thus constructs a Chinese corpus of sentential semantic structure, BFS-CTC (Beijing Forest Studio - Chinese Tagged Corpus). All sentences in the corpus are tagged at the lexical, syntactic, and whole-sentence semantic structure levels, which makes it easy to analyze the relation between syntax and semantics. The core of BFS-CTC consists of four banks: the original sentence bank (OSB), the lexical tagged bank (LTB), the syntax tagged bank (STB), and the semantic structure tagged bank (SSTB). The current version contains more than 10,000 sentences drawn from news texts, covering six major sentence types in Chinese.
Key words: natural language processing; semantic analysis; sentential semantic structure; corpus