为支持基于句式结构的大规模树库建设与研究,该文设计了人机结合的可视化语法图解标注系统,通过句式结构的框架约束和词汇知识库的底层支持有效规范了标注结果的结构层次和词性标记,在一定程度上保证了树库标注的一致性和高效率。该文从实践角度介绍了基于句式结构的语法图解标注系统在辅助构建大规模汉语树库中的操作模式和功能。
Abstract
This paper designed a human-computer interaction graphical syntax tagging system based on the Sentence Pattern Structure. It's designed directly to support the Treebank constructing and deeply research base on the Sentence Pattern Structure. With the constraint of sentence pattern system and the supprot of lexical knowledge database, the hierarchy and word type tags of results are normalized effectively. To a certain extent, the consistency and quality of syntax results can be ensured. This paper illustrated the creative mode and experience of this system from the perspective of practice.
关键词
树库 /
句本位语法 /
句式结构 /
图解标注
{{custom_keyword}} /
Key words
treebank /
sentence-based grammar /
sentence pattern structure /
syntax tagging
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 王跃龙,姬东鸿.汉语树库综述[J].当代语言学,2009(1):47-55.
[2] 周强,张伟,俞士汶.汉语树库的构建[J].中文信息学报,1997,11(4):43-52.
[3] 周强.汉语句法树库标注体系[J].中文信息学报,2004,18(4):1-8.
[4] 王慧兰.汉语句类依存树库的构建研究[J].北京大学学报(自然科学版),2013(1):25-30.
[5] 赵怿怡,关润池.汉语依存树库的构建[A].第三届学生计算语言学研讨会论文集[C],2006.
[6] 彭炜明, 宋继华, 王宁. 基于句式结构的汉语图解析句法设计[J]. 计算机工程与应用.2014,50(06):11-18.
[7] Jing He, Weiming Peng, Jihua Song, et al. Anatation Schema for Contemporary Chinese Based on JinXi Lis Grammar System. In: Proceedings of The 14th Chinese Lexical Semantics Workshop (CLSW2013), LNAI,Volume 8229, Springer,2013:668-681.[8] 彭炜明,何静,宋继华.句本位语法图解析句系统的设计与实现[C]//第四届数字典藏与数字人文国际研讨会.台湾:2012.11.30.
[9] 彭炜明,宋继华,王宁,康明吉.汉语传统语法及其在中文信息处理中的应用展望[J].中文信息学报,2012,26(4):50-60.
[10] 彭炜明,宋继华,俞士汶.中文信息处理的词法问题——以句本位语法图解树库构建为背景[J].中文信息学报,2014,28(02):1-7.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}