句子语义分析是语言研究深入发展的客观要求,也是当前制约语言信息处理技术深度应用的主要因素。在探索深层语义分析方法的基础上,该文根据汉语的特点,提出了一整套语义依存图的构建方法,并建立了一个包含30 000个句子的语义依存图库。以兼语句为重点研究对象,该文研究了语料库中所有纯粹的兼语句所对应的句模情况,进而试图构建基于语义依存图的句模系统,总结句型和句模的映射规则,从而为更好的建立语义自动分析模型提供相应的知识库。
Abstract
Semantic analysis of sentences is essential to language study, and it is also the major bottleneck restricting the large-scale application of language information technology at present. Based on the study of deep semantics analysis methods, we propose a new semantic analysis method-Semantic Dependency Graph, and construct a corpus consisting of 30,000 sentences. Furthermore, we make a study on the semantic sentence patterns of pure pivotal sentences in the corpus, and try to construct the system of semantic sentence patterns based on semantic dependency graph. We also summarize corresponding relations between sentence patterns and semantic sentence patterns to provide the automatic semantic parsing system with a corresponding knowledge base.
Key words semantic sentence patterns; semantic analysis; semantic dependency graph; pivotal sentence
关键词
句模 /
语义分析 /
语义依存图 /
兼语句
{{custom_keyword}} /
Key words
semantic sentence patterns /
semantic analysis /
semantic dependency graph /
pivotal sentence
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 孙道功,亢世勇,孙茂松.基于标注语料库的现代汉语单句句型句模的对应关系研究[C].全国第八届计算语言学联合学术会议论文集,2005:234-240.
[2] 孙道功,亢世勇,孙茂松.面向语言处理的单句句型句模对应关系研究——基于标注语料库的定量考察[J].计算机工程与应用,2006,33: 170-173.
[3] 黄晓冬.试论兼语短语的语义结构[J].四川师范大学学报,2009,36(6):38-43.
[4] Fillmore Charles J, Collin F Baker. FrameNet:Frame Semantics Meets the Corpus[D].Poster presentation,74th Annual Meeting of the Linguistics Society of America, 2000.
[5] Palmer M,Gildea D,Kingsbury P. The Proposition Bank:A Corpus Annotated with Semantic Roles[J].Computational Linguistics,2005,31(1):71-105.
[6] Chang Liu, HweeTou Ng. Learning Predictive Structures for Semantic Role Labeling of NomBank[C]//Proceedings of the ACL 2007: 208-215.
[7] N Xue, M Palmer. Annotating the Propositions in the Penn Chinese Treebank. Q Ma, F Xia, (Editors)[C]//Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, 2003: 47-54.
[8] Liping You, Kaiying Liu. Building Chinese FrameNet Database[C]//Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, 2005: 301-306.
[9] Nianwen Xue.Annotating the predicate-argument structure of Chinese nominalizations[C]//Proceedings of the fifth international conference on Language Resources and Evaluation, Genoa, Italy, 2006.
[10] Min Zhang, Wanxiang Che, Guodong Zhou, et al. Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kemel[C]//Proceedings of the IEEE Transactions on Audio Speech and Language Processing 2008, 16(7):1315-1329.
[11] Wanxiang Che, Min Zhang, Ai Ti Aw, et al. Using a Hybrid Convolution Tree Kernel for Semantic Role Labeling[C]//Proceedings of the ACMTransactions on Asian Language Information Processing 2008, 7(4):1-23.
[12] Xue NW. Labeling Chinese Predicates with Semantic Roles[J]. Computational Linguistics, 2008,34(2):225-255.
[13] Hacioglu K. Semantic Role Labeling using Dependency trees[C]//Proceedings of the COLING04 Proceedings of the 20th international conference on Computational Linguistics. 2004: 1273-1281.
[14] 丁伟伟, 常宝宝. 基于语义组块分析的汉语语义角色标注[J]. 中文信息学报, 2009, 23(5):53-61,74.
[15] Carreras X, Mrques L. Introduction to the conll2005 shared task: semantic role labeling[C]//Proceedings of the CONLL 05 Proceedings of the 9th Conference on Computational Natural Language Learning.Stroudsburg: Association for Computational Linguistics. 2005: 152-164.
[16] 尤昉, 李涓子, 王作英. 基于语义依存关系的汉语语料库的构建[J]. 中文信息学报, 2003,17(1): 46-53.
[17] B Chen, D Ji. Chinese Semantic parsing Based on dependency Graph[C]//Proceedings of the First International Conference on Electronic & Mechanical Engineering and Information Technology, 2011:1730-1734.
[18] 范晓.三个平面的语法观[M].北京: 北京语言学院出版社,1996.
[19] 范晓,朱晓亚.论句模研究的方法[J].徐州师范大学学报,1999,25(4):18-23.
[20] 朱晓亚.现代汉语句模研究[M].北京: 北京大学出版社,2001.
[21] 徐昌火.试论句模研究的对象、起点和基本原则[J].南京师范大学学报,1999, 4:101-108.
[22] 鲁川.汉语语法的意合网络[M].北京: 商务印书馆,2001.
[23] 田珍都.基于标注语料库的现代汉语句模研究[D].烟台师范学院硕士学位论文,2002.
[24] 亢世勇,许小星.现代汉语句系系统的构建和研究[J].中文信息学报,2010,24(1):103-109.
[25] 黄伯荣,廖旭东.现代汉语(第四版)[M].北京: 高等教育出版社,2007.
[26] 许小星.基于标注语料库的现代汉语特殊句式语义分析[D].烟台: 鲁东大学硕士学位论文,2007.
[27] Robinson J J.Dependency Structures and Transformation Rules[J]. Language,1970,46(2): 259-285.
[28] 郑丽娟, 邵艳秋, 杨尔弘. 中文非投射语义依存现象分析研究[J]. 中文信息学报,2014, 28(6):41-47.
[29] Qiang Dong, ZhendongDong.Hownet and Computation of Meaning[M].World Scientific Publishing Company,2006.
[30] 冯志伟. 中文信息处理与汉语研究[M]. 北京: 商务出版社,1992.
[31] 袁毓林. 基于认知的汉语计算语言学研究[M].北京大学出版社,2008.
[32] 游汝杰. 现代汉语兼语句的句法和语义特征[J]. 汉语学习,2002, 6:1-6.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金(61170144),国家语委十二五科研规划重点项目(ZDI125-41),2014年度北京语言大学中青年学术骨干支持计划,北京语言大学研究生创新基金(中央高校基本科研业务费专项资金)(15YCX102)
{{custom_fund}}