依存句法是句法分析的一种,相比于短语结构句法分析,依存句法具有更简洁的表达方式。该文采用英文语义角色标注的研究方法,实现了一个基于中文依存句法分析的语义角色标注系统。该系统针对中文依存关系树,采用有效的剪枝算法和特征,使用最大熵分类器进行语义角色的识别和分类。系统使用了两种不同的语料,一种是由标准短语结构句法分析(CTB5.0)转换而来,另一种是CoNLL2009公布的中文语料。系统分别在两种语料的标准谓词和自动谓词的基础上进行实验,在标准谓词上取得的F1值分别为84.30%和81.68%,在自动谓词上的F1值为81.02%和81.33%。
Abstract
Dependency representations are more simple and intuitive than constituent representations for Chinese parse. This paper implements a Chinese dependency parse based semantic role labeling (SRL) by using the similar methods in English SRL. In the system, effective pruning algorithm and useful features are adopted for Chinese dependency tree, and the semantic role identification and classification are accomplished by a maximum entropy classifier. Two different corpora are adopted to test our system, one is transferred from constituent-based corpus (CTB5.0), and the other is Chinese dataset provided by CoNLL 2009 shared task. Based on the two datasets, the system achieves, respectively, 84.30% and 81.68% in labeled F1 for gold predicates, and 81.02% and 81.33% for automatic predicates.
Key wordscomputer application; Chinese information processing; Semantic Role Labeling;Dependency Relations;maximum entropy classifier
关键词
计算机应用 /
中文信息处理 /
语义角色标注 /
依存关系 /
最大熵分类器
{{custom_keyword}} /
Key words
computer application /
Chinese information processing /
Semantic Role Labeling /
Dependency Relations /
maximum entropy classifier
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] CoNLL 2008,http://www.yr-bcn.es/conll2008/, [EB].
[2] CoNLL 2009,http://ufal.mff.cuni.cz/conll2009-st/,[EB].
[3] Kadri Hacioglu. Semantic Role Labeling Using Dependency Trees[C]//Proc.of CoNLL-2004,Boston,MA,US,2004.
[4] Johansson R. and Nugues P.. Dependency-based semantic role labeling of PropBank[C]//Proceedings of EMNLP-2008. 2008.
[5] Johansson R. and Nugues P.Dependency-based syn-
tactic-semantic analysis with PropBank and NomBank[C]//Proceedings of CoNLL-2008.23-24 Aug. 2008.
[6] Xue Nianwen,Palmer M .Automatic semantic role labeling for Chinese verbs[C]//Proc. of IJCAI-2005,2005.
[7] Xue Nianwen, Palmer M. Calibrating features for semantic role labeling[C]//Proc. of EMNLP-2004,2004.
[8] 袁晓虹,王步康,王红玲,周国栋: 基于依存关系的中文谓词标注研究[C]//全国第十届计算语言学学术会议(CNCCL-2009),烟台,2009.7.
[9] Gildea D,Jurafsky D. Automatic labeling of semantic roles[J] .Computational Linguistics,2002,28(3):245-288.
[10] 刘挺,车万翔,李生.基于最大熵分类器的语义角色标注[J].软件学报,2007,18(3): 565-573.
[11] Wang Hongling,Zhou Guodong,Zhu Qiaoming and Qian Peide.Exploring various features in semantic role labeling[C]//Proceedings of ALPIT’2008. 2008:23-25 .
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金资助项目(60673041,60873150);国家教育部博士点基金资助项目(200802850006);江苏省自然科学基金资助项目(BK2008160);江苏省高校自然科学重大基础研究项目(08KJA520002)。
{{custom_fund}}