中文篇章级句间语义关系体系及标注

张牧宇,秦 兵,刘 挺

PDF(1021 KB)
PDF(1021 KB)
中文信息学报 ›› 2014, Vol. 28 ›› Issue (2) : 28-36.
语言分析与生成

中文篇章级句间语义关系体系及标注

  • 张牧宇,秦 兵,刘 挺
作者信息 +

Chinese Discourse Relation Semantic Taxonomy and Annotation

  • ZHANG Muyu, QIN Bing, LIU Ting
Author information +
History +

摘要

篇章句间关系(Discourse Relation)是篇章级语义分析的重要内容,该文在英文篇章句间关系研究的基础上分析了中英文间的差异,总结了中文篇章级语义分析的特点,并在此基础上提出面向中文篇章句间关系的层次化语义关系体系,对句间关系类型进行详细描述。为了验证体系的合理性和完备性,我们在互联网新闻语料上进行了标注实践,分析了标注中遇到的难点并给出解决方案,为进一步的中文篇章级语义分析工作奠定基础。

Abstract

Discourse Relation is an important part of discourse semantic analysis. This paper analyses the differences between Chinese and English discourses, then presents the first Chinese discourse relation taxonomy based on the English discourse relation researches in details. Aiming at the rationality of the hierarchy, we conducts annotation experiments on Chinese internet news texts and analyses all difficulties happened during the data annotation together with the resolution to lay a foundation for the future discourse semantic analysis.

关键词

中文篇章级语义分析 / 句间关系 / 语义体系 / 语料标注

Key words

Chinese discourse semantic analysis / discourse relation / semantic taxonomy / data annotation

引用本文

导出引用
张牧宇,秦 兵,刘 挺. 中文篇章级句间语义关系体系及标注. 中文信息学报. 2014, 28(2): 28-36
ZHANG Muyu, QIN Bing, LIU Ting. Chinese Discourse Relation Semantic Taxonomy and Annotation. Journal of Chinese Information Processing. 2014, 28(2): 28-36

参考文献

[1] D Marcu. The theory and practice of discourse parsing and summarization[M]. MIT Press, 2000.
[2] R Girju. Automatic detection of causal relations for questions answering[C]//Proceedings of the ACL 2003 Workshop on Multilingual Summarisation and Question Answering, 2003: 76-83.
[3] S Somasundaran, J Wiebe, J Ruppenhofer. Discourse-level opinion interpretation[C]//Proceedings of Coling 2008.
[4] Zhou L, Li B, Gao W, et al. Unsupervised Discovery of Discourse Relations for Eliminating Intra-sentence Polarity Ambiguities[C]//Proceedings of EMNLP 2011 (Oral presentation), Edinburgh, Scotland, July 27-31, 2011.
[5] E Pitler, A Nenkova. Revisiting readability: A unified framework for predicting text quality[C]//Proceedings of EMNLP 2008: 186-195.
[6] Ziheng Lin, Hwee Tou NG, Min-Yen Kan. Automatically Evaluating Text Coherence Using Discourse Relations[C]//Proceedings of ACL-HLT: 997-1006.
[7] Morris J, Hirst G. Lexical cohesion computed by thesaural relations as an indicator of the structure of text[J]. Computational Linguistics, 1991, 17(1):21-48.
[8] Grosz Barbara J, Aravind K Joshi, Scott Weinstein. Centering: A Framework for Modelling the Local Coherence of Discourse[J]. Computational Linguistics, 1995,21/2: 203-25.
[9] Fillmore, Charles J. Frame semantics and the nature of language[J]. In Annals of the New York Academy of Sciences: Conference on the Origin and Development of Language and Speech, 1976, 280: 20‐32.
[10] Schank, R C, A belson, R Scripts, Plans, Goals, and Understanding [M]. Hillsdale, N J: Earlbaum Assoc, 1977.
[11] Mann William C, Sandra A Thompson. Rhetorical structure theory[C]//Proceedings of Toward a fanctional theory of text organizition Text 8.3.1988: 243-281.
[12] Marti A. Hearst. TextTiling: Segmenting text into multi-paragraph subtopic passages[J]. Computational Linguistics, 1997, 23(1):33-64.
[13] R Prasad, N Dinesh, A Lee, et al. The Penn Discourse Treebank 2.0[C]//Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008).
[14] Daniel Marcu. The Rhetorical Parsing[C]//Proceedings of Summarization, and Generation of Natural Language Texts. PhD thesis, University of Toronto, 1997.
[15] Radu Soricut, Daniel Marcu. Sentence level discourse parsing using syntactic and lexical information[C]//Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Edmonton, Canada, 2003.
[16] David duVerle, Helmut Prendinger. A novel discourse parser based on Support Vector Machine classification[C]//Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Singapore, 2009.
[17] VW Feng, Hirst. Text-level Discourse Parsing with Rich Linguistic Features[C]//Proceedings Of ACL 2012.
[18] E Pitler, Ani Nenkova. Using syntax to disambiguate explicit discourse connectives in text[C]//Proceedings of the ACL-IJCNLP 2009, Conference Short Papers, Singapore, 2009.
[19] E Pitler, Annie Louis, Ani Nenkova. Automatic sense prediction for implicit discourse relations in text[C]//Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Singapore, 2009.
[20] Ben Wellner, James Pustejovsky. Automatically identifying the arguments of discourse connectives[C]//Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, Czech Republic, 2007.
[21] Robert Elwell, Jason Baldridge. Discourse connective argument identification with connective specific rankers[C]//Proceedings of the IEEE International Conference on Semantic Computing, Washington, DC, USA, 2008.
[22] Ziheng Lin, Min-Yen Kan, Hwee Tou Ng. Recognizing implicit discourse relations in the Penn Discourse Treebank[C]//Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 2009.
[23] Ziheng Lin, Hwee Tou Ng, Min-Yen Kan. A PDTB-styled end-to-end discourse parser[C]//Proceedings of Technical Report TRB8/10, School of Computing, National University of Singapore, August, 2010.
[24] WenTing Wang, Jian Su, Chew Lim Tan. Kernel based discourse relation recognition with temporal ordering information[C]//Proceedings of the 48th Annual Meeting of the Association for Computation, 2010.
[25] Z Zhou, Y Xu, Z Niu, et al. Predicting discourse connectives for implicit discourse relation recognition[C]//Proceedings of Coling 2010: 1507-1514.
[26] C Chiarcos. Towards the Unsupervised Acquisition of Discourse Relations[C]//Proceedings of ACL, 2012.
[27] Rashmi Prasad, Samar Husain, Dipti Sharma, Aravind Joshi. Towards an annotated corpus of discourse relations in Hindi[C]//Proceedings of the Third International Joint Conference on Natural Language Processing, Hyderabad, India,2008b.
[28] Deniz Zeyrek, Bonnie Webber. A Discourse Resource for Turkish: Annotating Discourse Connectives in the METU Corpus[C]//Proceedings of IJCNLP-2008. Hyderabad, India, 2008.
[29] Amal Al-Saif and Katja Markert. Modelling discourse relations for Arabic[C]//Proceedings, Empirical Methods in Natural Language Processing, 2011: 736-747.
[30] Xue Nianwen. Annotating discourse connectives in the Chinese Treebank[C]//Proceedings of The ACL Workshop in Frontiers in Annotation II: Pie in the Sky. Ann Arbor, Michigan: ACL, 2005.
[31] Yuping Zhou, Nianwen Xue. PDTB-style Discourse Annotation of Chinese Text[C]//Proceedings of ACL 2012.
[32] 王力. 《王力文集》. 山东: 山东教育出版社. 1984: 35-36.

基金

国家自然科学基金重点项目(61133012);国家自然科学基金面上项目(61273321);国家863前沿技术研究项目(2012AA011102)
PDF(1021 KB)

889

Accesses

0

Citation

Detail

段落导航
相关文章

/