徐 凡,王明文,谢旭升,李茂西,万剑怡. 基于主位-述位结构理论的英文作文连贯性建模研究[J]. 中文信息学报, 2016, 30(1): 115-124.
XU Fan, WANG Mingwen, XIE Xusheng, LI Maoxi, WAN Jianyi. Coherence Modeling for English Student Essay Based on Theme-rheme Structure Theory. , 2016, 30(1): 115-124.
基于主位-述位结构理论的英文作文连贯性建模研究
徐 凡,王明文,谢旭升,李茂西,万剑怡
江西师范大学 计算机信息工程学院,江西 南昌 330022)
Coherence Modeling for English Student Essay Based on Theme-rheme Structure Theory
XU Fan, WANG Mingwen, XIE Xusheng, LI Maoxi, WAN Jianyi
School of Computer Information Engineering, Jiangxi Normal University, Nanchang, Jiangxi 330022, China
Abstract:This paper presents an unsupervised theme-rheme structure theory based discourse coherence model, in contrast to the current supervised entity based model and the discourse relation grid based model. Our model describes discourse coherence via calculating the similarity between theme or rheme of adjacent sentences through incorporating more semantic knowledge like word stem, hypernym, hyponym, synonym and paraphrase etc. Meanwhile, this paper also presents a simple and effective coherence model based on counting the number of discourse relations within a discourse, and integrates the theme-rheme-based model using linear combination method. Evaluation on benchmark English student essay dataset reveals the effectiveness of our linear combination discourse coherence model, significantly outperforming baselines the literature.
[1] 黄国文. 语篇分析概要[M]. 长沙:湖南教育出版社,1987:1-221.
[2] Fox H J. Phrasal cohesion and statistical machine translation[C]//Proceedings of the Empirical Methods in Natural Language Processing (EMNLP). Philadelphia, U.S.A., Association for Computational Linguistics Press: 2002: 304-311.
[3] Soricut R, Marcu D. Discourse generation using utility-trained coherence models[C]//Proceedings of the Joint Conference of 44th Annual Meeting of the Association for Computational Linguistics and 21st International Conference on Computational Linguistics (ACL-COLING). Sydney, Australia, Association for Computational Linguistics Press: 2006: 803-810.
[4] Barzilay R, Lee L. Catching the drift: probabilistic content models, with applications to generation and summarization[C]//Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL). Boston, Massachusetts, U.S.A., Association for Computational Linguistics Press: 2004:113-120.
[5] Lin Z H, Liu C, Ng H W, et al. Combining coherence models and machine translation evaluation metrics for summarization evaluation[C]//Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL). Jeju Island, Korea, Association for Computational Linguistics Press: 2012:1006-1014.
[6] Bollegala D, Okazaki N, Ishizuka M. A bottom-up approach to sentence ordering for multi-document summarization[C]//Proceedings of the Joint Conference of 44th Annual Meeting of the Association for Computational Linguistics and 21st International Conference on Computational Linguistics (ACL-COLING). Sydney, Australia, Association for Computational Linguistics Press: 2006: 385-392.
[7] Yannakoudakis H, Briscoe T. Modeling coherence in ESOL learner texts[C]//Proceedings of the 7th Workshop on the Innovative Use of NLP for Building Educational Applications. Montreal, Canada, Association for Computational Linguistics Press: 2012:33-43.
[8] Yannakoudakis H, Briscoe T, Medlock B. A new dataset and method for automatically grading ESOL texts[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL). Portland, Oregon, Association for Computational Linguistics Press: 2011:180-189.
[9] Burstein J, Tetreault J, Andreyev S. Using entity-based features to model coherence in student essays[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL). Uppsala, Sweden, Association for Computational Linguistics Press: 2010: 681-684.
[10] Higgins D, Burstin J, Marcu D, et al. Evaluating multiple aspects of coherence in student essays[C]//Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. (HLT-NAACL). Boston, Massachusetts, U.S.A., Association for Computational Linguistics Press: 2004: 185-192.
[11] Foltz P W, Walter K, Thomas K L. The measurement of textual coherence with latent semantic analysis[J]. Discourse Processes,1998,25(2&3):285-307.
[12] Louis A, Nenkova A. A coherence model based on syntactic patterns[C]//Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CNLL). Jeju Island, Korea, Association for Computational Linguistics Press: 2012: 1157-1168.
[13] Barzilay R, Lapata M. Modeling local coherence: an entity-based approach[J]. Computational Linguistics,2008,34(1):1-34.
[14] Barzilay R, Lapata M. Modeling local coherence: an entity-based approach[C]//Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL). Ann Arbor, Association for Computational Linguistics Press: 2005: 141-148.
[15] Lapata M, Barzilay R. Automatic evaluation of text coherence: models and representations[C]//Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI). Edinburgh, Scotland, U.K.: 2005: 1085-1090.
[16] Feng V W, Hirst G. Extending the entity-based coherence model with multiple ranks[C]//Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL). Avignon, France, Association for Computational Linguistics Press: 2012: 315-324.
[17] Lin Z H, Ng H T, Kan M Y. Automatically evaluating text coherence using discourse relations[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL). Portland, Oregon, Association for Computational Linguistics Press: 2011: 997-1006.
[18] Iida R, Tokunaga T. A metric for evaluating discourse coherence based on coreference resolution[C]//Proceedings of the 24th International Conference on Computational Linguistics (COLING). IIT Bombay, Mumbai, India: 2012:483-494.
[19] Elsner M, Charniak E. Coreference-inspired coherence modeling[C]//Proceedings of the Human Language Technology Conference of the 46th Association for Computational Linguistics (ACL: HLT). Columbus, Ohio, USA, Association for Computational Linguistics Press: 2008: 41-44.
[20] Halliday M A K. An Introduction to Functional Grammar[M]. New York: Oxford University Press Inc., 2004:1-700.
[21] 程晓堂. 从主位结构看英语作文的衔接与连贯[J]. 山东师范大学学报,2002,(2):94-98.
[22] Landauer T K, Dumais S T. A solution to platos problem: the latent semantic analysis theory of acquisition, induction and representation of knowledge[J]. Psychological Review, 1997,104(2):211-240.
[23] Grosz B J, Weinstein S, Joshi A K. Centering: a framework for modeling the local coherence of discourse[J]. Computational Linguistics, 1995,21(2):203-225.
[24] 胡壮麟. 语篇的衔接与连贯[M]. 上海:上海外语教育出版社,1994:1-235.
[25] Pitler E, Nenkova A. Using Syntax to Disambiguate Explicit Discourse Relations in Text[C]//Proceedings of the the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (ACL-IJCNLP).Suntec, Singapore, Association for Computational Linguistics Press: 2009: 13-16.