Abstract:Discourse analysis is a hot topic in the field of Natural Language Processing. Discourse nuclearity recognition, a subtask of discourse analysis, focuses on recognizing the main and secondary content of a discourse, to better understand and grasp its core content. This paper focuses on the task of macro Chinese discourse nulcearity recognition and proposes a recognition method based on discourse topic. This method introduces the semantic interaction between different discourse units and that between the discourse unit and its topic to identify the nuclearity. Moreover, it applies the selection mechanism of the discourse topic to further improve the performance of nuclearity recognition.Experimental results on MCDTB show that the proposed method outperforms the state-of-the-art baselines.
[1] 徐凡, 朱巧明, 周国栋. 篇章分析技术综述[J]. 中文信息学报, 2013, 27(3): 20-33. [2] 褚晓敏, 朱巧明, 周国栋. 自然语言处理中的篇章主次关系研究[J]. 计算机学报, 2017,40(4): 842-860. [3] Mann W C, Thompson S A. Rhetorical structure theory: Toward a functional theory of text organization[J]. Text-Interdisciplinary Journal for the Study of Discourse, 1988, 8(3): 243-281. [4] Liakata M, Dobnik S, Saha S, et al. A discourse-driven content model for summarising scientific articles evaluated in a complex question answering task[C]//Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013: 747-757. [5] Cohan A, Goharian N. Scientific article summarization using citation-context and article's discourse structure[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon, Portugal: Association for Computational Linguistics, 2015: 390-400. [6] Zhou L, Li B, Gao W, et al. Unsupervised discovery of discourse relations for eliminating intra-sentence polarity ambiguities[C]//Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2011: 162-171. [7] Zou B, Zhou G, Zhu Q. Negation focus identification with contextual discourse information[C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014: 522-530. [8] Jiang F, Xu S, Chu X, et al. MCDTB: A macro-level Chinese Discourse TreeBank[C]//Proceedings of the 27th International Conference on Computational Linguistics, 2018: 3493-3504. [9] 蒋峰, 褚晓敏, 徐昇. 基于主题相似度的宏观篇章主次关系识别方法[J]. 中文信息学报, 2018, 32(1): 43-50. [10] Xu S, Li P, Zhou G, et al. Employing text matching network to recognise nuclearity in Chinese discourse[C]//Proceedings of the 27th International Conference on Computational Linguistics, 2018: 525-535. [11] Devlin J, Chang M W, Lee K, et al. Bert: Pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the 29th Conference of the North American Chapter of the Association of Computational Linguistics, 2018: 4171-4186. [12] Carlson L, Okurowski M E, Marcu D. RST discourse treebank[M]. Linguistic Data Consortium, University of Pennsylvania, 2002. [13] Li Y, Kong F, Zhou G. Building Chinese discourse corpus with connective-driven dependency tree structure[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014: 2105-2114. [14] Hernault H, Prendinger H, Ishizuka M. HILDA: A discourse parser using support vector machine classification[J]. Dialogue & Discourse, 2010, 1(3): 1-33. [15] Joty S, Carenini G, Ng R, et al. Combining intra-and multi-sentential rhetorical parsing for document-level discourse analysis[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013, 1: 486-496. [16] Li J, Li R, Hovy E. Recursive deep models for discourse parsing[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014: 2061-2069. [17] Li Q, Li T, Chang B. Discourse parsing with attention-based hierarchical neural networks[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 2016: 362-371. [18] Chu X, Wang Z, Zhu Q, et al. Recognizing nuclearity between Chinese discourse units[C]//Proceedings of the 2015 International Conference on Asian Language Processing. IEEE, 2015: 197-200. [19] Kong F, Wang H, Zhou G. A CDT-styled end-to-end Chinese discourse parser[M]. Natural Language Understanding and Intelligent Applications. Springer, Cham, 2016: 387-398. [20] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]//Proceedings of Advances in Neural Information Processing Systems, 2017: 5998-6008.