基于文本的问题生成是从给定的句子或段落中生成相关问题。目前,主要采用序列到序列的神经网络模型来研究包含答案的句子生成问题,然而这些方法存在以下问题: ①生成的疑问词与答案类型不匹配; ②问题与答案的相关性不强。该文提出一个基于答案及其上下文信息的问题生成模型。该模型首先根据答案与上下文信息的关系确定与答案类型匹配的疑问词;然后利用答案及其上下文信息确定问题相关词,使问题尽可能使用原文中的词;最后结合原句作为输入来生成问题。相关实验表明,该文提出的模型性能明显优于基线系统。
Abstract
Text-based question generation is to generate related questions from a given sentence or paragraph. In recent years, sequence-to-sequence neural network models have been used to generate questions for sentences containing answers. However, these methods have the limitations: (1) the generated interrogatives do not match the answer type; and (2) the relevance of questions and the answer is not strong. This paper proposes a question generation model that based on answers and the contextual information. The model first determines interrogatives that match the answer type according to the relationship between the answer and the context information. Then, the model uses the answer and the context information to determine words related to questions, so that questions use words in the original text as much as possible. Finally, the model combines answer features, interrogatives, words related to questions with original sentences as inputs to generate a question. Experiments show that the proposed model is significantly better than the baseline systems.
关键词
问题生成 /
神经网络 /
问题相关词
{{custom_keyword}} /
Key words
question generation /
neural networks /
words related to questions
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Rus V, Wyse B, Piwek P, et al. A detailed account of the first question generation shared task evaluation challenge[J]. Dialogue & Discourse,2010: 177-204.
[2] Heilman M, Smith N A. Good question! statistical ranking for question generation[C]//Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics,2010: 609-617.
[3] Mannem P, Prasad R, Joshi A. Question generation from paragraphs at upenn: qgstec system description[C]//Proceedings of Question Generation 2010: The 3rd Workshop on Question Generation,2010: 84-91.
[4] Agarwal M, Shah R, Mannem P. Automatic question generation using discourse cues[C]//Proceedings of the 6th Workshop on Innovative Use of NLP for Building Educational Applications,2011: 1-9.
[5] Mazidi K, Nielsen R D. Pedagogical evaluation of automatically generated questions[C]//Proceedings of the 2014 International Conference on Intelligent Tutoring Systems,2014: 294-299.
[6] Serban I V, Garcia Duran A, Gulcehre C, et al.Generating factoid questions with recurrent neural networks: the 30m factoid question-answer corpus[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,2016: 588-598.
[7] Du X, Shao J, Cardie C. Learning to ask: neural question generation for reading comprehension[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics,2017:1342-1352 .
[8] Duan N, Tang D, Chen P, et al. Question generation for question answering[C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing,2017: 866-874.
[9] Zhou Q, Yang N, Wei F, et al. Neural question generation from text: a preliminary study[C]//Proceedings of Natural Language Processing and Chinese Computing,2017.
[10] Sun X, Liu J, Lyu Y, et al. Answer-focused and position-aware neural question generation[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing,2018: 3930-3939.
[11] Liu B, Zhao M, Niu D, et al. Learning to generate questions by learning what not to generate[J]. arXiv preprint arXiv: 1902.10418, 2019.
[12] Chen W, Aist G, Mostow J. Generating questions automatically from informational text[C]//Proceedings of the 2nd Workshop on Question Generation (AIED 2009),2009: 17-24.
[13] Pennington J, Socher R, Manning C. Glove: Global vectors for word representation[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing,2014.
[14] Luong T, Pham H, Manning C D. Effective approaches to attention based neural machine translation[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing,2015: 1412-1421.
[15] Gulcehre C, Ahn S, Nallapati R, et al. Pointing the unknown words[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,2016: 140-149.
[16] Mihalcea R, Tarau P. Textrank: bringing order into texts[C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing,2004: 404-411.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金(61673248,61806117)
{{custom_fund}}