边宁,韩先培,何苯,孙乐. 面向高考历史科目试题的自动答题系统[J]. 中文信息学报, 2022, 36(4): 137-145.
BIAN Ning, HAN Xianpei, HE Ben, SUN Le. Automatic Question Answering System for History Subject of National College Entrance Examination. , 2022, 36(4): 137-145.
Automatic Question Answering System for History Subject of National College Entrance Examination
BIAN Ning1,2, HAN Xianpei2, HE Ben1,2, SUN Le2
1.School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China; 2.Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
Abstract:The National College Entrance Examination is a standardized test that comprehensively evaluates the level of human knowledge and abilities, which serves as a more challenging question answering task. This paper designs an automatic question answering system for the history subject of National College Entrance Examination based on deep neural networks. One of the challenges for knowledge-enhanced question answering is the contextual sensitivity of knowledge: among the large amount of knowledge stored in the knowledge base, only a few pieces of knowledge are relevant to answering a certain question. In response to this challenge, this paper designs a knowledge-enhanced question answering system that combines knowledge retrieval and machine reading comprehension. Through the relevance ranking ability of the knowledge retrieval system and the knowledge positioning ability of the machine reading comprehension model, knowledge related to the question can be effectively discovered, thereby enhancing the performance of the question answering system. The experimental results show that the system can effectively answer questions in the history subject of National College Entrance Examination.
[1] Ferrucci A, Brown W, Chu-Carroll J, et al. Building Watson: an overview of the DeepQA project[J]. Ai Magazine, 2010, 31(3): 59-79. [2] Cheng G, Zhu W, Wang Z, et al. Taking up thegaokao challenge: an information retrieval approach[C]//Proceedings of the 25th International Joint Conference on Artificial Intelligence, 2016: 2479-2485. [3] Chen D, Fisch A, Weston J, et al.Reading Wikipedia to answer open-domain questions[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017: 1870-1879. [4] Lee K, Chang M W, Toutanova K. Latent retrieval for weakly supervised open domain question answering[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019: 6086-6096. [5] Karpukhin V, Oguz B, Min S, et al. Dense passage retrieval for open-domain question answering[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2020: 6769-6781. [6] Rajpurkar P, Zhang J, Lopyrev K, et al. SQuAD: 100,000+ questions for machine comprehension of text[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2016: 2383-2392. [7] Chen D, Bolton J, Manning C D.A thorough examination of the CNN/daily mail reading comprehension task[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016: 2358-2367. [8] Devlin J, Chang M W, Lee K, et al. BERT: pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1, 2019: 4171-4186. [9] Vinyals O, Fortunato M, Jaitly N. Pointer networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems-Volume 2, 2015: 2692-2700. [10] Zhang S, Zhao H, Wu Y, et al. DCMN+: Dual co-matching network for multi-choice reading comprehension[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(05): 9563-9570. [11] Shibuki H, Sakamoto K, Kano Y, et al. Overview of the NTCIR-11 QA-Lab task[C]//Proceedings of the 11th NTCIR Conference, 2014: 518-529. [12] Fujita A, Kameda A, Kawazoe A, et al. Overview oftodai robot project and evaluation framework of its NLP-based problem solving[J]. World History, 2014, 36(36): 148. [13] Wang D, Boytsov L, Araki J, et al. CMU multiple-choice question answering system at NTCIR-11 QA-Lab[C]//Proceedings of the 11th NTCIR Conference, 2014: 542-549. [14] Hosseini M J,Hajishirzi H, Etzioni O, et al. Learning to solve arithmetic word problems with verb categorization[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2014: 523-533. [15] 陈志刚,刘青文,林伟,等.一种面向初高中英语单选题的类人答题系统[J]. 中国科学技术大学学报,2016(3): 231-237. [16] Wu J, Liu X, Zhang X, et al. Master clinical medical knowledge at certificated-doctor-level with deep learning model[J]. Nature communications, 2018, 9(1): 1-7. [17] Zeng D, Liu K, Chen Y, et al. Distant supervision for relation extraction via piecewise convolutional neural networks[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2015: 1753-1762. [18] Huang Z, Wu A, Shen Y, et al.When retriever-reader meets scenario-based multiple-choice questions[C]//Proceedings of the Association for Computational Linguistics: EMNLP, 2021: 985-994.