A Survey on Answer Selection Technology for Factoid Question

DONG Yan-ju, CAI Dong-feng, BAI Yu

Journal of Chinese Information Processing, 2009, Vol. 23, Issue (1): 86.
Survey

Abstract

Answer selection is a crucial step in a question answering system: its task is to choose the best answer from a set of candidate answers and return it to the user. The main research issues are the criteria, methods, and evaluation of answer selection. This paper first introduces the main answer selection criteria and analyzes the relationship between these criteria and question answering evaluations. It then groups answer selection strategies into redundancy-based, similarity-based, and reasoning-based strategies, and summarizes the main methods and characteristics of each. Next, it introduces the evaluation measures for answer selection and the Answer Validation Exercise. Finally, it discusses the major problems facing answer selection and the prospects for future research.

Key words

computer application / Chinese information processing / overview / natural language processing / question answering / answer selection / answer validation / answer selection criteria

Cite this article

DONG Yan-ju, CAI Dong-feng, BAI Yu. A Survey on Answer Selection Technology for Factoid Question. Journal of Chinese Information Processing. 2009, 23(1): 86

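Funding

Key Project of Science and Technology Research of the Ministry of Education (207148); Natural Science Foundation of Liaoning Province (20062006)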