Recognizing Textual Entailment Based on Inference Phenomena
REN Han1,2, FENG Wenhe1,2, LIU Maofu2,3, WAN Jing2
1. Laboratory of Language Engineering and Computing, Guangdong University of Foreign Studies, Guangzhou, Guangdong 510006, China; 2. Hubei Research Center for Language and Intelligent Information Processing, Wuhan University, Wuhan, Hubei 430072,China; 3. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, Hubei 430065, China; 4. Center for Lexicographical Studies, Guangdong University of Foreign Studies, Guangzhou, Guangdoing 510420, China
Abstract:This paper introduces an approach of textual entailment recognition based on language phenomena. The approach asopts a joint classification model for language phenomenon recognition and entailment recognition, so as to learn two highly relevant tasks, avoiding error propagation in pipeline strategy. For language phenomenon recognition, 22 specific and 20 general features are employed. And for enhancing the generalization of random forest, a feature selection method is adopted on building trees of random forest. Experimental results show that the joint classification model based on random forest recognizes language phenomena and entailment relation effectively.
[1] Dagan I, Glickman O. Probabilistic Textual Entailment: Generic Applied Modeling of Language Variability[C]//Proceedings of PASCAL Workshop on Learning Methods for Text Understanding and Mining. 2004. [2] Androutsopoulos I, Malakasiotis P. A Survey of Paraphrasing and Textul Entailment Methods[J]. Journal of Artificial Intelligence Research, 2010, 38(1): 135-187. [3] Dagan I, Dolan B. Recognizing textual entailment: Rational, evaluation and approaches[J]. Natural Language Engineering, 2009, 15(4): i-xvii. [4] Cabrio E. Specialized Entailment Engines: Approaching Linguistic Aspects of Textual Entailment[C]//Proceedings of the 14th International Conference on Applications of Natural Language to Information Systems, 2009: 305-308. [5] Bentivogli L, Cabrio E, Dagan I, et al. Building textual entailment specialized data sets: a methodology for isolating linguistic phenomena relevant to inference[C]//Proceedings of the International Conference on Language Resources and Evaluation. 2010: 3542-3549. [6] Kaneko K, Miyao Y, Bekki D. Building Japanese Textual Entailment Specialized Data Sets for Inference of Basic Sentence Relations[C]//Proceedings of the 51st Annual Meeting of the Association of Computational Linguistics 2013: 273-277. [7] Sammons M, Vydiswaran V G V, Roth D. “Ask not what Textual Entailment can do for you…”[C]//Proceedings of the Annual Meeting of the Association for Computational Linguistics. 2010: 1199-1208. [8] Garoufi K. Towards a better understanding of applied textual entailment: Annotation and evaluation of the RTE-2 dataset. Germany, Saarland University. Master Thesis. 2007. [9] Matsuyoshi S, Miyao Y, Shibata T, et al. Overview of the NTCIR-11 Recognizing Inference in TExt and Validation (RITE-VAL) Task[C]//Proceedings of the 11th NTCIR Conference. 2014: 223-232. [10] Huang H H, Chang K C, Chen H H. Modeling Human Inference Process for Textual Entailment Recognition[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 2013: 446-450. [11] 江敏, 肖诗斌, 王弘蔚, 等. 一种改进的基于知网的词语语义相似度计算[J]. 中文信息学报, 2008, 22(5): 84-89. [12] 张志昌, 周慧霞, 姚东任, 等. 基于词向量的中文词汇蕴涵关系识别[J]. 计算机工程, 2016, 42(2): 169-174. [13] Ren H, Wu H, Tan X, et al. The WHUTE System in NTCIR-11 RITE Task[C]//Proceedings of the 11th NTCIR Conference. 2014: 309-316.