苏玉兰,陈鑫,洪宇,朱朦朦,张民. 基于语义空间距离的相似问句识别方法研究[J]. 中文信息学报, 2021, 35(12): 36-46.
SU Yulan, CHEN Xin, HONG Yu, ZHU Mengmeng, ZHANG Min. A Semantic Distance Measure for Similar Question Identification. , 2021, 35(12): 36-46.
基于语义空间距离的相似问句识别方法研究
苏玉兰,陈鑫,洪宇,朱朦朦,张民
苏州大学计算机科学与技术学院,江苏 苏州 215006
A Semantic Distance Measure for Similar Question Identification
SU Yulan, CHEN Xin, HONG Yu, ZHU Mengmeng, ZHANG Min
School of Computer Science and Technology, Soochow University,Suzhou, Jiangsu 215006, China
摘要前沿相关研究将相似问句识别转化为二元问句匹配识别并取得很大进展。但是在自动问答系统的实际应用场景中存在大量数据,这些方法受限于二元问句匹配识别模式,导致时效性不高。针对这一问题,受人脸识别相关研究的启发,该文提出基于语义空间距离衡量的相似问句识别方法(Semantic Space Distance Method,SSDM)。该方法将相似问句识别作为多分类问题进行训练,通过利用人脸识别任务中Margin Softmax损失函数得到语义编码模型。该语义编码模型能够将相似问句在语义空间中聚合,不相似问句在语义空间中远离。SSDM方法将相似问句识别转化成语义空间中的向量距离计算,突破二元问句匹配的方式,保证了一定的高时效性,并且仍然能够在深层语义层面对相似问句进行识别。该方法在Biendata的ASQD数据集中实验测试,取得了比基线方法更优的性能,验证了SSDM方法的有效性。
Abstract:To address the low efficeiency limited by the binary classification made in the scenario of question answering system, this paper proposes a similar question identification method based on semantic space distance measure (SSDM), which is inspired by related research on face identification. This method obtains a semantic encoder by similar question multiclassification process via the Margin Softmax introduced from face identification community. The semantic encoder can aggregate similar question in the semantic space, and make dissimilar questions to be far away from each other in semantic space. SSDM method transforms similar questions identification into vector distance calculation in semantic space, and breaks the binary question matching and guarantees a certain high efficiency. We test the SSDM method in the ASQD dataset from Biendata, and the experimental results show that the SSDM method is better in performance than the baseline method.
[1]Xue X, Jeon J, Croft W B. Retrieval models for question and answer archives[C]//Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008: 475-482.
[2]Socher R, Huang E, Pennin J, et al. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection[C]//Proceedings of the Advances in Neural Information Processing Systems, 2011:801-809.
[3]Li H, Xu J. Semantic matching in search[J]. Foundations and Trends in Information Retrieval, 2014, 7(5): 343-469.
[4]Robertson S E, Walker S. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval[C]//Proceedings of the SIGIR. Springer, London, 1994: 232-241.
[5]Salton G, Wong A, Yang C S. A vector space model for automatic indexing[J]. Communications of the ACM, 1975, 18(11): 613-620.
[6]Katz B, Felshin S, Yuret D, et al. Omnibase: Uniform access to heterogeneous data for question answering[C]//Proceedings of the International Conference on Application of Natural Language to Information Systems. Springer, Berlin, Heidelberg, 2002: 230-234.
[7]Mollá D, Schwitter R, Rinaldi F, et al. NLP for answer extraction in technical domains[C]//Proceedings of the Workshop on Natural Language Processing for Question Answering, 2003:5-12.
[8]Sneiders E. Automated faq answering: Continued experience with shallow language understanding[C]//Proceedings of the Question Answering Systems. Papers from the AAAI Fall Symposium, 1999: 97-107.
[9]Zhou G, Huang J X. Modeling and learning distributed word representation with metadata for question retrieval[J]. IEEE Transactions on Knowledge and Data Engineering, 2017, 29(6): 1226-1239.
[10]Zhou G, Cai L, Zhao J, et al. Phrase-based translation model for question retrieval in community question answer archives[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 2011: 653-662.
[11]Ji Z, Xu F, Wang B, et al. Question-answer topic model for question retrieval in community question answering[C]//Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012: 2471-2474.
[12]Das A, Yenala H, Chinnakotla M, et al. Together we stand: Siamese networks for similar question retrieval[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016: 378-387.
[13]Kamineni A, Shrivastava M, Yenala H, et al. Siamese LSTM with convolutional similarity for similar question retrieval[C]//Proceedings of the 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing. IEEE, 2018: 1-7.
[14]Pang L, Lan Y, Guo J, et al. Textmatching as Image Recognition[J]. arXiv preprint arXiv:1602.06359, 2016.
[15]陈鑫, 李伟康, 洪宇, 等. 面向问句复述识别的多卷积自交互匹配方法研究[J]. 中文信息学报, 2019, 33(10): 99-108,118.
[16]Devlin J, Chang M W, Lee K, et al. Bert: Pre-training of deep bidirectional transformers for language understanding[J]. arXiv preprint arXiv:1810.04805, 2018.
[17]Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences[J]. arXiv preprint arXiv:1404.2188, 2014.
[18]Taigman Y, Yang M, Ranzato M A, et al. Deepface: Closing the gap to human-level performance in face verification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014: 1701-1708.
[19]Liu W, Wen Y, Yu Z, et al. Sphereface: Deep hypersphere embedding for face recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017: 212-220.
[20]Wang H, Wang Y, Zhou Z, et al. Cosface: Large margin cosine loss for deep face recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 5265-5274.
[21]Pennington J, Socher R, Manning C D. Glove: Global vectors for word representation[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2014: 1532-1543.
[22]Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]//Proceedings of the Advances in Neural Information Processing Systems. 2017: 5998-6008.
[23]袁和金, 张旭, 牛为华, 等. 融合注意力机制的多通道卷积与双向 GRU 模型的文本情感分析研究[J]. 中文信息学报, 2019, 33(10): 109-118.
[24]Liu W, Wen Y, Yu Z, et al. Large-margin softmax loss for convolutional neural networks[C]//Proceedings of the 23rd International Conference on Machine Learning, 2016, 2(3): 7.
[25]Wang F, Cheng J, Liu W, et al. Additive margin softmax for face verification[J]. IEEE Signal Processing Letters, 2018, 25(7): 926-930.
[26]Wang Z, Hamza W, Florian R. Bilateral multi-perspective matching for natural language sentences[J]. arXiv preprint arXiv:1702.03814, 2017.