|
|
A Semantic Distance Measure for Similar Question Identification |
SU Yulan, CHEN Xin, HONG Yu, ZHU Mengmeng, ZHANG Min |
School of Computer Science and Technology, Soochow University,Suzhou, Jiangsu 215006, China |
|
|
Abstract To address the low efficeiency limited by the binary classification made in the scenario of question answering system, this paper proposes a similar question identification method based on semantic space distance measure (SSDM), which is inspired by related research on face identification. This method obtains a semantic encoder by similar question multiclassification process via the Margin Softmax introduced from face identification community. The semantic encoder can aggregate similar question in the semantic space, and make dissimilar questions to be far away from each other in semantic space. SSDM method transforms similar questions identification into vector distance calculation in semantic space, and breaks the binary question matching and guarantees a certain high efficiency. We test the SSDM method in the ASQD dataset from Biendata, and the experimental results show that the SSDM method is better in performance than the baseline method.
|
Received: 22 March 2019
|
|
|
|
|
[1]Xue X, Jeon J, Croft W B. Retrieval models for question and answer archives[C]//Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008: 475-482.
[2]Socher R, Huang E, Pennin J, et al. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection[C]//Proceedings of the Advances in Neural Information Processing Systems, 2011:801-809.
[3]Li H, Xu J. Semantic matching in search[J]. Foundations and Trends in Information Retrieval, 2014, 7(5): 343-469.
[4]Robertson S E, Walker S. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval[C]//Proceedings of the SIGIR. Springer, London, 1994: 232-241.
[5]Salton G, Wong A, Yang C S. A vector space model for automatic indexing[J]. Communications of the ACM, 1975, 18(11): 613-620.
[6]Katz B, Felshin S, Yuret D, et al. Omnibase: Uniform access to heterogeneous data for question answering[C]//Proceedings of the International Conference on Application of Natural Language to Information Systems. Springer, Berlin, Heidelberg, 2002: 230-234.
[7]Mollá D, Schwitter R, Rinaldi F, et al. NLP for answer extraction in technical domains[C]//Proceedings of the Workshop on Natural Language Processing for Question Answering, 2003:5-12.
[8]Sneiders E. Automated faq answering: Continued experience with shallow language understanding[C]//Proceedings of the Question Answering Systems. Papers from the AAAI Fall Symposium, 1999: 97-107.
[9]Zhou G, Huang J X. Modeling and learning distributed word representation with metadata for question retrieval[J]. IEEE Transactions on Knowledge and Data Engineering, 2017, 29(6): 1226-1239.
[10]Zhou G, Cai L, Zhao J, et al. Phrase-based translation model for question retrieval in community question answer archives[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 2011: 653-662.
[11]Ji Z, Xu F, Wang B, et al. Question-answer topic model for question retrieval in community question answering[C]//Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012: 2471-2474.
[12]Das A, Yenala H, Chinnakotla M, et al. Together we stand: Siamese networks for similar question retrieval[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016: 378-387.
[13]Kamineni A, Shrivastava M, Yenala H, et al. Siamese LSTM with convolutional similarity for similar question retrieval[C]//Proceedings of the 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing. IEEE, 2018: 1-7.
[14]Pang L, Lan Y, Guo J, et al. Textmatching as Image Recognition[J]. arXiv preprint arXiv:1602.06359, 2016.
[15]陈鑫, 李伟康, 洪宇, 等. 面向问句复述识别的多卷积自交互匹配方法研究[J]. 中文信息学报, 2019, 33(10): 99-108,118.
[16]Devlin J, Chang M W, Lee K, et al. Bert: Pre-training of deep bidirectional transformers for language understanding[J]. arXiv preprint arXiv:1810.04805, 2018.
[17]Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences[J]. arXiv preprint arXiv:1404.2188, 2014.
[18]Taigman Y, Yang M, Ranzato M A, et al. Deepface: Closing the gap to human-level performance in face verification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014: 1701-1708.
[19]Liu W, Wen Y, Yu Z, et al. Sphereface: Deep hypersphere embedding for face recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017: 212-220.
[20]Wang H, Wang Y, Zhou Z, et al. Cosface: Large margin cosine loss for deep face recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 5265-5274.
[21]Pennington J, Socher R, Manning C D. Glove: Global vectors for word representation[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2014: 1532-1543.
[22]Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]//Proceedings of the Advances in Neural Information Processing Systems. 2017: 5998-6008.
[23]袁和金, 张旭, 牛为华, 等. 融合注意力机制的多通道卷积与双向 GRU 模型的文本情感分析研究[J]. 中文信息学报, 2019, 33(10): 109-118.
[24]Liu W, Wen Y, Yu Z, et al. Large-margin softmax loss for convolutional neural networks[C]//Proceedings of the 23rd International Conference on Machine Learning, 2016, 2(3): 7.
[25]Wang F, Cheng J, Liu W, et al. Additive margin softmax for face verification[J]. IEEE Signal Processing Letters, 2018, 25(7): 926-930.
[26]Wang Z, Hamza W, Florian R. Bilateral multi-perspective matching for natural language sentences[J]. arXiv preprint arXiv:1702.03814, 2017.
|
|
|
|