|
|
PCC: A Personalized Dialogue System with Single User Modeling |
GUO Yu2,3, DOU Zhicheng1,3, WEN Jirong3,4 |
1. Gaoling School of Artificial Intelligence, Renmin University of China, Beijing 100086, China;
2. School of Information, Renmin University of China, Beijing 100086, China;
3. Beijing Key Laboratory of Big Data Management and Analysis Method, Beijing 100086, China;
4. Key Laboratory of Data Engineering and Knowledge Engineering, Beijing 100086, China; |
|
|
Abstract Dialogue system is an important downstream task in the field of natural language processing (NLP) receiving more and more attention in recent years. In order to make the dialogue models more in line with the way of human dialogue and have better personalized modeling capabilities, this paper proposes a new personalized model PCC(a Personalized Chatbot with Convolution mechanism)to model a single user. At the encoder, text convolutional neural network (TextCNN) is used to process user history posts to obtain user interest information. At the decoder, we search for the reply that best matches the current question in the users historical answers through similarity, so as to guide the models generation together with user ID. Experimental results show that our model can improve the accuracy and diversity of the generation, and reveal the effectiveness of historical information in personalized modeling.
|
Received: 20 October 2020
|
|
|
|
|
[1]Young S, Gasic M, Thomson B, et al. POMDP-based statistical spoken dialog systems: A review[C]//Proceedings of the IEEE, 2013, 101(5): 1160-1179.
[2]Shawar B A, Atwell E. Chatbots: Are they really useful?[J]. Journal for Language Technology and Compuational Linguistics, 2007,22(1): 29-49.
[3]Pieraccini R, Suendermann D, Dayanidhi K, et al. Are we there yet? Research in commercial spoken dialog systems[C]//Proceedings of the 12th International Conference on Text Speech and Dialogue, 2009: 3-13.
[4]Serban I V, Sordoni A, Bengio Y, et al. Building end-to-end dialogue systems using generative hierarchical neural network models[J]. arXiv: Computation and Language, 2015.
[5]Shao L,Gouws S, Britz D, et al. Generating high-quality and informative conversation responses with sequence-to-sequence models[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2017: 2210-2219.
[6]Li J, Galley M, Brockett C, et al. A personal-based neural conversation model[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016: 994-1003.
[7]Kim Y . Convolutional neural networks for sentence classification[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2014: 1746-1751.
[8]Weizenbaum J. ELIZA: A computer program for the study of natural language communication between man and machine[J]. Communications of the ACM, 1966, 9(1): 36-45.
[9]Colby K M ,Hilf F D, Weber S, et al. Turing-like indistinguishability tests for the validation of a computer simulation of paranoid processes[J]. Artificial Intelligence, 1972, 3(1): 199-221.
[10]Koehn P,Och F J, Marcu D, et al. Statistical phrase-based translation[C]//Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003: 127-133.
[11]Cho K, Van Merrienboer B, Gulcehre C, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]. arXiv: Computation and Language, 2014.
[12]Shang L, Lu Z, Li H, et al. Neural responding machine for short-text conversation[J].arXiv: Computation and Language, 2015.
[13]Sutskever I, Vinyals O, Le Q V, et al. Sequence to sequence learning with neural networks[J]. arXiv: Computation and Language, 2014.
[14]Luan Y, Brockett C, Dolan B, et al. Multi-task learning for speaker-role adaptation in neural conversation models[J].arXiv: Computation and Language, 2017.
[15]Qian Q, Huang M, Zhao H, et al. Assigning personality/identity to a chatting machine for coherent conversation generation[J]. arXiv: Computation and Language, 2017.
[16]Zhang W, Zhu Q, Wang Y, et al. Neural personalized response generation as domain adaptation[J]. World Wide Web, 2019, 22(4): 1427-1446.
[17]Liu B, Xu Z, Sun C, et al. Content-oriented user modeling for personalized response ranking in chatbots[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2018, 26(1): 122-133.
[18]Chu E ,Vijayaraghavan P, Roy D, et al. Learning personas from dialogue with attentive memory networks.[J]. arXiv: Computation and Language, 2018.
[19]Li J, Galley M, Brockett C, et al. A diversity-promoting objective function for neural conversation models[J].arXiv: Computation and Language, 2015.
|
|
|
|