DQN-based Policy Learning for Open Domain Multi-turn Dialogues
SONG Haoyu, ZHANG Weinan, LIU Ting
中文信息学报 . 2018, (7): 99 -108,136 .