Analysis of Data Parallel Training of Neural Language Models via Multiple GPUs
LI Yinqiao, HAN Ambyer, XIAO Tong, BO Le, ZHU Jingbo, ZHANG Li
中文信息学报 . 2018, (7): 37 -43 .