Analysis of Data Parallel Training of Neural Language Models via Multiple GPUs
LI Yinqiao, HAN Ambyer, XIAO Tong, BO Le, ZHU Jingbo, ZHANG Li
Analysis of Data Parallel Training of Neural Language Models via Multiple GPUs
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 | 〉 |