Pretrain Deep Models by Distant Supervision for Weibo Sentiment Analysis
WAN Shengxian 1;2; LAN Yanyan 1;2; GUO Jiafeng 1;2; CHENG Xueqi1;2
Author information+
1. CAS Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2. University of Chinese Academy of Sciences, Beijing 100190, China
Sentiment analysis (SA) is important in many applications such as commercial business and political election. The state-of-the-art methods of SA are based on shallow machine learning models. These methods are heavily dependent on feature engineering, however, the features for Weibo SA are difficult to be extracted manually. Deep learning (DL) can learn hierarchical representations from raw data automatically and has been applied for SA. Recently proposed DL techniques shown that one can train deep models successfully given enough supervised data. However, in Weibo SA, supervised data are usually too scarce. It is easy to obtain large scale distant supervision data in Weibo. In this paper, we proposed to pre-train deep models by distant supervision and used supervised data to fine-tune the deep models. This approach could take the advantages of distant supervision to learn good initial models while using supervised data to improve the models and to correct the errors brought by distant supervision. Experimental results on Sina Weibo dataset show that we can train deep models with small scale supervised data and obtain better results than shallow models.
WAN Shengxian ; LAN Yanyan ; GUO Jiafeng ; CHENG Xueqi;.
Pretrain Deep Models by Distant Supervision for Weibo Sentiment Analysis. Journal of Chinese Information Processing. 2017, 31(3): 191-197