Sentiment Analysis and Social Computing
XU Yuemei, SHI Lingyu, CAI Lianqiao
2022, 36(2): 129-141.
In cross-lingual sentiment analysis, pre-trained Bilingual Word Embedding (BWE) dictionaries are leveraged to generate text vector representations of source and target languages. In order to obtain a qualified BWE dictionary, a novel model is proposed to utilize the affective features in source language as supervised information for word representation generation. The representations we pre-trained contain both semantic and emotional information , suitable for sentiment prediction in target language. In our cross-lingual sentiment analysis experiments, the source language is English, and the target languages include Chinese, French, German, Japanese, Korean and Thai. The results show that the accuracy of our proposed model is about 9.3% higher than Machine Translation (MT) based method, and 8.7% higher than parallel method without sentiment-aware representations. As expected, the experiments on English and German sentiment classification achieved best performance, for both languages belong to the Germanic language group and are more similar in grammar and semantics.