WANG Su-ge, LI De-yu, WEI Ying-jie, SONG Xiao-lei. A Synonyms Based Word Sentiment Orientation Discriminating[J]. Journal of Chinese Information Processing, 2009, 23(5): 68-75.
A Synonyms Based Word Sentiment Orientation Discriminating
WANG Su-ge1,3, LI De-yu2,3, WEI Ying-jie4, SONG Xiao-lei1
1. School of Mathematics Science, Shanxi University, Taiyuan 030006, China; 2. School of Computer & Information Technology, Shanxi University, Taiyuan 030006, China; 3. Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan 030006, China; 4. Science Press, Beijing 100717, China
Abstract: The sentiment orientation of a word directly influences the sentiment orientation of higher-level linguistic units such as phrases, sentences, paragraphs, and whole texts. This paper proposes a paradigm word selection method based on a word's category-distinguishing ability and a sentiment word table. Because a word usually shares the sentiment orientation of its synonyms, we further propose a synonym-based method for discriminating word sentiment orientation. The method alleviates the data sparseness problem to a certain extent. Experimental results indicate that the proposed method outperforms the method that relies only on the target word and the paradigm words.
Key words: computer application; Chinese information processing; word sentiment orientation; paradigm word; relation intensity; synonym
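The idea of scoring a word through its synonyms can be illustrated with a minimal sketch. The abstract does not give the paper's paradigm word selection procedure or its relation-intensity measure, so the code below substitutes a smoothed PMI-style association score, in the spirit of Turney's semantic-orientation work, and simply averages that score over a word and its synonyms. The function names, the smoothing constant, the toy corpus, and the synonym table are all hypothetical and serve only as illustration.

```python
import math
from collections import Counter
from itertools import combinations


def build_counts(documents):
    """Count document frequency of each word and of each unordered word pair."""
    word_df, pair_df = Counter(), Counter()
    for doc in documents:
        vocab = sorted(set(doc))
        word_df.update(vocab)
        pair_df.update(combinations(vocab, 2))  # pairs come out in sorted order
    return word_df, pair_df, len(documents)


def pmi(a, b, counts, smoothing=0.01):
    """Smoothed pointwise mutual information (assumed stand-in for relation intensity)."""
    word_df, pair_df, n_docs = counts
    pair = pair_df[tuple(sorted((a, b)))] + smoothing
    return math.log2(pair * n_docs / ((word_df[a] + smoothing) * (word_df[b] + smoothing)))


def orientation(word, positives, negatives, counts):
    """Baseline: association with positive paradigm words minus negative ones."""
    pos = sum(pmi(word, p, counts) for p in positives)
    neg = sum(pmi(word, q, counts) for q in negatives)
    return pos - neg


def synonym_orientation(word, synonyms, positives, negatives, counts):
    """Average the baseline score over the word and its listed synonyms."""
    group = [word, *synonyms.get(word, [])]
    return sum(orientation(w, positives, negatives, counts) for w in group) / len(group)


# Toy corpus: "superb" never co-occurs with a paradigm word (sparse evidence).
documents = [
    ["good", "excellent"],
    ["good", "excellent"],
    ["bad", "terrible"],
    ["bad", "terrible"],
    ["superb", "view"],
]
counts = build_counts(documents)
positives, negatives = ["good"], ["bad"]
synonyms = {"superb": ["excellent"]}  # hypothetical synonym table

direct = orientation("superb", positives, negatives, counts)
pooled = synonym_orientation("superb", synonyms, positives, negatives, counts)
print(f"direct score: {direct:.3f}, synonym-based score: {pooled:.3f}")
```

In this toy setting the direct score carries no signal, because "superb" has no co-occurrence with either paradigm word, while the synonym-based score turns positive by borrowing evidence from "excellent". That is the sparseness effect the abstract refers to, shown here only under the stated assumptions.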