程南昌,侯敏,滕永林. 基于文本特征的短文本倾向性分析研究[J]. 中文信息学报, 2015, 29(2): 163-169.
CHENG Nanchang, HOU Min, TENG Yonglin. Short Text Attitude Analysis Based on Textual Characteristics. , 2015, 29(2): 163-169.
Short Text Attitude Analysis Based on Textual Characteristics
CHENG Nanchang1, HOU Min2, TENG Yonglin2
1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; 2. Broadcast Media Language Branch, National Langage Resources Monitoring and Research Center, Communication University of China, Beijing 100024, China
Abstract:This paper takes the online product reviews as samples to investigate the characteristics and strategies in the attitude analysis of short texts. According to different performances of decisive factors of attitude polarity, the online review texts can be divided into four categories: the text containing overt summery sentence, the texts containing covert summary sentence, the texts containing characteristic words and the normal texts. Different strategies are established to deal with different types of texts, and a text attitude analysis system CUCsas is constructed based on dictionaries and rules. The system generates promising results in the Fourth Chinese Opinion Analysis Evaluation- COAE2012.
[1] Liu B, Hu M, Cheng J. Opinion observer: analyzing and comparing opinions on the Web[C]//Proceedings of the 14th international conference on World Wide Web. ACM, 2005: 342-351. [2] Pang B, Lee L. Opinion mining and sentiment analysis[J]. Foundations and trends in information retrieval, 2008, 2(1-2): 1-135. [3] 赵妍妍, 秦兵, 刘挺. 文本情感分析[J]. 软件学报, 2010, 21(8): 1834-1848. [4] 姚天昉, 程希文, 徐飞玉, 等. 文本意见挖掘综述[J]. 中文信息学报, 2008, 22(3): 71-80. [5] 刘康,王素格,廖祥文,等.第一届中文倾向性分析评测技术报告[C]//第一届中文倾向性分析评测会议(COAE2008), 北京, 2008: 1-20. [6] Kamps J, Marx M J, Mokken R J, et al. Using wordnet to measure semantic orientations of adjectives[J]. 2004. [7] 朱嫣岚, 闵锦, 周雅倩, 等. 基于 HowNet 的词汇语义倾向计算[J]. 中文信息学报, 2006, 20(1): 14-20. [8] Kim S M, Hovy E. Determining the sentiment of opinions[C]//Proceedings of the 20th international conference on Computational Linguistics. Association for Computational Linguistics, 2004: 1367. [9] Yuen R W M, Chan T Y W, Lai T B Y, et al. Morpheme-based derivation of bipolar semantic orientation of Chinese words[C]//Proceedings of the 20th international conference on Computational Linguistics. Association for Computational Linguistics, 2004: 1008. [10] Whitelaw C, Garg N, Argamon S. Using appraisal groups for sentiment analysis[C]//Proceedings of the 14th ACM international conference on Information and knowledge management. ACM, 2005: 625-631. [11] 李钝, 曹付元, 曹元大, 等. 基于短语模式的文本情感分类研究[J]. 计算机科学, 2008, 35(4): 132-134. [12] 李雪燕,侯明午,侯敏,等. 汉语否定形式的倾向性研究[C]. 第四届中文倾向性分析(COAE2012)评测研讨会论文. 南昌,2012. [13] 姚天昉, 娄德成. 汉语语句主题语义倾向分析方法的研究[J]. 中文信息学报, 2007, 21(5): 73-79. [14] 刘康, 赵军. 基于层叠 CRFs 模型的句子褒贬度分析研究[J]. 中文信息学报, 2008, 22(1): 123-128. [15] 杨江, 侯敏, 王宁. 基于浅层篇章结构的评论文倾向性分析[J]. 中文信息学报, 2011, 25(2): 83-88. [16] Pang B, Lee L, Vaithyanathan S. Thumbs up?: sentiment classification using machine learning techniques[C]//Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, 2002: 79-86. [17] Turney P D. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews[C]//Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 2002: 417-424. [18] 侯敏,滕永林,郑双美,等.话题型微博语言特点及其倾向性分析策略研究[J].语言文字应用,2013(2): 135-143. [19] 周红照,侯明午,侯敏,等. 基于语义分类的比较句识别与比较要素抽取研究[C]//第四届中文倾向性分析(COAE2012)评测研讨会论文.南昌, 2012. [20] 唐都钰,石秋慧. HITIRSYS:COAE2012情感分析系统[C]//第四届中文倾向性分析(COAE2012)评测研讨会论文. 南昌,2012.