谢丽星1,周 明2,孙茂松1. 基于层次结构的多策略中文微博情感分析和特征抽取[J]. 中文信息学报, 2012, 26(1): 73-84.
XIE Lixing1, ZHOU Ming 2, SUN Maosong1. Hierarchical Structure Based Hybrid Approach to #br#
Sentiment Analysis of Chinese Micro Blog and Its Feature Extraction. , 2012, 26(1): 73-84.
Hierarchical Structure Based Hybrid Approach to #br#
Sentiment Analysis of Chinese Micro Blog and Its Feature Extraction
XIE Lixing1, ZHOU Ming 2, SUN Maosong1
1. State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for
Information Science and Technology, Department of Computer Science and Technology,
Tsinghua University, Beijing 100084, China; 2. Microsoft Research Asia, Beijing 100084, China
Abstract:With the development of Web 2.0, micro blog has drawn substantial attention from both academia and industry communities. This paper utilizes micro blog API from Sina and carries out sentiment analysis on Chinese micro blog. We compare performances of three method, based on the emoticon, the sentiment lexicon and the hybrid approach over hierarchical structure using SVM, respectively. Through the experiments, we find that SVM based hybrid approach achieves the best performance. Furthermore, we analyze the contribution of various features in this model, including target-independent features and target-dependent features. Experimental results show that SVM based method can gain an accuracy of 66.467% with target-independent features, and an improved accuracy of 67.283% with the addition of target-dependent features.
Key wordssina micro blog; sentiment analysis; SVM
[1] M.Q. Hu, B. Liu. Mining and Summarizing Customer Reviews[C]//ACM SIGKDD 2004: 168-177.
[2] Bo Pang, Lillian Lee. Opinion mining and sentiment analysis[C]//Foundations and Trends in Information Retrieval, 2(1-2): 1-135.
[3] M.Q. Hu, B. Liu. Opinion Extraction and Summarization on the Web[C]//AAAI06, Boston: 1621-1624.
[4] H. Yu, V. Hatzivassiloglou. Towards Answering Opinion Question: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences[C]//EMNLP03: 129-136.
[5] Bo Pang, Lillian Lee, Shivakumar Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques[C]//ACL02: 79-86.
[6] Bo Pang, Lillian Lee. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts[C]//ACL04: 271-278.
[7] E. Riloff, J. Wiebe. 2003. Learning extraction patterns for subjective expressions[C]//EMNLP03: 105-112.
[8] Glance, N., M. Hurst, K. Nigam, et al. 2005. Deriving marketing intelligence from online discussion[C]//SIGKDD05: 419-428.
[9] Wilson, T., J. Wiebe, P. Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis[C]//HLT-EMNLP05: 347-354.
[10] Luciano Barbosa, Junlan Feng. 2010. Robust Sentiment Detection on Twitter from Biased and Noisy Data[C]//Coling 2010 (poster paper): 36-44.
[11] Long Jiang, Mo Yu, Ming Zhou, et al. Target-dependent Twitter Sentiment Classification[C]//ACL 2011.
[12] Lun-Wei Ku, Tung-Ho Wu, Li-Ying Lee, et al. 2005. Construction of an Evaluation Corpus for Opinion Extraction[C]//In NTCIR-5 Japan, 2005: 513-520.
[13] S. Dasgupta, V. Ng. Mine the Easy, Classify the Hard: A Semi-Supervised Approach to Automatic Sentiment Classification[C]//ACL09: 701-709.
[14] Fangtao Li, Nathan Liu, Hongwei Jin, et al. Incorporating Reviewer and Product Information for Review Rating Prediction[C]//IJCAI 2011.
[15] V. Hatzivassiloglou, J. Wiebe. Effects of adjective orientation and gradability on sentence subjectivity[C]//COLING00: 299-305.
[16] P. Turney. 2002. Thumbs up or thumbs down Semantic Orientate-on Applied to Unsupervised Classification of reviews[C]//ACL02: 417-424.
[17] Tetsuya Nasukawa, Jeonghee Yi. 2003. Sentiment Analysis: Capturing Favorability Using Natural Language Processing[C]//Proceedings of the 2nd International Conference on Knowledge Capture: 70-77.
[18] Xiaowen Ding, Bing Liu. 2007. The Utility of Linguistic Rules in Opinion Mining[C]//SIGIR-2007 (poster paper), 811-812.
[19] Minqing Hu, Bing Liu. Mining and summarizing customer reviews[C]//KDD-2004 (full paper), Seattle, Washington, USA, Aug 22-25, pp.168-177.
[20] Dmitry Davidiv, Oren Tsur, Ari Rappoport. Enhanced Sentiment Learning Using Twitter Hash-tags and Smileys[C]//Coling 2010 (poster paper), pp.241-249.
[21] Alec Go, Richa Bhayani, Lei Huang. 2009. Twitter Sentiment Classification using Distant Supervision[R]. Technical report, Stanford Digital Library Technologies Project.
[22] Ravi Parikh, Matin Movassate. 2009. Sentiment Analysis of User-Generated Twitter Updates using Various Classification Techniques[R]. CS224N Final Report: 1-18.
[23] Zhao Jun, Liu Kang, Wang Gen. Adding Redundant Features for CRFs-based Sentence Sentiment Classification[C]//EMNLP 2008: 117-126.
[24] Lanjun Zhou, Yunqing Xia, Binyang Li et al. WIA-Opinmine System in NTCIR-8 MOAT Evaluation[C]//NTCIR-8 Workshop Meeting, 2010.
[25] 李寿山,黄居仁. 基于Stacking组合分类方法的中文情感分类研究[J]. 中文信息学报,2010,24(5): 56-61.