Abstract:This paper analyses sentiment orientation of English sentences with modality. Sentences with modality are used widely in English, which comprise a significant proportion of typical reviews corpus. Due to the unique characteristics of modality, it is challenging for a general sentiment analysis system to handle these sentences. This paper identifies these sentences with the help of POS tagging and present a new modal feature that has been rarely discussed in previous studies. To further improve the accuracy, we develop a novel method which can effectively combine phrases sharing similar meanings of modality. The experimental results illustrate that the F-score of the proposed method increases by 4% and 7% than classic methods in the two-class and three-class sentiment orientation classifications, respectively.
[1] Murthy Ganapathibhotla, Liu Bing. Mining opinions in comparative sentences[C]//Proceedings of the 22nd International Conference on Computational Linguistics, Stroudsburg, 2008: 241-248. [2] Ramanathan Narayanan, Bing Liu, Alok Choudhary. Sentiment analysis of conditional sentences[C]//Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Stroudsburg, 2009: 180-189. [3] Pang Bo, Lee Lillian. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales[C]//Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, 2005:115-124. [4] G Szarvas, V Vincze, R Farkas, et al. Cross-genre and cross-domain detection of semantic uncertainty[J]. Computational Linguistics, 2012, 38(2):335-367. [5] Polanyi Livia, Zaenen Annie. Contextual valence shifters[M]. Computing attitude and affect in text: Theory and applications, 2006: 1-10. [6] Frank Robert Palmer. Mood and Modality[M]. Cambridge University Press. 2001 [7] 赵妍妍,秦兵,刘挺. 文本情感分析[J]. Journal of Software, 2010, 21(8): 1834-1848. [8] Kristina Toutanova, Christopher D Manning. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger[C]// Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000). 2000: 63-70. [9] George A Miller. WordNet: A Lexical Database for English[J].Communications of the ACM. 1995, 38(11): 39-41. [10] 姚天顺,张俐,高竹. WordNet综述[J]. 语言文字应用, 2004, (1): 27-32. [11] Bo Pang, Lillian Lee, Shivakumar Vaithyanathan. Thumbs up? Sentiment Classification using Machine Learning Techniques[C]//Proceedings of the ACL-02 conference on Empirical methods in natural language processing. Philadelphia: Association for Computational Linguistics, 2002: 79-86. [12] Mingqing Hu, Bing Liu. Mining and summarizing customer reviews[C]//Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining. 2004: 168-177. [13] T Joachims, Making large-Scale SVM Learning Practical. Advances in Kernel Methods—Support Vector Learning, B Schlkopf, C Burges, A Smola (ed.)[M]. MIT-Press. 1998. [14] Mark Hall, Eibe Frank, Geoffrey Holmes, et al. The WEKA Data Mining Software: An Update[J]. ACM SIGKDD Explorations Newsletter, 2009, 11(1):10-18. [15] Liu Bing. Sentiment analysis and opinion mining[J]. Synthesis Lectures on Human Language Technologies, 2012,5(1):1-167. [16] Pang Bo, Lee Lillian. Opinion mining and sentiment analysis[J]. Foundations and trends in information retrieval, 2008,2(1-2):1-135. [17] Peter D Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews classification of reviews[C]// Proceedings of the 40th Annual Meeting on ACL, Philadelphia, July 2002: 417-424. [18] 唐慧丰,谭松波,程学旗. 基于监督学习的中文情感分类技术比较研究[J]. 中文信息学报, 2007, 21(6):88-94. [19] Hong Yu, Vasileios Hatzivassiloglou. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences[C]//Proceedings of Stroudsburg, 2003: 129-136. [20] 杨源,林鸿飞. 基于产品属性的条件句倾向性分析[J].中文信息学报, 2011, 25(3):86-92.