在互联网技术快速发展、网络信息爆炸的今天,通过计算机自动分析大规模文本中的态度倾向信息的技术,在企业商业智能系统、政府舆情分析等诸多领域有着广阔的应用空间和发展前景。同时,语义褒贬倾向研究也为文本分类、自动文摘、文本过滤等自然语言处理的研究提供了新的思路和手段。篇章语义倾向研究的基础工作是对词汇的褒贬倾向判别。本文基于HowNet,提出了两种词汇语义倾向性计算的方法:基于语义相似度的方法和基于语义相关场的方法。实验表明,本文的方法在汉语常用词中的效果较好,词频加权后的判别准确率可达80%以上,具有一定的实用价值。
Abstract
Nowadays , with the development of Internet and information explosion ,automated techniques for analyzing author’s attitudes towards specific events will make great effort to business intelligence and public opinion survey. Semantic orientation inference has become a meaningful tool , which could provide useful information for text classification , summarization , filtering etc. Measuring the semantic orientation of words would greatly contribute to predicting the author’s attitude in a passage. In this paper , a simple HowNet-based method for semantic orientation computation of Chinese words is introduced. Although this method requires only a few seed words , satisfactory results can still be obtained. And the performance is even better for frequently used words , with the frequency-weighted accuracy of above 80%.
关键词
计算机应用 /
中文信息处理 /
态度分类 /
语义倾向 /
知网
{{custom_keyword}} /
Key words
computer application /
Chinese information processing /
sentiment classification /
semantic orientation /
HowNet
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Vasileios Hatzivassiloglou , Kathleen R. McKeown. Predicting the semantic orientation of adjectives[A] . In : Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the ACL[C] , 1997 :174 - 181.
[2] Turney , Peter , Littman Michael. Measuring praise and criticism: Inference of semantic orientation from association [J] . ACM Transactions on Information Systems , 2003 , 21 (4) : 315 - 346.
[3] Turney Peter. Thumbs Up or Thumbs Down ? Semantic Orientation Applied to Unsupervised Classification of Reviews [A] . In : Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics[C] . 2002 : 417 - 424.
[4] Bo Pang , Lillian Lee , Shivakumar Vaithyanathan. Thumbs up ? Sentiment classification using machine learning techniques[A] . In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing[C] . 2002 : 79 - 86.
[5] Bo Pang , Lillian Lee. Seeing Stars : Exploiting Class Relationships for Sentiment Categorization with respect to Rating Scales[A] . ACL2005 , 115 - 124.
[6] K Dave , S Lawrence , DM Pennock. ,Mining the peanut gallery : opinion extraction and semantic classification of product reviews[A] . WWW2003 , 519 - 28.
[7] Bing Liu ,Minqing Hu ,Junsheng Cheng. Opinion observer : analyzing and comparing opinions on the Web [A] . WWW2005 , 324 - 351.
[8] HowNet [R] . HowNet’s Home Page. http://www.keenage.com.
[9] 刘群,李素建. 基于《知网》的词汇语义相似度的计算[A] . 第三届汉语词汇语义学研讨会,台北,2002.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金资助项目(60435020);上海市科技攻关计划资助项目(035115028)
{{custom_fund}}