评价对象抽取是情感分析任务中一个重要的子任务。该文使用基于条件随机场模型的监督学习方法实现英文的评价对象抽取。为了更好的捕捉评价对象和情感词之间的关系,引入句法分析用以加入丰富的句法特征提高评价对象抽取性能。实验中,我们在两个不同的数据集上考查了句法特征对评价对象抽取性能的影响,并做了详细的分析比较。实验结果表明,将句法特征应用在评价对象抽取任务中能够取得不错的效果,明显提高了评价对象的抽取召回率。
Abstract
Opinion target extraction is an important sub-task of the sentiment analysis. This paper employs a supervised model to extract English opinion target with Conditional Random Fileds (CRFs). To better capture the relationship between the opinion targets and opinion expression, we add the syntactic features extracted from the parsing trees. In the experiments, two different data sets are used to evaluate the proposed approach. The experimental results demonstrate that using syntactic features is effective and it could improve the recall of opinion target extraction significantly.
关键词
情感分析 /
评价对象 /
句法特征 /
条件随机场
{{custom_keyword}} /
Key words
Sentiment analysis /
Opinion target /
syntactic features /
CRF
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Pang B, Lee L. Opinion Mining and Sentiment Analysis[J]. Foundations and Trends in Information Retrieval, 2008, 2(1-2) :1-135.
[2] Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment Classification using Machine Learning Techniques[C]//Proceedings of the EMNLP 2002. 2002: 79-86.
[3] 赵妍妍,秦兵,刘挺.文本情感分析[J]. 软件学报, 2010, 21(8):1834-1848.
[4] Kim S,Hovy E. Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text[C]//Proceedings of the ACL Workshop on Sentiment and Subjectivity in Text. 2006: 1-8.
[5] Hu M, Liu B. Mining Opinion Features in Customer Reviews[C]//Proceedings of the AAAI-2004. 2004: 755-760.
[6] Jakob N, Gurevych I. Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields[C]//Proceedings of the EMNLP-2010. 2010: 1035-1045.
[7] Li B, Zhou L, Feng S, et al. A Unified Graph Model for Sentence-based Opinion Retrieval[C]//Proceedings of the ACL-2010. 2010:1367-1375.
[8] 王荣洋,鞠久鹏,李寿山,等. 基于CRFs的评价对象抽取特征研究[J]. 中文信息学报,2012,26(2): 56-61.
[9] Popescu A, Nguyen B, Etzioni O. OPINE: Extracting Product Features and Opinions from Reviews[C]//Proceedings of HLT/EMNLP-2005. 2005:32-33.
[10] Zhuang L, Jing F, Zhu X. Movie review mining and summarization[C]//Proceedings of the CIKM-2006. 2006: 43-50.
[11] Kessler J, Nicolov N. Targeting Sentiment Expressions through Supervised Ranking of Linguistic Configurations[C]//Proceedings of the Third International AAAI Conference on Weblogs and Social Media, San Jose, California, USA, May.2009: 90-97.
[12] Putthividhya D, Hu J. Bootstrapped Named Entity Recognition for Product Attribute Extraction[C]//Proceedings of the EMNLP-2011. 2011: 1557-1567.
[13] Toprak C, Jakob N, Gurevych I. Sentence and Expression Level Annotation of Opinions in User-Generated Discourse[C]//Proceedings of the ACL-2010. 2010: 575-584.
[14] Jiang Z, Ng H. Semantic Role Labeling of NomBank: A Maximum Entropy Approach[C]//Proceedings of the EMNLP-2006.2006:138-145.
[15] 宗成庆. 统计自然语言处理[M]. 北京: 清华大学出版社,2008:1-475.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金(61003155),国家自然科学基金(60873150),模式识别国家重点实验室开发课题基金。
{{custom_fund}}