网络新闻口语评论文本中人物对象识别方法

林 琛, 李弼程, 周 杰

PDF(813 KB)
PDF(813 KB)
中文信息学报 ›› 2010, Vol. 24 ›› Issue (4) : 25-32.
综述

网络新闻口语评论文本中人物对象识别方法

  • 林 琛, 李弼程, 周 杰
作者信息 +

Recognition of Person Names in Netnews Oral Reviews

  • LIN Chen, LI Bicheng, ZHOU Jie
Author information +
History +

摘要

网络新闻口语评论文本中的人物对象是网络舆情的重要内容,是口语评论情感倾向性分析的基础。该文结合新闻口语评论中人物对象特点,提出了一种有效的人物对象自动识别方法。该方法首先在分词基础上,采用多频率综合判别对单字作为人物对象的可靠度进行评估,以获得稳定的识别线索;其次,根据线索划定处理窗口,利用改进频繁项挖掘算法,从窗口中提取候选人物对象;最后,对结果中存在的冗余进行优化处理。实验结果表明,新方法能够完整、有效地识别网络新闻口语评论文本中的人物对象。

Abstract

The person is an important object of comment in the in Netnews oral reviews, and thus the identificaiton of persona names is essential to the sentiment analysis for oral reviews. This paper resentss an efficient method for identifying the person namesbased on the text features in Netnews oral reviews. The method firstly evaluates the reliability of a word as a part of personal objects via the multi-frequency as the discriminating clue; Secondly, certain windows are set up according to the clues and an improved algorithm of frequent pattern mining are applied to get the candidates. Lastly, the results are optimized by a series of ways. The experimental results display the method can efficiently identify the full person names commented in Netnews oral reviews.
Key wordscomputer application; Chinese information processing; public opinion in Internet;oral reviews;person names;frequent pattern mining

关键词

计算机应用 / 中文信息处理 / 网络舆情 / 口语评论 / 人物对象 / 频繁项挖掘

Key words

computer application / Chinese information processing / public opinion in Internet / oral reviews / person names / frequent pattern mining

引用本文

导出引用
林 琛, 李弼程, 周 杰. 网络新闻口语评论文本中人物对象识别方法. 中文信息学报. 2010, 24(4): 25-32
LIN Chen, LI Bicheng, ZHOU Jie. Recognition of Person Names in Netnews Oral Reviews. Journal of Chinese Information Processing. 2010, 24(4): 25-32

参考文献

[1] 刘毅.网络舆情研究概论[M].天津: 天津人民出版社,2007.
[2] 韩运荣,喻国明.舆论学[M]. 北京: 中国传媒大学出版社,2005.
[3] 吕雅娟,等.基于分解与动态规划策略的汉语未登录词识别[J].中文信息学报, 2001,15(1): 33-38.
[4] Jin Rong,Yan Rong,Zhmag Jian.A faster iterative scaling algorithm for conditional exponential model[C]//Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003),Washington DC,2003.
[5] 张华平,刘群.基于角色标注的中国人名自动识别研究[J].计算机学报,2004,27(1):85-91.
[6] 王振华,孔祥龙,等. 结合决策树方法的中文姓名识别[J]. 中文信息学报,2004,18(6):10-15.
[7] 李中国,刘颖. 基于边界模版和局部统计相结合的中国人名识别[J].中文信息学报,2006,20(5):44-50.
[8] 季姬,罗振声.基于统计与规则的中文姓名自动辨识别[J].语言文字应用,2001,31(1): 14-18.
[9] X.Cheng. Automatic topic term detection and sentiment classification for opinion mining[D]. Master Thesis. Saarbr cken, Germany: The University of Sarrland, 2007.
[10] 姚天昉,娄德成.汉语语句主题语义倾向分析方法的研究[J].中文信息学报,2007,21(5):73-79.
[11] Ana-Maria Popescu and Oren Etzioni. Extracting Product Features and Opinion from Reviews[C]// Proceedings of the Human Language Technology Conference on Empirical Methods in Natural Language Processing. Vancouver, Canada, 2005:339-346.
[12] J.Yi,Nasukawa, R. Bunescu, and E.Niblack. Sentiment Analyzer: Extracting sentiments about a given topic using natural languages processing Techniques [C]// Proceeding of the 3rd IEEE International Conference on Data Mining. Melbourne, USA.2003:427-434.
[13] 刘非凡,赵军,等.面向商务信息抽取的产品命名实体识别研究[J].中文信息学报, 2006, 20 (1):7-13.
[14] Han J W, Kember M. Data Mining Concepts and Techniques [M]. Beijing: Higher Education Press. 2001:240-243.
[15] 朱明. 数据挖掘[M]. 合肥: 中国科学技术大学出版社. 2002:135-141.

基金

国家863计划资助项目(2007AA01Z439)
PDF(813 KB)

Accesses

Citation

Detail

段落导航
相关文章

/