Abstract:An approach is introduced to acquire fake Chinese reviews semi-automatically. It mainly includes a platform to get fake reviews, a syntactic parser, and a sentiment analysis component. Emphasis is on a syntactic based sentiment pair extraction, <comment object, comment phrase>. Finally, we analyze some experimental results and give some suggestions to improve the accuracy of sentiment analysis.
[1] N Jindal, B Liu. Opinion Spam and Analysis[C]//Proceedings of WSDM’08. 2008: 219-230.
[2] Jindal N, Liu B. Analyzing and detecting review spam[C]//Proceedings of the 7th IEEE Int’l Conf.on Data Mining. Washington: IEEE Computer Society, 2007: 547-552.
[3] 赵妍妍, 秦兵, 刘挺,等. 文本情感分析[J]. 软件学报, 2010, 21(8): 1834-1848.
[4] 王素格, 李德玉, 魏英杰,等. 基于赋权粗糙隶属度的文本情感分类方法[J]. 计算机研究与发展,2011,48(5): 855-861.
[5] 梁军,柴玉梅,原慧斌,等.基于深度学习的微博情感分析[J].中文信息学报,2014, 28(5):155-161.
[6] 李国林,万常选,边海容,等.基于语素的金融证券域文本情感探测[J].计算机研究与发展,2011,48(z2):432-437.
[7] 王昊,杨亮,林鸿飞,等.日本地震的微博热点事件分析[J].中文信息学报,2012,26(5):7-13.
[8] 林煜明,王晓玲,朱涛,等.用户评论的质量检测与控制研究综述[J].软件学报,2014, 25(3):506-527.
[9] Ott M, Choi Y Cardie, et al. Finding Deceptive Opinion Spam by Any Stretch of the Imagination [C]//Proceedings of ACL 2011: 309-319.
[10] https://www.mturk.com/mturk/welcome[EB/OL]. [2014-12-8]
[11] Popeseu AM, Etzioni O. Extracting Product Features and Opinions from Reviews [C]//Proceedings of HLT-EMNLP 2005. 2005: 339-346.
[12] 李岩,徐蔚然,陈光. PRIS_COAE CPAE 2013评测报告[C]//第五届中文倾向性分析评测研讨会(COAE 2013)评测报告论文集,2013: 53-69.
[13] 张莉, 钱玲飞, 许鑫等. 基于核心句及句法关系的评价对象抽取[J]. 中文信息学报, 2011, 25(3):23-29.
[14] Titov I, McDonald R. Modeling Online Reviews with Multi-grain Topic Models [C]//Proceedings of WWW 2008. 2008: 111-120.
[15] C Sauper, A Haghighi, R Barzilay. Content Models with Attitude [C]//Proceedings of ACL 2011. 2011: 350-358.
[16] Hu MQ, Liu B. Mining and Summarizing Customer Reviews [C]//Proceedings of KDD 2004. 2004: 68-177.
[17] Shoushan Li, Chengqing Zong and Xia Wang. Sentiment Classification through Combining Classifiers with Multiple Feature Sets [C]//Proceedings of NLP-KE 2007. 2007: 135-140.
[18] 王根, 赵军. 基于多重冗余标记CRFs的句子情感分析研究[J]. 中文信息学报, 2007, 21(5): 51-55,86.
[19] Andrew L Maas, Raymond E Daly, Peter T Pham, et al. Learning Word Vectors for Sentiment Analysis [C]//Proceedings of ACL 2011: 142-150.
[20] L Jiang, M Yu, M Zhou, et al. Target-dependent Twitter Sentiment Classification [C]//Proceedings of ACL 2011: 151-160.
[21] http://ictclas.nlpir.org/[EB/OL]. [2014-12-8]
[22] 谢涛丽.定中式“V+N”结构研究[D].上海师范大学硕士学位论文,2010.
[23] 尹世超.动词直接作定语与名词中心语的类[J].语文研究,2002,(2):1-7.
[24] 吕叔湘.吕叔湘全集(第一卷):中国文法要略[M].沈阳: 辽宁教育出版社, 2002.
[25] 张学会.浅析动词作宾语的谓宾动词[J].大庆师范学院学报,2008,28(1):99-101.
[26] 马新娜.试论形容词作宾语的述宾短语[D].浙江师范大学硕士学位论文,2010.
[27] 武钦青.述程结构“V/A+得+程度补语”研究[D].上海师范大学硕士学位论文,2012.
[28] 钱小飞.“地”字结构识别[J].现代语文(语言研究),2006,(5):61-63.
[29] 李淑荣.语气词“好了”[J].语文学刊,2006,(7):97-99.
[30] 杨亮,张绍武,林鸿飞等.基于图排序的词汇情感消歧研究[J].中文信息学报,2014, 28(6):129-136.