Abstract:The number of Wikipedia articles and contributors grows at a very fast pace, therefore, a remarkable property of some Wikipedia articles were written by up to thousands of authors who have contradicting opinions. This paper aims to indentify controversial articles in Wikipedia. It draws clues from the edit history page in Wikipedia based on the traditional methods, and takes into account the contributors of the corresponding article to compute controversial scores. We also introduce a new intuitive evaluation method besides the PRF and NDCG evaluation metrics. Experiments on 16745 Wikipedia articles show that our methods perform much better than the other baseline models.
[1] Wikipedia. What is Wikipedia [OL].http://wikipedia.jaylee.cn/. [2] J Giles. Internet encyclopedias go head to head [OL]. http://www.nature.com/news/2005/051212/full/438 900a.html. [3] V Franco, R Piirto, H Y Hu, et al. Anatomy of a flame: conflict and community building on the Internet [J]. Tech. and Society Magazine, IEEE, 1995,14: 12-21. [4] B Q Vuong, E P Lim, A Sun, et al. On ranking controversies in Wikipedia: models and evaluation[C]//Proceedings of the International Conference on Web Search and Web Data Mining (WSDM08), Palo Alto, California, USA, February 11-12, 2008: 171-182. [5] N Lipka, B Stein. Identifying featured articles in Wikipedia: writing style matters[C]//Proceedings of International World Wide Web Conferences (WWW10). Raleigh, North Carolina, USA, 2010: 1147-1148. [6] B T Adler, L de Alfaro. A content-driven reputation system for the Wikipedia[C]//Proceedings of International World Wide Web Conferences (WWW07), Banff, Canada, 2007: 261-270. [7] J E Blumenstock. Size matters: word count as a measure of quality on Wikipedia[C]//Proceedings of International World Wide Web Conferences (WWW08), Beijing, China, 2008: 1095-1096. [8] A Kittur, B Suh, B A Pendleton, et al. He says, she says: conflict and coordination in Wikipedia[C]//Proceedings of SIGCHI Conf. Human Factors in Computing Systems, Son Jose, California, USA, 2007: 453-462. [9] U Brandes, P Kenis, J Lerner, et al. Network analysis of collaboration structure in Wikipedia[C]//Proceedings of International World Wide Web Conferences (WWW09), Madrid, Spain, 2009: 731-740. [10] U Brandes, J Lerner. Visual analysis of controversy in contributor-generated encyclopedias [J]. Information Visualization, 2008,11: 34-48. [11] R Jesus. Bipartite networks of wikipedias articles and authors: a meso-level approach[C]//Proceedings of International Symposium on Wikis and Open Collaboration (WikiSym09). Orlando, Florida, USA, 2009: 1-10.