Document controversy classification based on the Wikipedia category structure

Dispute and controversy are part of our culture and are unavoidable on the Internet, where they become more anonymous. Controversy has been studied extensively, especially on social networks such as Wikipedia. This free online encyclopedia has become a very popular data source among researchers studying behavior or natural language processing. This paper proposes using the category structure of Wikipedia to determine how controversial a single article is. This is the first part of a proposed system that assigns a topic controversy score to any given text.
