Quality Evaluation of Wikipedia Articles through Edit History and Editor Groups

Wikipedia is well known as a free encyclopedia, which is a type of collaborative repository system that allows the viewer to create and edit articles directly in the web browser. The weakness of the Wikipedia system is the possibility of manipulation and vandalism cannot be ruled out, so that the quality of any given Wikipedia article is not guaranteed. It is an important work to establish a quality evaluation method to help users decide how much they should trust an article in Wikipedia. In this paper we investigate the edit history of Wikipedia articles and propose a model of network structure of editors. We propose an algorithm to calculate the network structural indicator restoreratio. We use the proposed indicator combined with existing metrics to predict the quality of Wikipedia articles through support vector machine technology. The experimental results show that the proposed indicator has better performance in quality evaluation than several existing metrics.

[1]  Tom Cross,et al.  Puppy smoothies: Improving the reliability of open, collaborative wikis , 2006, First Monday.

[2]  Krishnendu Chatterjee,et al.  Assigning trust to Wikipedia content , 2008, Int. Sym. Wikis.

[3]  Linda C. Smith,et al.  INFORMATION QUALITY IN A COMMUNITY-BASED ENCYCLOPEDIA , 2005 .

[4]  Mikalai Sabel Structuring wiki revision history , 2007, WikiSym '07.

[5]  J. W. Hunt,et al.  An Algorithm for Differential File Comparison , 2008 .

[6]  Aniket Kittur,et al.  He says, she says: conflict and coordination in Wikipedia , 2007, CHI.

[7]  Mark Kramer,et al.  Wiki trust metrics based on phrasal analysis , 2008, Int. Sym. Wikis.

[8]  John Riedl,et al.  Creating, destroying, and restoring value in wikipedia , 2007, GROUP.

[9]  Andrew Lih,et al.  Wikipedia as Participatory Journalism: Reliable Sources? Metrics for evaluating collaborative media as a news resource , 2004 .

[10]  Stephen Barrett,et al.  Computational Trust in Web Content Quality: A Comparative Evalutation on the Wikipedia Project , 2007, Informatica.

[11]  J. Giles Internet encyclopaedias go head to head , 2005, Nature.

[12]  Joshua Evan Blumenstock,et al.  Size matters: word count as a measure of quality on wikipedia , 2008, WWW.

[13]  Les Gasser,et al.  Assessing Information Quality of a Community-Based Encyclopedia , 2005, ICIQ.

[14]  Bo Leuf,et al.  The Wiki Way: Quick Collaboration on the Web , 2001 .

[15]  Thomas Wöhner,et al.  Assessing the quality of Wikipedia articles with lifecycle based metrics , 2009, Int. Sym. Wikis.

[16]  Benno Stein,et al.  Automatic Vandalism Detection in Wikipedia , 2008, ECIR.

[17]  Ee-Peng Lim,et al.  Measuring Qualities of Articles Contributed by Online Communities , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[18]  Bernhard Hoisl,et al.  Social Rewarding in Wiki Systems - Motivating the Community , 2009, HCI.