Overview of the 1th International Competition on Quality Flaw Prediction in Wikipedia

The paper overviews the task "Quality Flaw Prediction in Wikipedia" of the PAN'12 competition. An evaluation corpus is introduced which comprises 1 592 226 English Wikipedia articles, of which 208 228 have been tagged to con- tain one of ten important quality flaws. Moreover, the performance of three qual- ity flaw classifiers is evaluated.

[1]  Pável Calado,et al.  Automatic quality assessment of content created collaboratively by web communities: a case study of wikipedia , 2009, JCDL '09.

[2]  Benno Stein,et al.  Towards automatic quality assurance in Wikipedia , 2011, WWW.

[3]  Les Gasser,et al.  Information quality work organization in wikipedia , 2008, J. Assoc. Inf. Sci. Technol..

[4]  Benno Stein,et al.  On the Evolution of Quality Flaws and the Effectiveness of Cleanup Tags in the English Wikipedia , 2012 .

[5]  Philip S. Yu,et al.  Building text classifiers using positive and unlabeled examples , 2003, Third IEEE International Conference on Data Mining.

[6]  Ee-Peng Lim,et al.  Measuring article quality in wikipedia: models and evaluation , 2007, CIKM '07.

[7]  Bernardo A. Huberman,et al.  Cooperation and quality in wikipedia , 2007, WikiSym '07.

[8]  Joshua Evan Blumenstock,et al.  Size matters: word count as a measure of quality on wikipedia , 2008, WWW.

[9]  Benno Stein,et al.  Predicting quality flaws in user-generated content: the case of wikipedia , 2012, SIGIR '12.

[10]  Paolo Rosso,et al.  On the Use of PU Learning for Quality Flaw Prediction in Wikipedia , 2012, CLEF.

[11]  Matthijs den Besten,et al.  Wikibugs: using template messages in open content collections , 2009, Int. Sym. Wikis.

[12]  Benno Stein,et al.  A breakdown of quality flaws in Wikipedia , 2012, WebQuality '12.

[13]  Oliver Ferschke,et al.  FlawFinder: A Modular System for Predicting Quality Flaws in Wikipedia , 2012, CLEF.

[14]  Benno Stein,et al.  Detection of text quality flaws as a one-class classification problem , 2011, CIKM '11.

[15]  Benno Stein,et al.  Identifying featured articles in wikipedia: writing style matters , 2010, WWW '10.