An Analysis of Topical Coverage of Wikipedia

Many have questioned the reliability and accuracy of Wikipedia. Here a different issue, but one closely related: how broad is the coverage of Wikipedia? Differences in the interests and attention of Wikipedia’s editors mean that some areas, in the traditional sciences, for example, are better covered than others. Two approaches to measuring this coverage are presented. The first maps the distribution of topics on Wikipedia to the distribution of books published. The second compares the distribution of topics in three established, field-specific academic encyclopedias to the articles found in Wikipedia. Unlike the top-down construction of traditional encyclopedias, Wikipedia’s topical coverage is driven by the interests of its users, and as a result, the reliability and completeness of Wikipedia is likely to be different depending on the subject-area of the article.

[1]  L. Wood,et al.  From the Authors , 2003, European Respiratory Journal.

[2]  Bernardo A. Huberman,et al.  Assessing the value of cooperation in Wikipedia , 2007, First Monday.

[3]  John Bowker Implicit Morality: An Empirical Ethical Perspective , 2007 .

[4]  R. G. Lerner,et al.  Encyclopedia of Physics , 1990 .

[5]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[6]  Martin Wattenberg,et al.  Proceedings of the 40th Hawaii International Conference on System Sciences- 2007 Talk Before You Type: Coordination in Wikipedia , 2022 .

[7]  Lisa M. George What's fit to print: The effect of ownership concentration on product variety in daily newspaper markets , 2001, Inf. Econ. Policy.

[8]  Panayiotis Zaphiris,et al.  Cultural Differences in Collaborative Authoring of Wikipedia , 2006, J. Comput. Mediat. Commun..

[9]  K. Lancaster The Economics of Product Variety: A Survey , 1990 .

[10]  Martin Wattenberg,et al.  Studying cooperation and conflict between authors with history flow visualizations , 2004, CHI.

[11]  Susan C. Herring,et al.  Collaborative Authoring on the Web: A Genre Analysis of Online Encyclopedias , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[12]  Maria Ruiz-Casado,et al.  Automatic Assignment of Wikipedia Encyclopedic Entries to WordNet Synsets , 2005, AWIC.

[13]  McDaniel Cg In hospital program many cooks don't spoil the broth. , 1982 .

[14]  Alex Preminger,et al.  The New Princeton Encyclopedia of Poetry and Poetics , 1994 .

[15]  R. Bonato Network Analysis for Wikipedia , 2005 .

[16]  Linda C. Smith,et al.  INFORMATION QUALITY DISCUSSIONS IN WIKIPEDIA , 2005 .

[17]  Bryan A. Pendleton,et al.  Power of the Few vs. Wisdom of the Crowd: Wikipedia and the Rise of the Bourgeoisie , 2006 .

[18]  Bernardo A. Huberman,et al.  Assessing the Value of Coooperation in Wikipedia , 2007, ArXiv.

[19]  Katy Börner,et al.  Analyzing and visualizing the semantic coverage of Wikipedia and its authors , 2005, Complex..

[20]  Les Gasser,et al.  Assessing Information Quality of a Community-Based Encyclopedia , 2005, ICIQ.

[21]  J. Voß Measuring Wikipedia , 2005 .

[22]  Thomas Chesney,et al.  An empirical examination of Wikipedia's credibility , 2006, First Monday.

[23]  L. Terveen,et al.  Becoming Wikipedian : Transformation of Participation in a Collaborative Online Encyclopedia , 2009 .

[24]  Benjamin M. Compaine,et al.  Internet Radio: A New Engine for Content Diversity? , 2001, ArXiv.

[25]  J. Giles Internet encyclopaedias go head to head , 2005, Nature.

[26]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[27]  V. Zlatic,et al.  Wikipedias: collaborative web-based encyclopedias as complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.