Two-dimensional ranking of Wikipedia articles

Abstract. The Library of Babel, described by Jorge Luis Borges, stores an enormous amount of information. The Library exists ab aeterno. Wikipedia, a free online encyclopaedia, becomes a modern analogue of such a Library. Information retrieval and ranking of Wikipedia articles become the challenge of modern society. While PageRank highlights very well known nodes with many ingoing links, CheiRank highlights very communicative nodes with many outgoing links. In this way the ranking becomes two-dimensional. Using CheiRank and PageRank we analyze the properties of two-dimensional ranking of all Wikipedia English articles and show that it gives their reliable classification with rich and nontrivial features. Detailed studies are done for countries, universities, personalities, physicists, chess players, Dow-Jones companies and other categories.

[1]  M. Gell-Mann,et al.  Physics Today. , 1966, Applied optics.

[2]  Philipp Blom Enlightening the World: Encyclopedie, The Book That Changed the Course of History , 2003 .

[3]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[4]  S. Redner Citation statistics from 110 years of physical review , 2005, physics/0506056.

[5]  Eli Upfal,et al.  Using PageRank to Characterize Web Structure , 2002, Internet Math..

[6]  D. Diderot,et al.  Encyclopédie, ou, Dictionnaire raisonné des sciences, des arts et des métiers , 1963 .

[7]  Santo Fortunato,et al.  Diffusion of scientific credits and the ranking of scientists , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  V. Zlatic,et al.  Wikipedias: collaborative web-based encyclopedias as complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  J. Giles Internet encyclopaedias go head to head , 2005, Nature.

[10]  R. Bonato Network Analysis for Wikipedia , 2005 .

[11]  Alexei Chepelianskii,et al.  Towards physical laws for software architecture , 2010, ArXiv.

[12]  Peter B. Danzig NetCache Architecture and Deployment , 1998, Comput. Networks.

[13]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  S. N. Dorogovtsev,et al.  Evolution of networks , 2001, cond-mat/0106144.

[15]  Cristina V. Lopes,et al.  Review-Based Ranking of Wikipedia Articles , 2009, 2009 International Conference on Computational Aspects of Social Networks.

[16]  Giuseppe Attardi,et al.  Ranking very many typed entities on wikipedia , 2007, CIKM '07.

[17]  Yoram Louzoun,et al.  Self-emergence of knowledge trees: extraction of the Wikipedia hierarchies. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  James Hendler,et al.  Google’s PageRank and Beyond: The Science of Search Engine Rankings , 2007 .

[19]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[20]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[21]  Guido Caldarelli,et al.  Preferential attachment in the growth of social networks: the case of Wikipedia , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[23]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[24]  Denis Diderot,et al.  Encyclopédie ou dictionnaire raisonné des sciences Paris 1751-1772 : anatomie, chirurgie , 1977 .

[25]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[26]  M. H. Hart,et al.  The 100: A Ranking of the Most Influential Persons in History , 1978 .

[27]  Debora Donato,et al.  Large scale properties of the Webgraph , 2004 .

[28]  G. Thomson,et al.  J. J. Thomson , 1956, Nature.