Investigating relationships within and between category networks in Wikipedia

This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.

[1]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[2]  Albert-Laszlo Barabasi,et al.  Statistical Mechanics of Complex Networks: From the Internet to Cell Biology , 2006 .

[3]  V. Zlatic,et al.  Wikipedias: collaborative web-based encyclopedias as complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  G. Caldarelli,et al.  Taxonomy and clustering in collaborative systems: The case of the on-line encyclopedia Wikipedia , 2007, 0710.3058.

[5]  Ismael Rafols,et al.  Is science becoming more interdisciplinary? Measuring and mapping six research fields over time , 2009, Scientometrics.

[6]  Jan Youtie,et al.  Where does nanotechnology belong in the map of science? , 2009, Nature nanotechnology.

[7]  G. Caldarelli,et al.  Preferential attachment in the growth of social networks, the Internet encyclopedia wikipedia , 2007 .

[8]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[9]  Kevin W. Boyack,et al.  Using detailed maps of science to identify potential collaborations , 2009, Scientometrics.

[10]  Yoram Louzoun,et al.  Self-emergence of knowledge trees: extraction of the Wikipedia hierarchies. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Michael A. Rodriguez,et al.  Clickstream Data Yields High-Resolution Maps of Science , 2009, PloS one.

[12]  S. N. Dorogovtsev,et al.  Evolution of networks , 2001, cond-mat/0106144.

[13]  Kevin W. Boyack,et al.  Toward a consensus map of science , 2009, J. Assoc. Inf. Sci. Technol..

[14]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[15]  Reka Albert,et al.  Mean-field theory for scale-free random networks , 1999 .

[16]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[17]  Kevin W. Boyack,et al.  Thought leadership: A new indicator for national and institutional comparison , 2008, Scientometrics.

[18]  Ismael Rafols,et al.  A global map of science based on the ISI subject categories , 2009, J. Assoc. Inf. Sci. Technol..

[19]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[20]  D. Watts,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2001 .

[21]  Katy Börner,et al.  Analyzing and visualizing the semantic coverage of Wikipedia and its authors , 2005, Complex..

[22]  Kevin W. Boyack,et al.  Mapping the backbone of science , 2004, Scientometrics.

[23]  Kevin W. Boyack,et al.  Toward a consensus map of science , 2009 .