Maps of random walks on complex networks reveal community structure

To comprehend the multipartite organization of large-scale biological and social systems, we introduce an information theoretic approach that reveals community structure in weighted and directed networks. We use the probability flow of random walks on a network as a proxy for information flows in the real system and decompose the network into modules by compressing a description of the probability flow. The result is a map that both simplifies and highlights the regularities in the structure and their relationships. We illustrate the method by making a map of scientific communication as captured in the citation patterns of >6,000 journals. We discover a multicentric organization with fields that vary dramatically in size and degree of integration into the network of science. Along the backbone of the network—including physics, chemistry, molecular biology, and medicine—information flows bidirectionally, but the map reveals a directional pattern of citation from the applied fields to the basic sciences.

[1]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[2]  George Kingsley Zipf,et al.  Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology , 2012 .

[3]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[4]  H. Peak Human behavior and the principle of least effort. An introduction to human ecology. , 1950 .

[5]  S. Lazerow The Institute of Scientific Information , 1961, Nature.

[6]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[7]  References , 1971 .

[8]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[9]  Harvey Fletcher,et al.  Speech and hearing. , 1930, Health services manager.

[10]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[11]  M. McBride Environmental Chemistry of Soils , 1994 .

[12]  J. Mogul,et al.  Computer networks and isdn systems , 1995 .

[13]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.

[14]  Gerard T. Barkema,et al.  Monte Carlo Methods in Statistical Physics , 1999 .

[15]  Henry Small Visualizing science by citation mapping , 1999 .

[16]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[18]  Sergei Maslov,et al.  Modularity and extreme edges of the internet. , 2003, Physical review letters.

[19]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[20]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Katy Börner,et al.  Mapping knowledge domains , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Zaida Chinchilla-Rodríguez,et al.  A new technique for building maps of large scientific domains based on the cocitation of classes and categories , 2004, Scientometrics.

[24]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Mark A. Pitt,et al.  Advances in Minimum Description Length: Theory and Applications , 2005 .

[26]  E. Ziv,et al.  Information-theoretic approach to network modularity. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[28]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[29]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[30]  E. Tufte Beautiful Evidence , 2006 .

[31]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[32]  Martin Rosvall,et al.  An information-theoretic framework for resolving community structure in complex networks , 2007, Proceedings of the National Academy of Sciences.

[33]  Roger Guimerà,et al.  Module identification in bipartite and directed networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  Sergio Gómez,et al.  Size reduction of complex networks preserving modularity , 2007, ArXiv.

[35]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[36]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.