Multilevel Compression of Random Walks on Networks Reveals Hierarchical Organization in Large Integrated Systems

To comprehend the hierarchical organization of large integrated systems, we introduce the hierarchical map equation, which reveals multilevel structures in networks. In this information-theoretic approach, we exploit the duality between compression and pattern detection; by compressing a description of a random walker as a proxy for real flow on a network, we find regularities in the network that induce this system-wide flow. Finding the shortest multilevel description of the random walker therefore gives us the best hierarchical clustering of the network — the optimal number of levels and modular partition at each level — with respect to the dynamics on the network. With a novel search algorithm, we extract and illustrate the rich multilevel organization of several large social and biological networks. For example, from the global air traffic network we uncover countries and continents, and from the pattern of scientific communication we reveal more than 100 scientific fields organized in four major disciplines: life sciences, physical sciences, ecology and earth sciences, and social sciences. In general, we find shallow hierarchical structures in globally interconnected systems, such as neural networks, and rich multilevel organizations in systems with highly separated regions, such as road networks.

[1]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[2]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[3]  Robin Palotai,et al.  Community Landscapes: An Integrative Approach to Determine Overlapping Network Module Hierarchy, Identify Key Nodes and Predict Network Dynamics , 2009, PloS one.

[4]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[5]  Mark A. Pitt,et al.  Advances in Minimum Description Length: Theory and Applications , 2005 .

[6]  R. Winther Varieties of modules: kinds, levels, origins, and behaviors. , 2001, The Journal of experimental zoology.

[7]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[8]  R. Guimerà,et al.  The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Albert-László Barabási,et al.  Hierarchical organization in complex networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  K Sneppen,et al.  Networks and cities: an information perspective. , 2005, Physical review letters.

[11]  Carl T. Bergstrom,et al.  Mapping Change in Large Networks , 2008, PloS one.

[12]  D. Thieffry,et al.  Modularity in development and evolution. , 2000, BioEssays : news and reviews in molecular, cellular and developmental biology.

[13]  Martin Rosvall,et al.  Maps of Information Flow Reveal Community Structure In Complex Networks , 2007 .

[14]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[15]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[17]  Marc-Thorsten Hütt,et al.  Organization of Excitable Dynamics in Hierarchical Biological Networks , 2008, PLoS Comput. Biol..

[18]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[19]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[20]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[21]  Carl T. Bergstrom,et al.  The map equation , 2009, 0906.1405.

[22]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[23]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[24]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[25]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[26]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[27]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[28]  P. Ronhovde,et al.  Multiresolution community detection for megascale networks by information-based replica correlations. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Ken Wakita,et al.  Finding community structure in mega-scale social networks: [extended abstract] , 2007, WWW '07.

[30]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..