Clustering method incorporating network topology and dynamics

Clustering is useful in elucidating associations among agents of networks and has been advantageously applied in numerous fields including biology, chemistry, sociology, and economics. Most clustering algorithms have been applied to (weighted) networks with a fixed topology. However, many networks are constructed to simulate particular dynamics on them; e.g., transmission of disease, vehicular transportation, electricity supply, and economic transfers in financial markets. These dynamics affect the large-scale structure that emerges from the underlying network. We present a clustering method that incorporates not only the weighted network topology, but also the particular dynamics for an application domain. The approach is general and can be used with any dynamic process that can be simulated on a network. We apply this method to several networks to validate it: a benchmark network, various toy networks, and two large realistic synthetic networks. These span four, five, and two orders of magnitude in numbers of agents and links, and average degree, respectively, and possess vastly different degree distributions. The largest network includes over 580,000 agents and 13 million edges. The results show that different structures arise from variations in dynamics processes for a fixed network, and reflect the changes in the process itself. We observe a sharp transition from unclustered to well-clustered communities as dynamical parameters vary.

[1]  Jari Saramäki,et al.  Model of community emergence in weighted social networks , 2009, Comput. Phys. Commun..

[2]  James Truscott,et al.  Control of a highly pathogenic H5N1 avian influenza outbreak in the GB poultry flock , 2007, Proceedings of the Royal Society B: Biological Sciences.

[3]  A. Nizam,et al.  Containing Pandemic Influenza at the Source , 2005, Science.

[4]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[5]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[6]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[7]  David L. Smith,et al.  Key strategies for reducing spread of avian influenza among commercial poultry holdings: lessons for transmission to humans , 2006, Proceedings of the Royal Society B: Biological Sciences.

[8]  M. E. Williams,et al.  TRANSIMS: TRANSPORTATION ANALYSIS AND SIMULATION SYSTEM , 1995 .

[9]  Jure Leskovec,et al.  Statistical properties of community structure in large social and information networks , 2008, WWW.

[10]  Jari Saramäki,et al.  Resolution limit in complex network community detection with Potts model approach , 2006 .

[11]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[12]  K. Kaski,et al.  Limited resolution in complex network community detection with Potts model approach , 2006 .

[13]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[14]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[15]  Shigehiko Kanaya,et al.  Development and implementation of an algorithm for detection of protein complexes in large interaction networks , 2006, BMC Bioinformatics.

[16]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[17]  Armin Elbers,et al.  Risk Maps for the Spread of Highly Pathogenic Avian Influenza in Poultry , 2007, PLoS Comput. Biol..

[18]  S. Dongen Graph clustering by flow simulation , 2000 .

[19]  Ulrik Brandes,et al.  Experiments on Graph Clustering Algorithms , 2003, ESA.

[20]  Paul R. Cohen,et al.  Maximizing Influence Propagation in Networks with Community Structure , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Jean-Cédric Chappelier,et al.  Finding instabilities in the community structure of complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  M. Macy,et al.  Complex Contagions and the Weakness of Long Ties1 , 2007, American Journal of Sociology.

[23]  Achla Marathe,et al.  The structure of electrical networks: a graph theory based analysis , 2009, Int. J. Crit. Infrastructures.

[24]  Ulrik Brandes,et al.  On Finding Graph Clusterings with Maximum Modularity , 2007, WG.

[25]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[26]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[27]  Jari Saramäki,et al.  Emergence of communities in weighted networks. , 2007, Physical review letters.

[28]  M. Newman,et al.  Robustness of community structure in networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[32]  Anton J. Enright,et al.  Detection of functional modules from protein interaction networks , 2003, Proteins.

[33]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[34]  Samarth Swarup,et al.  A Study of Information Diffusion over a Realistic Social Network Model , 2009, 2009 International Conference on Computational Science and Engineering.

[35]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  V. S. Anil Kumar,et al.  Locational market power in network constrained markets , 2009 .

[38]  Madhav V. Marathe,et al.  EpiFast: a fast algorithm for large scale realistic epidemic simulations on distributed memory systems , 2009, ICS.

[39]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[41]  Madhav V. Marathe,et al.  Generation and analysis of large synthetic social contact networks , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).