Network structure exploration via Bayesian nonparametric models

Complex networks provide a powerful mathematical representation of complex systems in nature and society. To understand complex networks, it is crucial to explore their internal structures, also called structural regularities. The task of network structure exploration is to determine how many groups there are in a complex network and how to group the nodes of the network. Most existing structure exploration methods need to specify either a group number or a certain type of structure when they are applied to a network. In the real world, however, the group number and also the certain type of structure that a network has are usually unknown in advance. To explore structural regularities in complex networks automatically, without any prior knowledge of the group number or the certain type of structure, we extend a probabilistic mixture model that can handle networks with any type of structure but needs to specify a group number using Bayesian nonparametric theory. We also propose a novel Bayesian nonparametric model, called the Bayesian nonparametric mixture (BNPM) model. Experiments conducted on a large number of networks with different structures show that the BNPM model is able to explore structural regularities in networks automatically with a stable, state-of-the-art performance.

[1]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  C. Altafini,et al.  Computing global structural balance in large-scale signed social networks , 2011, Proceedings of the National Academy of Sciences.

[3]  V A Traag,et al.  Narrow scope for resolution-limit-free community detection. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Attila Szolnoki,et al.  Coevolutionary Games - A Mini Review , 2009, Biosyst..

[5]  P. Hebert,et al.  Complementary molecular information changes our perception of food web structure , 2014, Proceedings of the National Academy of Sciences.

[6]  Jian Yu,et al.  Combining a popularity-productivity stochastic block model with a discriminative-content model for general structure detection. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Purnamrita Sarkar,et al.  Nonparametric Link Prediction in Dynamic Networks , 2012, ICML.

[8]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[9]  Giacomo Indiveri,et al.  Synthesizing cognition in neuromorphic electronic systems , 2013, Proceedings of the National Academy of Sciences.

[10]  Amedeo Caflisch,et al.  Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Samuel J. Gershman,et al.  A Tutorial on Bayesian Nonparametric Models , 2011, 1106.2697.

[12]  P. Ronhovde,et al.  Local resolution-limit-free Potts model for community detection. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Buzhou Tang,et al.  Overlapping community detection in networks with positive and negative links , 2014 .

[14]  Mong-Li Lee,et al.  Community-based user recommendation in uni-directional social networks , 2013, CIKM.

[15]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[16]  S. Walker Invited comment on the paper "Slice Sampling" by Radford Neal , 2003 .

[17]  Jouhyun Jeon,et al.  Spatial and functional organization of mitochondrial protein network , 2013, Scientific Reports.

[18]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[19]  L. Asz Random Walks on Graphs: a Survey , 2022 .

[20]  J. Pitman Combinatorial Stochastic Processes , 2006 .

[21]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  Attila Szolnoki,et al.  Self-organization towards optimally interdependent networks by means of coevolution , 2014, ArXiv.

[23]  E A Leicht,et al.  Mixture models and exploratory analysis in networks , 2006, Proceedings of the National Academy of Sciences.

[24]  Ana L. N. Fred,et al.  Robust data clustering , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[25]  Matteo Pellegrini,et al.  Detecting Communities Based on Network Topology , 2014, Scientific Reports.

[26]  Dean V. Buonomano,et al.  ROBUST TIMING AND MOTOR PATTERNS BY TAMING CHAOS IN RECURRENT NEURAL NETWORKS , 2012, Nature Neuroscience.

[27]  P. Taylor,et al.  Measurement of the World City Network , 2002 .

[28]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Sankaran Mahadevan,et al.  Box-covering algorithm for fractal dimension of weighted networks , 2013, Scientific Reports.

[30]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[31]  Lin Wang,et al.  Degree mixing in multilayer networks impedes the evolution of cooperation , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Amy Nicole Langville,et al.  A Reordering for the PageRank Problem , 2005, SIAM J. Sci. Comput..

[33]  D. Bu,et al.  Topological structure analysis of the protein-protein interaction network in budding yeast. , 2003, Nucleic acids research.

[34]  J. Frouz,et al.  Soil food web properties explain ecosystem services across European land use systems , 2013, Proceedings of the National Academy of Sciences.

[35]  Attila Szolnoki,et al.  Interdependent network reciprocity in evolutionary games , 2013, Scientific Reports.

[36]  Morten Mørup,et al.  Bayesian Community Detection , 2012, Neural Computation.

[37]  Peter J. Taylor,et al.  Exploratory Analysis of the World City Network , 2002 .

[38]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[39]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[40]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Hang-Hyun Jo,et al.  Generalized friendship paradox in complex networks: The case of scientific collaboration , 2014, Scientific Reports.

[42]  Nima Sarshar,et al.  Experience versus talent shapes the structure of the Web , 2008, Proceedings of the National Academy of Sciences.

[43]  Z. Wang,et al.  The structure and dynamics of multilayer networks , 2014, Physics Reports.

[44]  Attila Szolnoki,et al.  Evolution of public cooperation on interdependent networks: The impact of biased utility functions , 2012, ArXiv.

[45]  Xueqi Cheng,et al.  Exploring the structural regularities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Norman P. Hummon,et al.  Connectivity in a citation network: The development of DNA theory☆ , 1989 .

[47]  Christoph Hauert,et al.  Origin and Structure of Dynamic Cooperative Networks , 2014, Scientific Reports.

[48]  Francesco Saverio Pavone,et al.  The theory of pattern formation on directed networks. , 2014, Nature communications.

[49]  Steve Gregory,et al.  Finding missing edges in networks based on their community structure , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[50]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[51]  Samuel Kaski,et al.  Inferring Vertex Properties from Topology in Large Networks , 2007, MLG.

[52]  Ellen V. Rothenberg,et al.  Developmental gene networks: a triathlon on the course to T cell identity , 2014, Nature Reviews Immunology.

[53]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[54]  J. Wang,et al.  Detecting groups of similar components in complex networks , 2008, 0808.1612.

[55]  Zoubin Ghahramani,et al.  An Infinite Latent Attribute Model for Network Data , 2012, ICML.

[56]  Richard Van Noorden Online collaboration: Scientists and the social network , 2014, Nature.

[57]  Radford M. Neal Slice Sampling , 2003, The Annals of Statistics.

[58]  Lucas C. Parra,et al.  Origins of power-law degree distribution in the heterogeneity of human activity in social networks , 2013, Scientific Reports.

[59]  Thomas A. Schreiber,et al.  The University of South Florida free association, rhyme, and word fragment norms , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[60]  Attila Szolnoki,et al.  Evolutionary dynamics of group interactions on structured populations: a review , 2013, Journal of The Royal Society Interface.