Finding community structure in networks using the eigenvectors of matrices.

We consider the problem of detecting communities or modules in networks, groups of vertices with a higher-than-average density of edges connecting them. Previous work indicates that a robust approach to this problem is the maximization of the benefit function known as "modularity" over possible divisions of a network. Here we show that this maximization process can be written in terms of the eigenspectrum of a matrix we call the modularity matrix, which plays a role in community detection similar to that played by the graph Laplacian in graph partitioning calculations. This result leads us to a number of possible algorithms for detecting community structure, as well as several other results, including a spectral measure of bipartite structure in networks and a centrality measure that identifies vertices that occupy central positions within the communities to which they belong. The algorithms and measures proposed are illustrated with applications to a variety of real-world complex networks.

[1]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[2]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[3]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[4]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[5]  Ronald S. Burt,et al.  Positions in Networks , 1976 .

[6]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[7]  Isma'il ibn Ali al-Sadiq AIDS , 1986, The Lancet.

[8]  Dorit S. Hochbaum,et al.  Polynomial algorithm for the k-cut problem , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[9]  Chung-Kuan Cheng,et al.  Towards efficient hierarchical designs by ratio cut partitioning , 1989, 1989 IEEE International Conference on Computer-Aided Design. Digest of Technical Papers.

[10]  R. May,et al.  Networks of sexual contacts: implications for the pattern of spread of HIV , 1989, AIDS.

[11]  D. Eppstein,et al.  Provably good mesh generation , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[12]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[13]  P. Gács,et al.  Algorithms , 1992 .

[14]  Martine D. F. Schlag,et al.  Spectral K-way ratio-cut partitioning and clustering , 1994, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[15]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[16]  Charles J. Alpert,et al.  Spectral Partitioning: The More Eigenvectors, The Better , 1995, 32nd Design Automation Conference.

[17]  K. Holmes,et al.  Sexual Mixing Patterns of Patients Attending Sexually Transmitted Diseases Clinics , 1996, Sexually transmitted diseases.

[18]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[19]  M. KleinbergJon Authoritative sources in a hyperlinked environment , 1999 .

[20]  Thorsten von Eicken,et al.  技術解説 IEEE Computer , 1999 .

[21]  K. Holmes,et al.  Sexual mixing patterns in the spread of gonococcal and chlamydial infections. , 1999, American journal of public health.

[22]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[23]  H E Stanley,et al.  Classes of small-world networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[24]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[25]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[26]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  R Pastor-Satorras,et al.  Dynamical and correlation properties of the internet. , 2001, Physical review letters.

[28]  S. Strogatz Exploring complex networks , 2001, Nature.

[29]  S. N. Dorogovtsev,et al.  Evolution of networks , 2001, cond-mat/0106144.

[30]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Béla Bollobás,et al.  Modern Graph Theory , 2002, Graduate Texts in Mathematics.

[32]  Mauricio Barahona,et al.  Synchronization in small-world systems. , 2002, Physical review letters.

[33]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[34]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[35]  F. Chung,et al.  Connected Components in Random Graphs with Given Expected Degree Sequences , 2002 .

[36]  Petter Holme,et al.  Network bipartivity. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  D. Mason,et al.  Compartments revealed in food-web structure , 2003, Nature.

[38]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[40]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[41]  Petter Holme,et al.  Subnetwork hierarchies of biochemical pathways , 2002, Bioinform..

[42]  Adilson E Motter,et al.  Heterogeneity in oscillator networks: are smaller worlds easier to synchronize? , 2003, Physical review letters.

[43]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[44]  Haijun Zhou Distance, dissimilarity index, and network community structure. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[45]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  M. A. Muñoz,et al.  Journal of Statistical Mechanics: An IOP and SISSA journal Theory and Experiment Detecting network communities: a new systematic and efficient algorithm , 2004 .

[47]  Massimo Marchiori,et al.  Method to find community structures based on information centrality. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[49]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[50]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[52]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[53]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[54]  Stefan Bornholdt,et al.  Detecting fuzzy community structures in complex networks with a Potts model. , 2004, Physical review letters.

[55]  Reinhard Lipowsky,et al.  Network Brownian Motion: A New Method to Measure Vertex-Vertex Proximity and to Identify Communities and Subcommunities , 2004, International Conference on Computational Science.

[56]  Albert-László Barabási,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW , 2004 .

[57]  J. A. Rodríguez-Velázquez,et al.  Spectral measures of bipartivity in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[58]  Bhavani Thuraisingham,et al.  Proceedings of the 2005 IEEE international conference on Intelligence and Security Informatics , 2005 .

[59]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[60]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[61]  Pekka Orponen,et al.  Local Clustering of Large Graphs by Approximate Fiedler Vectors , 2005, WEA.

[62]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[64]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[65]  G. Caldarelli,et al.  Detecting communities in large networks , 2004, cond-mat/0402499.

[66]  J. Doye,et al.  Identifying communities within energy landscapes. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[67]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[68]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[69]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[70]  Andrei Z. Broder,et al.  Workshop on Algorithms and Models for the Web Graph , 2007, WAW.

[71]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[72]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[73]  Leon Danon,et al.  The effect of size heterogeneity on community identification in complex networks , 2006, physics/0601144.

[74]  Mika Gustafsson,et al.  Comparison and validation of community structures in complex networks , 2006 .

[75]  Gergely Palla,et al.  Preferential attachment of communities: The same principle, but a higher level , 2006 .

[76]  M. Hastings Community detection as an inference problem. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[77]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[78]  Petter Holme,et al.  Currency and commodity metabolites: their identification and relation to the modularity of metabolic networks. , 2006, IET systems biology.

[79]  Ke Wang,et al.  Proceedings of the Eighth SIAM International Conference on Data Mining , 2008, SDM 2008.