LEARNING OVERLAPPING COMMUNITIES IN COMPLEX NETWORKS VIA NON-NEGATIVE MATRIX FACTORIZATION

Community structure is an important topological phenomenon typical of complex networks. Accurately unveiling communities is thus crucial to understand and capture the many-faceted nature of complex networks. Communities in real world frequently overlap, i.e. nodes can belong to more than one community. Therefore, quantitatively evaluating the extent to which a node belongs to a community is a key step to find overlapping boundaries between communities. Non-negative matrix factorization (NMF) is a technique that has been used to detect overlapping communities. However, previous efforts in this direction present: (i) limitations in the interpretation of meaningful overlaps and (ii) lack of accuracy in predicting the correct number of communities. In this paper, a hybrid method of NMF to overcome both limitations is presented. This approach effectively estimates the number of communities and is more interpretable and more accurate in identifying overlapping communities in undirected networks than previous approaches. Validations on synthetic and real world networks show that the proposed community learning framework can effectively reveal overlapping communities in complex networks.

[1]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Alexander Rives,et al.  Modular organization of cellular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Michael W. Berry,et al.  Email Surveillance Using Non-negative Matrix Factorization , 2005, Comput. Math. Organ. Theory.

[4]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[5]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[6]  Martin Rosvall,et al.  An information-theoretic framework for resolving community structure in complex networks , 2007, Proceedings of the National Academy of Sciences.

[7]  Shihua Zhang,et al.  Uncovering fuzzy community structure in complex networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Stephen Roberts,et al.  Overlapping community detection using Bayesian non-negative matrix factorization. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  T. Nepusz,et al.  Fuzzy communities and the concept of bridgeness in complex networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Shihua Zhang,et al.  Identification of overlapping community structure in complex networks using fuzzy c-means clustering , 2007 .

[12]  Fei Wang,et al.  Community discovery using nonnegative matrix factorization , 2011, Data Mining and Knowledge Discovery.

[13]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[14]  T. Vicsek,et al.  Weighted network modules , 2007, cond-mat/0703706.

[15]  Steve Gregory,et al.  Finding Overlapping Communities Using Disjoint Community Detection Algorithms , 2009, CompleNet.

[16]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Christopher M. Bishop,et al.  Bayesian PCA , 1998, NIPS.

[18]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[19]  Jean-Charles Delvenne,et al.  Stability of graph communities across time scales , 2008, Proceedings of the National Academy of Sciences.

[20]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[21]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[22]  T. Vicsek,et al.  Directed network modules , 2007, physics/0703248.

[23]  Muhammad Usman,et al.  Signal processing with adaptive sparse structured representations (SPARS'09) , 2009 .

[24]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[26]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[28]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.