Agglomerative Clustering Based on Label Propagation for Detecting Overlapping and Hierarchical Communities in Complex Networks

Community detection is an important issue to understand the structural and functional properties of complex networks, which still remains a challenging subject. In some complex networks, a node may belong to multiple communities, implying overlapping community structure. Moreover, complex networks often show a hierarchical structure where small communities group together to form larger ones. In this paper, we propose a novel parameter-free algorithm called agglomerative clustering based on label propagation algorithm (ACLPA) to detect both overlapping and hierarchical community structure in complex networks. By combining the advantages of agglomerative clustering and label propagation, our algorithm can build the hierarchical tree of overlapping communities in large-scale networks. The tests on both synthetic and real-world networks give excellent results and demonstrate the effectiveness and efficiency of our algorithm.

[1]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[3]  S. Strogatz Exploring complex networks , 2001, Nature.

[4]  David F. Gleich,et al.  Neighborhoods are good communities , 2011, ArXiv.

[5]  A. Arenas,et al.  Models of social networks based on social distance attachment. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[7]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[8]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[9]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  M. Barber,et al.  Detecting network communities by propagating labels under constraints. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Jiawei Han,et al.  Density-based shrinkage for revealing hierarchical and overlapping community structure in networks , 2011 .

[15]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[16]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[17]  Fergal Reid,et al.  Detecting highly overlapping community structure by greedy clique expansion , 2010, KDD 2010.

[18]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[19]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[20]  Marko Bajec,et al.  Unfolding communities in large complex networks: Combining defensive and offensive label propagation for core extraction , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[22]  Boleslaw K. Szymanski,et al.  Towards Linear Time Overlapping Community Detection in Social Networks , 2012, PAKDD.

[23]  Marko Bajec,et al.  Robust network community detection using balanced propagation , 2011, ArXiv.

[24]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[26]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[27]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[29]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Alex Arenas,et al.  Synchronization reveals topological scales in complex networks. , 2006, Physical review letters.

[31]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[32]  David Lusseau,et al.  The emergent properties of a dolphin social network , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[33]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[34]  V. Carchiolo,et al.  Extending the definition of modularity to directed graphs with overlapping communities , 2008, 0801.1647.

[35]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Fergal Reid,et al.  Seeding for pervasively overlapping communities , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  T. S. Evans,et al.  Clique graphs and overlapping communities , 2010, ArXiv.

[38]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[39]  Pietro Liò,et al.  Towards real-time community detection in large networks. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[41]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[42]  Huawei Shen,et al.  Quantifying and identifying the overlapping community structure in networks , 2009, 0905.2666.

[43]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[44]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[45]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Santosh S. Vempala,et al.  On clusterings: Good, bad and spectral , 2004, JACM.

[47]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.