LinkLPA: A Link‐Based Label Propagation Algorithm for Overlapping Community Detection in Networks

Community detection is an important methodology for understanding the intrinsic structure and function of complex networks. Because overlapping community is one of the characteristics of real‐world networks and should be considered for community detection, in this article, we propose an algorithm, called link‐based label propagation algorithm (LinkLPA), to detect overlapping communities. Because the link partition is conceptually natural for the problem of overlapping community detection, LinkLPA first transforms node partition problem into link partition problem and employs a new label propagation algorithm with preference on links instead of nodes to detect communities due to the simplicity and efficiency of label propagation algorithm. Then the proposed LinkLPA performs a postprocessing to refine the detected overlapping communities by avoiding over‐overlapping and incorrect partition of weak ties. Experimental results on a large number of real‐world and synthetic networks show that the proposed method achieves high accuracy on detecting overlapping communities in networks.

[1]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[3]  Vincenza Carchiolo,et al.  Extending modularity definition for directed graphs with overlapping communities , 2008 .

[4]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[5]  J. Sambles,et al.  Slow waves caused by cuts perpendicular to a single subwavelength slit in metal , 2007 .

[6]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[7]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[8]  Boleslaw K. Szymanski,et al.  On Measuring the Quality of a Network Community Structure , 2013, 2013 International Conference on Social Computing.

[9]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[10]  Dino Pedreschi,et al.  DEMON: a local-first discovery method for overlapping communities , 2012, KDD.

[11]  R. Folk,et al.  Poisson‐Voronoi核形成と成長変形における分域構造の時間発展:一次元と三次元の結果 , 2008 .

[12]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[13]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Andrea Baronchelli,et al.  Effects of mobility on ordering dynamics , 2009, 0902.1916.

[15]  L. C. Barbosa,et al.  Raman, hyperraman, hyper-Rayleigh, two-photon excited luminescence and morphology-dependent-modes in a single optical tweezers system , 2005 .

[16]  Jiawei Han,et al.  gSkeletonClu: Density-Based Network Clustering via Structure-Connected Tree Division or Agglomeration , 2010, 2010 IEEE International Conference on Data Mining.

[17]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[18]  Gregor E. Morfill,et al.  Effect of polarization force on the propagation of dust acoustic solitary waves , 2010 .

[19]  Thomas A. Schreiber,et al.  The University of South Florida free association, rhyme, and word fragment norms , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[20]  Jianbin Huang,et al.  Towards Online Multiresolution Community Detection in Large-Scale Networks , 2011, PloS one.

[21]  Boleslaw K. Szymanski,et al.  A New Metric for Quality of Network Community Structure , 2015, ArXiv.

[22]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Gregor E. Morfill,et al.  Spectroscopic evaluation of the effect of the microparticles on radiofrequency argon plasma , 2009 .

[25]  Malik Magdon-Ismail,et al.  Efficient Identification of Overlapping Communities , 2005, ISI.

[26]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Qiang Wang,et al.  Topic oriented community detection through social objects and link analysis in social networks , 2012, Knowl. Based Syst..

[28]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[29]  G. J. Fleer,et al.  Stationary dynamics approach to analytical approximations for polymer coexistence curves. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[31]  Xiaoming Liu,et al.  SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[32]  James Bailey,et al.  Information theoretic measures for clusterings comparison: is a correction for chance necessary? , 2009, ICML '09.

[33]  Fergal Reid,et al.  Detecting highly overlapping community structure by greedy clique expansion , 2010, KDD 2010.

[34]  Boleslaw K. Szymanski,et al.  Extension of Modularity Density for overlapping community structure , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[35]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[36]  M. Markus,et al.  Fluctuation theorem for a deterministic one-particle system. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[38]  Rolf T. Wigand,et al.  Community Detection in Complex Networks: Multi-objective Enhanced Firefly Algorithm , 2013, Knowl. Based Syst..

[39]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[40]  Bin Wu,et al.  A link clustering based overlapping community detection algorithm , 2013, Data Knowl. Eng..

[41]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  L. Bécu,et al.  Evidence for three-dimensional unstable flows in shear-banding wormlike micelles. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[44]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[45]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[46]  T. Vicsek,et al.  Weighted network modules , 2007, cond-mat/0703706.

[47]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[48]  Dong Zhou,et al.  Translation techniques in cross-language information retrieval , 2012, CSUR.

[49]  V. Carchiolo,et al.  Extending the definition of modularity to directed graphs with overlapping communities , 2008, 0801.1647.

[50]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[51]  Jie Tang,et al.  Detecting Community Kernels in Large Social Networks , 2011, 2011 IEEE 11th International Conference on Data Mining.