A Local-Neighborhood Information Based Overlapping Community Detection Algorithm for Large-Scale Complex Networks

As the size of available networks is continuously increasing (even with millions of nodes), large-scale complex networks are receiving significant attention. While existing overlapping-community detection algorithms are quite effective in analyzing complex networks, most of these algorithms suffer from scalability issues when applied to large-scale complex networks, which can have more than 1,000,000 nodes. To address this problem, we propose an efficient local-expansion-based overlapping-community detection algorithm using local-neighborhood information (OCLN). During the iterative expansion process, only neighbors of nodes added in the last iteration (rather than all neighbors) are considered to determine whether they can join the community. This significantly reduces the computational cost and enhances the scalability for community detection in large-scale networks. A belonging coefficient is also proposed in OCLN to filter out incorrectly identified nodes. Theoretical analysis demonstrates that the computational complexity of the proposed OCLN is linear with respect to the size of the network to be detected. Experiments on large-scale LFR benchmark and real-world networks indicate the effectiveness of OCLN for overlapping-community detection in large-scale networks, in terms of both computational efficiency and detected-community quality.

[1]  D. Parisi,et al.  Self-contained algorithms to detect communities in networks , 2004 .

[2]  Srinivasan Parthasarathy,et al.  Identifying functional modules in interaction networks through overlapping Markov clustering , 2012, Bioinform..

[3]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[4]  L. Jiao,et al.  Adjusting from disjoint to overlapping community detection of complex networks , 2009 .

[5]  Sanghamitra Bandyopadhyay,et al.  FOCS: Fast Overlapped Community Search , 2015, IEEE Transactions on Knowledge and Data Engineering.

[6]  Xuelong Li,et al.  Constrained Nonnegative Matrix Factorization for Image Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[8]  Bin Wu,et al.  A link clustering based overlapping community detection algorithm , 2013, Data Knowl. Eng..

[9]  Jianbin Huang,et al.  Towards Online Multiresolution Community Detection in Large-Scale Networks , 2011, PloS one.

[10]  Bin Wu,et al.  A Method for Local Community Detection by Finding Core Nodes , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[11]  Dong Liu,et al.  Fuzzy overlapping community detection based on local random walk and multidimensional scaling , 2013 .

[12]  Xiaoming Liu,et al.  SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[13]  Ignacio Marín,et al.  SurpriseMe: an integrated tool for network community structure characterization using Surprise maximization , 2013, Bioinform..

[14]  Nam P. Nguyen,et al.  Adaptive algorithms for detecting community structure in dynamic social networks , 2011, 2011 Proceedings IEEE INFOCOM.

[15]  Johan A. K. Suykens,et al.  Kernel spectral clustering for community detection in complex networks , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[16]  Aihua Li,et al.  Fast and Accurate Mining the Community Structure: Integrating Center Locating and Membership Optimization , 2016, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yuan Jie,et al.  Community Detection in Complex Networks: Algorithms and Analysis , 2014, ISCTCS.

[18]  Hui-Jia Li,et al.  Social significance of community structure: Statistical view , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  M. Newman Communities, modules and large-scale structure in networks , 2011, Nature Physics.

[20]  Sanjukta Bhowmick,et al.  GenPerm: A Unified Method for Detecting Non-Overlapping and Overlapping Communities , 2016, IEEE Transactions on Knowledge and Data Engineering.

[21]  Fergal Reid,et al.  Detecting highly overlapping community structure by greedy clique expansion , 2010, KDD 2010.

[22]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[23]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[24]  Guy Melançon,et al.  Model for generating artificial social networks having community structures with small-world and scale-free properties , 2013, Social Network Analysis and Mining.

[25]  Tomas Olovsson,et al.  A local seed selection algorithm for overlapping community detection , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[26]  Byung Ro Moon,et al.  Multiobjective evolutionary algorithms for dynamic social network clustering , 2010, GECCO '10.

[27]  Yilong Yin,et al.  A Maximal Clique Based Multiobjective Evolutionary Algorithm for Overlapping Community Detection , 2017, IEEE Transactions on Evolutionary Computation.

[28]  Giovanni Montana,et al.  Community detection in multiplex networks using Locally Adaptive Random walks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[29]  E. Blanzieri,et al.  Link Clustering with Extended Link Similarity and EQ Evaluation Division , 2013, PloS one.

[30]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[31]  J. Kumpula,et al.  Sequential algorithm for fast clique percolation. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[33]  Dayou Liu,et al.  Hierarchical community detection with applications to real-world network analysis , 2013, Data Knowl. Eng..

[34]  Clara Pizzuti,et al.  Algorithms and tools for protein-protein interaction networks clustering, with a special focus on population-based stochastic methods , 2014, Bioinform..

[35]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[36]  Xiaomei Quan,et al.  Survey: Functional Module Detection from Protein-Protein Interaction Networks , 2014, IEEE Transactions on Knowledge and Data Engineering.

[37]  Yong Wang,et al.  Community structure detection based on Potts model and network's spectral characterization , 2012 .

[38]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[39]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[40]  Malik Magdon-Ismail,et al.  Finding communities by clustering a graph into overlapping subgraphs , 2005, IADIS AC.

[41]  Hao Wang,et al.  Measuring robustness of community structure in complex networks , 2014, ArXiv.

[42]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[44]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[45]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[46]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[48]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[49]  Javier Montero,et al.  A Divide-and-Link algorithm for hierarchical clustering in networks , 2015, Inf. Sci..

[50]  Inderjit S. Dhillon,et al.  Overlapping Community Detection Using Neighborhood-Inflated Seed Expansion , 2015, IEEE Transactions on Knowledge and Data Engineering.

[51]  Guanrong Chen,et al.  Complex networks: small-world, scale-free and beyond , 2003 .

[52]  Enrico Gregori,et al.  Parallel $(k)$-Clique Community Detection on Large-Scale Networks , 2013, IEEE Transactions on Parallel and Distributed Systems.

[53]  Huiru Zheng,et al.  An improved random walk based clustering algorithm for community detection in complex networks , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[54]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  Fergal Reid,et al.  Percolation Computation in Complex Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[56]  Fergal Reid,et al.  Seeding for pervasively overlapping communities , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[58]  Xin Yu,et al.  A semantic overlapping community detection algorithm based on field sampling , 2015, Expert Syst. Appl..

[59]  Inderjit S. Dhillon,et al.  Overlapping community detection using seed set expansion , 2013, CIKM.

[60]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[61]  Qiong Chen,et al.  Detecting local community structures in complex networks based on local degree central nodes , 2013 .

[62]  Andreas W. M. Dress,et al.  A spectral clustering-based framework for detecting community structures in complex networks , 2009, Appl. Math. Lett..