A social community detection algorithm based on parallel grey label propagation

Community detection is one of the important methods for understanding the mechanism behind the function of social networks. The recently developed label propagation algorithm (LPA) has been gaining increasing attention because of its excellent characteristics, such as a succinct framework, linear time and space complexity, easy parallelization, etc. However, several limitations of the LPA algorithm, including random label initialization and greedy label updating, hinder its application to complex networks. A new parallel LPA is proposed in this study. First, grey relational analysis is integrated into the label updating process, which is based on vertex similarity. Second, parallel propagation steps are comprehensively studied to utilize parallel computation power efficiently. Third, randomness in label updating is significantly reduced via automatic label selection and label weight thresholding. Experiments conducted on artificial and real social networks demonstrate that the proposed algorithm is scalable and exhibits high clustering accuracy.

[1]  Qishan Zhang,et al.  Detecting communities in social networks by local affinity propagation with grey relational analysis , 2015, Grey Syst. Theory Appl..

[2]  Kun Guo,et al.  Parallel Multi-label Propagation for Overlapping Community Detection in Large-Scale Networks , 2015, MIWAI.

[3]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Weizhong Zhao,et al.  PSCAN: A Parallel Structural Clustering Algorithm for Big Networks in MapReduce , 2013, 2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA).

[5]  Zhen Lin,et al.  CK-LPA: Efficient community detection algorithm based on label propagation with community kernel , 2014 .

[6]  Huang Jian Mining Hierarchical Community Structure Within Networks from Density-Connected Traveling Orders , 2011 .

[7]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[8]  Michael J. Franklin,et al.  Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.

[9]  Qishan Zhang,et al.  Community discovery by propagating local and global information based on the MapReduce model , 2015, Inf. Sci..

[10]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[11]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Huaiyu Wan,et al.  Balanced Multi-Label Propagation for Overlapping Community Detection in Social Networks , 2012, Journal of Computer Science and Technology.

[13]  Hao Lou,et al.  Detecting community structure using label propagation with weighted coherent neighborhood propinquity , 2013 .

[14]  Panagiotis Symeonidis,et al.  From biological to social networks: Link prediction based on multi-way spectral clustering , 2013, Data Knowl. Eng..

[15]  Boleslaw K. Szymanski,et al.  Parallel Overlapping Community Detection with SLPA , 2013, 2013 International Conference on Social Computing.

[16]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[17]  Jianbin Huang,et al.  Mining Hierarchical Community Structure Within Networks from Density-Connected Traveling Orders: Mining Hierarchical Community Structure Within Networks from Density-Connected Traveling Orders , 2011 .

[18]  Han Zhao,et al.  Identifying influential nodes in complex networks with community structure , 2013, Knowl. Based Syst..

[19]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[20]  Boleslaw K. Szymanski,et al.  Towards Linear Time Overlapping Community Detection in Social Networks , 2012, PAKDD.

[21]  Junhua Tang,et al.  Detecting community structure via synchronous label propagation , 2015, Neurocomputing.

[22]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[23]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[24]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.