The greedy coupled-seeds expansion method for the overlapping community detection in social networks

Community identification refers to the process of selecting dense clusters with sparse connections to the rest of the graph. These communities naturally overlap in many social networks, while each node can participate in several communities. Revealing these structures represents an important task in network analysis. In fact, it allows to understand the features of networks. However, many community detection approaches fail to provide communities with high quality and better ground truth correspondence in reasonable execution time. The key idea of this paper is to propose a coupled-seed expansion method for the overlapping community detection. Specifically, we construct a coupled-seed by choosing a node and its most similar neighbor and then expand this coupled-seed using a fitness function that improves the identification of local communities. The overlapping modularity, the F-score and the extended normalized mutual information measures are used to evaluate the proposed algorithm. The experimental results on 10 instances of real networks and four sets of LFR networks prove that the proposed method is effective and outperforms the existing algorithms (BigClam, OSLOM, SE, Demon, UMSTMO, LC, Ego-Splitting).

[1]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[2]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[3]  Dino Pedreschi,et al.  DEMON: a local-first discovery method for overlapping communities , 2012, KDD.

[4]  Asgarali Bouyer,et al.  LP-LPA: A link influence-based label propagation algorithm for discovering community structures in networks , 2017 .

[5]  Shi Dong,et al.  Improved label propagation algorithm for overlapping community detection , 2020, Computing.

[6]  Yuefeng Li,et al.  A new attributed graph clustering by using label propagation in complex networks , 2020, J. King Saud Univ. Comput. Inf. Sci..

[7]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[8]  Dorothea Wagner,et al.  Structure-preserving sparsification of social networks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[9]  Santo Fortunato,et al.  Finding Statistically Significant Communities in Networks , 2010, PloS one.

[10]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[11]  Konstantin Avrachenkov,et al.  Cooperative Game Theory Approaches for Network Partitioning , 2017, COCOON.

[12]  Mahdi Vasighi,et al.  Community Detection in Complex Networks by Detecting and Expanding Core Nodes Through Extended Local Similarity of Nodes , 2018, IEEE Transactions on Computational Social Systems.

[13]  Inderjit S. Dhillon,et al.  Overlapping Community Detection Using Neighborhood-Inflated Seed Expansion , 2015, IEEE Transactions on Knowledge and Data Engineering.

[14]  Mohamed El Marraki,et al.  Overlapping community detection based on the union of all maximum spanning trees , 2020, Libr. Hi Tech.

[15]  Jure Leskovec,et al.  Overlapping community detection at scale: a nonnegative matrix factorization approach , 2013, WSDM.

[16]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[19]  Silvio Lattanzi,et al.  Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters , 2017, KDD.

[20]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[21]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[22]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[23]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[24]  Xiaoming Liu,et al.  SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[25]  Inderjit S. Dhillon,et al.  Overlapping community detection using seed set expansion , 2013, CIKM.

[26]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[27]  Malik Magdon-Ismail,et al.  Finding communities by clustering a graph into overlapping subgraphs , 2005, IADIS AC.

[28]  Duanbing Chen,et al.  Detecting overlapping communities of weighted networks via a local algorithm , 2010 .