CC-GA: A clustering coefficient based genetic algorithm for detecting communities in social networks

Abstract A community structure is an integral part of a social network. Detecting such communities plays an important role in a wide range of applications, including but not limited to cluster analysis, recommendation systems and understanding the behaviour of complex systems. Researchers have derived many algorithms to discover the community structures of networks. Discovering communities is a challenging task, and there is no single algorithm that produces the best results for all networks. Therefore, despite many elegant solutions, discovering communities remains an active area of research. In this paper, we propose a novel algorithm, the Clustering Coefficient-based Genetic Algorithm (CC-GA), for detecting them in social and complex networks. Researchers have used several genetic algorithms to detect communities, but the proposed algorithm is novel in terms of both the generation of the initial population and the mutation method, and these improve its efficiency and accuracy. Experiments on a variety of real-world datasets and a comparison to state-of-the-art genetic and non-genetic-based algorithms show improved results.

[1]  Onaiza Maqbool,et al.  Network analysis of software change history for understanding software evolution , 2014, 17th IEEE International Multi Topic Conference 2014.

[2]  Erfan Khaji,et al.  A Heuristic Method to Generate Better Initial Population for Evolutionary Methods , 2014, ArXiv.

[3]  Haluk Bingol,et al.  Community Detection in Complex Networks Using Genetic Algorithms , 2006, 0711.0491.

[4]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Gang Liu,et al.  A genetic algorithm for community detection in complex networks , 2013, Journal of Central South University.

[6]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[7]  R. Guimerà,et al.  The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Ronghua Shang,et al.  Community detection based on modularity and an improved genetic algorithm , 2013 .

[9]  Mubashar Mushtaq,et al.  Saving lives using social media: Analysis of the role of twitter for personal blood donation requests and dissemination , 2017, Telematics Informatics.

[10]  José Ramón Cano,et al.  CommuniMents: A Framework for Detecting Community Based Sentiments for Events , 2017, Int. J. Semantic Web Inf. Syst..

[11]  Hema Banati,et al.  Modeling Evolutionary Group Search Optimization Approach for Community Detection in Social Networks , 2015, WCI '15.

[12]  Kuru Ratnavelu,et al.  A semi-synchronous label propagation algorithm with constraints for community detection in complex networks , 2017, Scientific Reports.

[13]  Feng Zou,et al.  Community detection in complex networks: Multi-objective discrete backtracking search optimization algorithm with decomposition , 2017, Appl. Soft Comput..

[14]  Neda Binesh,et al.  Fuzzy clustering in community detection based on nonnegative matrix factorization with two novel evaluation criteria , 2017, Appl. Soft Comput..

[15]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[16]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[17]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[18]  A. Orłowski,et al.  Genetic Algorithms Approach to Community Detection , 2010 .

[19]  Yike Guo,et al.  Fast graph clustering with a new description model for community detection , 2017, Inf. Sci..

[20]  Xingpeng Jiang,et al.  Discovering communities in complex networks by edge label propagation , 2016, Scientific Reports.

[21]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[22]  Xingyi Zhang,et al.  Overlapping Community Detection based on Network Decomposition , 2016, Scientific Reports.

[23]  Hongtao Lu,et al.  Adaptive Overlapping Community Detection with Bayesian NonNegative Matrix Factorization , 2017, DASFAA.

[24]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[25]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[27]  Matteo Pellegrini,et al.  Detecting Communities Based on Network Topology , 2014, Scientific Reports.

[28]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[29]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[30]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Aboul Ella Hassanien,et al.  Genetic Algorithms for community detection in social networks , 2012, 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA).

[32]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[33]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[34]  Dumitru Dumitrescu,et al.  Community Detection in Complex Networks Using Collaborative Evolutionary Algorithms , 2007, ECAL.

[35]  Zhao Yang,et al.  A Comparative Analysis of Community Detection Algorithms on Artificial Networks , 2016, Scientific Reports.

[36]  Ali Emrouznejad,et al.  The value of indirect ties in citation networks: SNA analysis with OWA operator weights , 2015, Inf. Sci..

[37]  Matthieu Latapy,et al.  Main-memory triangle computations for very large (sparse (power-law)) graphs , 2008, Theor. Comput. Sci..

[38]  Haifeng Du,et al.  A genetic algorithm with local search strategy for improved detection of community structure , 2010 .

[39]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[40]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Wei Li,et al.  Stepping community detection algorithm based on label propagation and similarity , 2017 .

[42]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[43]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[44]  Peng Wu,et al.  Multi-Objective Community Detection Based on Memetic Algorithm , 2015, PloS one.

[45]  Clara Pizzuti,et al.  Community detection in social networks with genetic algorithms , 2008, GECCO '08.

[46]  Konstantin Avrachenkov,et al.  Cooperative Game Theory Approaches for Network Partitioning , 2017, COCOON.

[47]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[48]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[49]  Regina Berretta,et al.  MA-Net: A reliable memetic algorithm for community detection by modularity optimization , 2015 .

[50]  Kun Deng,et al.  Community Detection in Complex Networks based on Improved Genetic Algorithm and Local Optimization , 2016 .

[51]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[52]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[53]  Xiao Liu,et al.  Semi-supervised community detection based on non-negative matrix factorization with node popularity , 2017, Inf. Sci..

[54]  Curt Jones,et al.  Finding Good Approximate Vertex and Edge Partitions is NP-Hard , 1992, Inf. Process. Lett..

[55]  Ali Hamzeh,et al.  A heuristic local community detection method (HLCD) , 2016, Applied Intelligence.

[56]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[58]  Hadi Zare,et al.  IEDC: An integrated approach for overlapping and non-overlapping community detection , 2016, Knowl. Based Syst..

[59]  Ali Daud,et al.  Finding Rising Stars in Co-Author Networks via Weighted Mutual Influence , 2017, WWW.

[60]  Mariá Cristina Vasconcelos Nascimento,et al.  GA-LP: A genetic algorithm based on Label Propagation to detect communities in directed networks , 2017, Expert Syst. Appl..

[61]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[62]  Santo Fortunato,et al.  Consensus clustering in complex networks , 2012, Scientific Reports.

[63]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[64]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[65]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[66]  Alexandre Hollocou,et al.  A linear streaming algorithm for community detection in very large networks , 2017, ArXiv.

[67]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[68]  A. Arenas,et al.  Models of social networks based on social distance attachment. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[69]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[70]  Di Dong,et al.  Evolutionary Nonnegative Matrix Factorization Algorithms for Community Detection in Dynamic Networks , 2017, IEEE Transactions on Knowledge and Data Engineering.

[71]  Xun Wang,et al.  Modularity optimization by Global-Local Search , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).