Evolutionary clustering algorithm for community detection using graph-based information

The problem of community detection has become highly relevant due to the growing interest in social networks. The information contained in a social network is often represented as a graph. The idea of graph partitioning of graph theory can be apply to split a graph into node groups based on its topology information. In this paper the problem of detecting communities within a social network is handled applying graph clustering algorithms based on this idea. The new approach proposed is based on a genetic algorithm. A new fitness function has been designed to guide the clustering process combining different measures of network topology (Density, Centralization, Heterogeneity, Neighbourhood, Clustering Coefficient). These different network measures have been experimentally tested using a real-world social network. Experimental results show that the proposed approach is able to detect communities and the results obtained in previous work have been improved.

[1]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[2]  David F. Barrero,et al.  A Multi-Objective Genetic Graph-Based Clustering algorithm with memory optimization , 2013, 2013 IEEE Congress on Evolutionary Computation.

[3]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Clara Pizzuti,et al.  GA-Net: A Genetic Algorithm for Community Detection in Social Networks , 2008, PPSN.

[5]  Albert-László Barabási,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW , 2004 .

[6]  Victor Ginsburgh,et al.  The Eurovision song contest: is voting political or cultural? , 2008 .

[7]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[8]  Panos M. Pardalos,et al.  The maximum clique problem , 1994, J. Glob. Optim..

[9]  Samia Nefti-Meziani,et al.  On the use of divergence distance in fuzzy clustering , 2008, Fuzzy Optim. Decis. Mak..

[10]  T. Vicsek,et al.  Clique percolation in random networks. , 2005, Physical review letters.

[11]  Daniel J. Fenn,et al.  How does Europe Make Its Mind Up? Connections, cliques, and compatibility between countries in the Eurovision Song Contest , 2005, physics/0505071.

[12]  Gad Yair,et al.  ‘Unite Unite Europe’ The political and cultural structures of Europe as reflected in the Eurovision Song Contest , 1995 .

[13]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  David Camacho,et al.  Adaptive k-Means Algorithm for Overlapped Graph Clustering , 2012, Int. J. Neural Syst..

[15]  Jianzhuang Liu,et al.  A genetics-based approach to fuzzy clustering , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[16]  Angel Eduardo Muñoz Zavala,et al.  A hybrid system using PSO and data mining for determining the ranking of a new participant in Eurovision , 2008, GECCO '08.

[17]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[18]  David Camacho,et al.  A Genetic Graph-Based Clustering Algorithm , 2012, IDEAL.

[19]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[20]  S. E. Schaeffer Survey Graph clustering , 2007 .

[21]  Hongtao Lu,et al.  A very fast algorithm for detecting community structures in complex networks , 2007 .

[22]  David Camacho,et al.  Using the Clustering Coefficient to Guide a Genetic-Based Communities Finding Algorithm , 2011, IDEAL.

[23]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[24]  Alex Alves Freitas A Review of evolutionary Algorithms for Data Mining , 2008, Soft Computing for Knowledge Discovery and Data Mining.

[25]  Qingfu Zhang,et al.  Community detection in networks by using multiobjective evolutionary algorithm with decomposition , 2012 .

[26]  Sanghamitra Bandyopadhyay Genetic algorithms for clustering and fuzzy clustering , 2011, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[27]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[28]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[29]  Maurizio Naldi,et al.  A traffic-based evolutionary algorithm for network clustering , 2013, Appl. Soft Comput..

[30]  Hussein A. Abbass,et al.  Separated and overlapping community detection in complex networks using multiobjective Evolutionary Algorithms , 2010, IEEE Congress on Evolutionary Computation.

[31]  B. Wichmann The Eurovision song contest , 1985 .

[32]  Yangyang Li,et al.  A spectral clustering-based adaptive hybrid multi-objective harmony search algorithm for community detection , 2012, 2012 IEEE Congress on Evolutionary Computation.

[33]  Santosh S. Vempala,et al.  On clusterings-good, bad and spectral , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[34]  Xiaoying Tai,et al.  A hierarchical clustering algorithm based on fuzzy graph connectedness , 2006, Fuzzy Sets Syst..

[35]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Clustering , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[37]  Derek Gatherer,et al.  Comparison of Eurovision Song Contest Simulation with Actual Results Reveals Shifting Patterns of Collusive Voting Alliances , 2006, J. Artif. Soc. Soc. Simul..

[38]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[39]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Ronghua Shang,et al.  Community detection based on modularity and an improved genetic algorithm , 2013 .

[41]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.