Overlapping Community Detection in Directed and Undirected Attributed Networks Using a Multiobjective Evolutionary Algorithm

In many real-world networks, the structural connections of networks and the attributes about each node are always available. We typically call such graphs attributed networks, in which attributes always play the same important role in community detection as the topological structure. It is shown that the very existence of overlapping communities is one of the most important characteristics of various complex networks, while the majority of the existing community detection methods was designed for detecting separated communities in attributed networks. Therefore, it is quite challenging to detect meaningful overlapping structures with the combination of node attributes and topological structures. Therefore, in this article, we propose a multiobjective evolutionary algorithm based on the similarity attribute for overlapping community detection in attributed networks (MOEA-SA<inline-formula> <tex-math notation="LaTeX">$_{OV}$ </tex-math></inline-formula>). In MOEA-SA<inline-formula> <tex-math notation="LaTeX">$_{OV}$ </tex-math></inline-formula>, a modified extended modularity <inline-formula> <tex-math notation="LaTeX">$EQ_{OV}$ </tex-math></inline-formula>, dealing with both directed and undirected networks, is well designed as the first objective. Another objective employed is the attribute similarity <inline-formula> <tex-math notation="LaTeX">$S_{A}$ </tex-math></inline-formula>. Then, a novel encoding and decoding strategy is designed to realize the goal of representing overlapping communities efficiently. MOEA-SA<inline-formula> <tex-math notation="LaTeX">$_{OV}$ </tex-math></inline-formula> runs under the framework of the nondominated sorting genetic <xref ref-type="algorithm" rid="alg2">algorithm II</xref> (NSGA-II) and can automatically determine the number of communities. In the experiments, the performance of MOEA-SA<inline-formula> <tex-math notation="LaTeX">$_{OV}$ </tex-math></inline-formula> is validated on both synthetic and real-world networks, and the experimental results demonstrate that our method can effectively find Pareto fronts about overlapping community structures with practical significance in both directed and undirected attributed networks.

[1]  John Doherty,et al.  Offline Data-Driven Evolutionary Optimization Using Selective Surrogate Ensembles , 2019, IEEE Transactions on Evolutionary Computation.

[2]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Bin Wu,et al.  A Method for Local Community Detection by Finding Core Nodes , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[4]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[5]  Youngdo Kim,et al.  Finding communities in directed networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[8]  Yilong Yin,et al.  A Maximal Clique Based Multiobjective Evolutionary Algorithm for Overlapping Community Detection , 2017, IEEE Transactions on Evolutionary Computation.

[9]  Srinivasan Parthasarathy,et al.  Symmetrizations for clustering directed graphs , 2011, EDBT/ICDT '11.

[10]  Ye Tian,et al.  A Knee Point-Driven Evolutionary Algorithm for Many-Objective Optimization , 2015, IEEE Transactions on Evolutionary Computation.

[11]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[12]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[13]  Peter J. Fleming,et al.  Multiobjective optimization and multiple constraint handling with evolutionary algorithms. I. A unified formulation , 1998, IEEE Trans. Syst. Man Cybern. Part A.

[14]  Yang Wang,et al.  Overlapping community detection through an improved multi-objective quantum-behaved particle swarm optimization , 2015, Journal of Heuristics.

[15]  Hong Cheng,et al.  Clustering Large Attributed Graphs: An Efficient Incremental Approach , 2010, 2010 IEEE International Conference on Data Mining.

[16]  Clara Pizzuti,et al.  Overlapped community detection in complex networks , 2009, GECCO.

[17]  Pavlos I. Lazaridis,et al.  A Parallel Surrogate Model Assisted Evolutionary Algorithm for Electromagnetic Design Optimization , 2019, IEEE Transactions on Emerging Topics in Computational Intelligence.

[18]  Hisao Ishibuchi,et al.  A multi-objective genetic local search algorithm and its application to flowshop scheduling , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[19]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[20]  J. Liu,et al.  Traveling salesman problems with PageRank Distance on complex networks reveal community structure , 2016 .

[21]  Witold Pedrycz,et al.  Flexible Linguistic Expressions and Consensus Reaching With Accurate Constraints in Group Decision-Making , 2020, IEEE Transactions on Cybernetics.

[22]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Xingyi Zhang,et al.  A Mixed Representation-Based Multiobjective Evolutionary Algorithm for Overlapping Community Detection , 2017, IEEE Transactions on Cybernetics.

[24]  Jun Zhang,et al.  A Distributed Swarm Optimizer With Adaptive Communication for Large-Scale Optimization , 2020, IEEE Transactions on Cybernetics.

[25]  Clara Pizzuti,et al.  GA-Net: A Genetic Algorithm for Community Detection in Social Networks , 2008, PPSN.

[26]  Hussein A. Abbass,et al.  Separated and overlapping community detection in complex networks using multiobjective Evolutionary Algorithms , 2010, IEEE Congress on Evolutionary Computation.

[27]  Malik Magdon-Ismail,et al.  Defining and Discovering Communities in Social Networks , 2012 .

[28]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[29]  Feng Zhao,et al.  A Cooperative Co-Evolutionary Approach to Large-Scale Multisource Water Distribution Network Optimization , 2019, IEEE Transactions on Evolutionary Computation.

[30]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[31]  Hong Cheng,et al.  A model-based approach to attributed graph clustering , 2012, SIGMOD Conference.

[32]  Zhangtao Li,et al.  A Multiobjective Evolutionary Algorithm Based on Structural and Attribute Similarities for Community Detection in Attributed Networks , 2018, IEEE Transactions on Cybernetics.

[33]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[35]  Emmanuel Viennet,et al.  Community Detection based on Structural and Attribute Similarities , 2012, ICDS 2012.

[36]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[37]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  E. Lazega Introduction : Collegial Phenomenon : The Social Mechanisms of Cooperation Among Peers in a Corporate Law Partnership , 2001 .

[39]  Maoguo Gong,et al.  Memetic algorithm for community detection in networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Hong Cheng,et al.  GBAGC: A General Bayesian Framework for Attributed Graph Clustering , 2014, TKDD.

[41]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[42]  Hocine Cherifi,et al.  Comparative evaluation of community detection algorithms: a topological approach , 2012, ArXiv.

[43]  Handing Wang,et al.  A Random Forest-Assisted Evolutionary Algorithm for Data-Driven Constrained Multiobjective Combinatorial Optimization of Trauma Systems , 2020, IEEE Transactions on Cybernetics.

[44]  Hocine Cherifi,et al.  Community detection algorithm evaluation with ground-truth data , 2017, ArXiv.

[45]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[46]  Xiaoming Liu,et al.  SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[47]  Haluk Bingol,et al.  Community Detection in Complex Networks Using Genetic Algorithms , 2006, 0711.0491.

[48]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[49]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[50]  Jun Zhang,et al.  Multimodal Estimation of Distribution Algorithms , 2017, IEEE Transactions on Cybernetics.