A Multi-Objective Community Detection Algorithm for Directed Network Based on Random Walk

Uncovering community structure is an important technique for studying complex networks. While a large bulk of algorithms have been proposed for community detection in recent years, most of them were designed for undirected networks. Considering many real-world networks are by nature directed, it is necessary to develop community detection methods that can handle directed networks. In this work, we formulates a multi-objective framework for community detection in directed networks and proposes a multi-objective evolutionary algorithm for finding efficient solutions under this framework. Specifically, based on the theory that an efficient partition of directed networks should have larger network information flow within the community than that between different communities, we first designed two conflicting objective functions based on PageRank random walk, one of which is to maximize within-community transition probability, and the other is to minimize between-community transition probability. By optimizing these two objectives simultaneously, we modelled the problem of community detection as a multi-objective optimization problem, and then developed a novel multi-objective evolutionary algorithm to solve it. Particularly, to guarantee the capability of searching the optimal solution, our proposed method designed/adopted the directed-network-specific population initialization method and evolutionary operator by introducing label propagation algorithm into multi-objective genetic algorithm. Comparison with other four art-of-the-state algorithms, our method showed the competitive performance on both synthetic and real-world networks. Moreover, attributing to the multi-objective framework, the proposed method could generate multiple optimal network partitions in a single run, which provides a hierarchical description of community structure of the network.

[1]  David Gleich Hierarchical Directed Spectral Graph Partitioning MS&E 337 - Information Networks , 2006 .

[2]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[3]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[4]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[5]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[6]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[7]  Youngdo Kim,et al.  Finding communities in directed networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  David W. Corne,et al.  Approximating the Nondominated Front Using the Pareto Archived Evolution Strategy , 2000, Evolutionary Computation.

[9]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[11]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[12]  Dale Schuurmans,et al.  Web Communities Identification from Random Walks , 2006, PKDD.

[13]  Bin Wu,et al.  Multi-objective community detection in complex networks , 2012, Appl. Soft Comput..

[14]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Srinivasan Parthasarathy,et al.  Symmetrizations for clustering directed graphs , 2011, EDBT/ICDT '11.

[16]  Jianbo Shi,et al.  A Random Walks View of Spectral Segmentation , 2001, AISTATS.

[17]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[18]  Amy Nicole Langville,et al.  A Reordering for the PageRank Problem , 2005, SIAM J. Sci. Comput..

[19]  E A Leicht,et al.  Community structure in directed networks. , 2007, Physical review letters.

[20]  Sergio Gómez,et al.  Size reduction of complex networks preserving modularity , 2007, ArXiv.

[21]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[22]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Jianbo Shi,et al.  Grouping with Directed Relationships , 2001, EMMCVPR.

[24]  Dinggang Shen,et al.  First-year development of modules and hubs in infant brain functional networks , 2019, NeuroImage.

[25]  Hongtao Lu,et al.  Finding communities in directed networks by PageRank random walk induced network embedding , 2010 .

[26]  Xue Li,et al.  Directed LPA: Propagating labels in directed networks , 2019, Physics Letters A.

[27]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[31]  Mariá Cristina Vasconcelos Nascimento,et al.  GA-LP: A genetic algorithm based on Label Propagation to detect communities in directed networks , 2017, Expert Syst. Appl..

[32]  Maoguo Gong,et al.  A multiobjective optimization method based on MOEA/D and fuzzy clustering for change detection in SAR images , 2014, 2014 IEEE Congress on Evolutionary Computation (CEC).

[33]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[34]  Padhraic Smyth,et al.  A Spectral Clustering Approach To Finding Communities in Graph , 2005, SDM.

[35]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[37]  Marco Laumanns,et al.  SPEA2: Improving the Strength Pareto Evolutionary Algorithm For Multiobjective Optimization , 2002 .

[38]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[39]  Qingfu Zhang,et al.  Community detection in networks by using multiobjective evolutionary algorithm with decomposition , 2012 .

[40]  Clara Pizzuti,et al.  Evolutionary Computation for Community Detection in Networks: A Review , 2018, IEEE Transactions on Evolutionary Computation.

[41]  T. Vicsek,et al.  Directed network modules , 2007, physics/0703248.