Community mining using three closely joint techniques based on community mutual membership and refinement strategy

Abstract Community structure has become one of the central studies of the topological structure of complex networks in the past decades. Although many advanced approaches have been proposed to identify community structure, those state-of-the-art methods still lack efficiency in terms of a balance between stability, accuracy and computation time. Here, we propose an algorithm with different stages, called TJA-net, to efficiently identify communities in a large network with a good balance between accuracy, stability and computation time. First, we propose an initial labeling algorithm, called ILPA, combining K-nearest neighbor (KNN) and label propagation algorithm (LPA). To produce a number of sub-communities automatically, ILPA iteratively labels a node in a network using the labels of its adjacent nodes and their index of closeness. Next, we merge sub-communities using the mutual membership of two communities. Finally, a refinement strategy is designed for modifying the label of the wrongly clustered nodes at boundaries. In our approach, we propose and use modularity density as the objective function rather than the commonly used modularity. This can deal with the issue of the resolution limit for different network structures enhancing the result precision. We present a series of experiments with artificial and real data set and compare the results obtained by our proposed algorithm with the ones obtained by the state-of-the-art algorithms, which shows the effectiveness of our proposed approach. The experimental results on large-scale artificial networks and real networks illustrate the superiority of our algorithm.

[1]  Mark E. J. Newman,et al.  Spectral methods for network community detection and graph partitioning , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  A. Arenas,et al.  Models of social networks based on social distance attachment. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[4]  L. Jiao,et al.  A Novel Clonal Selection Algorithm for Community Detection in Complex Networks , 2015, Comput. Intell..

[5]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[6]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[7]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[8]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[9]  U. Brandes,et al.  Maximizing Modularity is hard , 2006, physics/0608255.

[10]  Clara Pizzuti,et al.  GA-Net: A Genetic Algorithm for Community Detection in Social Networks , 2008, PPSN.

[11]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[12]  Xingyuan Wang,et al.  Community detection using local neighborhood in complex networks , 2015 .

[13]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Peng Xiang,et al.  Automatic network clustering via density-constrained optimization with grouping operator , 2016, Appl. Soft Comput..

[15]  M. Barber,et al.  Detecting network communities by propagating labels under constraints. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[17]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Maoguo Gong,et al.  Memetic algorithm for community detection in networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Bin Wu,et al.  Multi-objective community detection in complex networks , 2012, Appl. Soft Comput..

[20]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[21]  S Boccaletti,et al.  Identification of network modules by optimization of ratio association. , 2006, Chaos.

[22]  Jingrui Zhang,et al.  A hybrid harmony search algorithm with differential evolution for day-ahead scheduling problem of a microgrid with consideration of power flow constraints , 2016 .

[23]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[25]  Hao Wang,et al.  A measure system of zero moment point using wearable inertial sensors , 2016, China Communications.

[26]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[27]  Yalin Chen,et al.  A hybrid particle swarm optimization with small population size to solve the optimal short-term hydro-thermal unit commitment problem , 2016 .

[28]  Tinghuai Ma,et al.  Social Network and Tag Sources Based Augmenting Collaborative Recommender System , 2015, IEICE Trans. Inf. Syst..

[29]  Xingming Sun,et al.  Efficient algorithm for k-barrier coverage based on integer linear programming , 2016, China Communications.

[30]  Konstantin Avrachenkov,et al.  Cooperative Game Theory Approaches for Network Partitioning , 2017, COCOON.

[31]  Lin Yanping,et al.  Web community detection model using particle swarm optimization , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[32]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[33]  Maoguo Gong,et al.  A survey on network community detection based on evolutionary computation , 2016, Int. J. Bio Inspired Comput..

[34]  Zhili Zhou,et al.  Fast and accurate near-duplicate image elimination for visual sensor networks , 2017, Int. J. Distributed Sens. Networks.

[35]  Yi Liu,et al.  Memetic algorithm with simulated annealing strategy and tightness greedy optimization for community detection in networks , 2015, Appl. Soft Comput..

[36]  Peng Zhang,et al.  Comparative definition of community and corresponding identifying algorithm. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  P M J Trevelyan Analytical small-time asymptotic properties of A+B-->C fronts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Haluk Bingol,et al.  Community Detection in Complex Networks Using Genetic Algorithms , 2006, 0711.0491.

[39]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[40]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[41]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[42]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  Yalin Chen,et al.  A modified MOEA/D approach to the solution of multi-objective optimal power flow problem , 2016, Appl. Soft Comput..

[44]  Yangyang Li,et al.  Large-scale community detection based on node membership grade and sub-communities integration , 2015, Physica A: Statistical Mechanics and its Applications.

[45]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[47]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[48]  Shoshana J. Wodak,et al.  Markov clustering versus affinity propagation for the partitioning of protein interaction graphs , 2009, BMC Bioinformatics.

[49]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[50]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[51]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Luonan Chen,et al.  Quantitative function for community detection. , 2008 .

[53]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[54]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[55]  Tinghuai Ma,et al.  A novel subgraph K+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K^{+}$$\end{document}-isomorphism method in social , 2017, Soft Computing.

[56]  Jie Liu,et al.  Multi-level learning based memetic algorithm for community detection , 2014, Appl. Soft Comput..

[57]  Yu Xue,et al.  A community integration strategy based on an improved modularity density increment for large-scale networks , 2017 .

[58]  Ling Shao,et al.  A rapid learning algorithm for vehicle classification , 2015, Inf. Sci..

[59]  Jian Zhang,et al.  Energy-efficient data-gathering rendezvous algorithms with mobile sinks for wireless sensor networks , 2017, Int. J. Sens. Networks.

[60]  Chung-Kuan Cheng,et al.  Ratio cut partitioning for hierarchical designs , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[61]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[62]  Ronghua Shang,et al.  Community detection based on modularity and an improved genetic algorithm , 2013 .

[63]  Chengsheng Yuan,et al.  Fingerprint liveness detection based on multi-scale LPQ and PCA , 2016, China Communications.

[64]  Leif E. Peterson K-nearest neighbor , 2009, Scholarpedia.

[65]  Jian Shen,et al.  A lightweight multi-layer authentication protocol for wireless body area networks , 2018, Future Gener. Comput. Syst..

[66]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.