Stacked autoencoder-based community detection method via an ensemble clustering framework

Abstract Community detection is a challenging issue because most existing methods are not well suited for complex social networks with ambiguous structures. In this paper, we propose a novel community detection method named Stacked Autoencoder-Based Community Detection Method via Ensemble Clustering (CDMEC). This is the first time that we have attempted to apply four different complex network similarity representations to the community detection problem. This work makes up for the insufficiency of the single similarity matrix to describe the similarity relationship between nodes. These similarity representations can fully describe and consider the sufficient local information between nodes in a network topology. Our CDMEC framework combines transfer learning and a stacked autoencoder to obtain an efficient low-dimensional feature representation of complex networks and aggregates multiple inputs through a novel ensemble clustering framework. This novel framework first uses the basic clustering results to construct a consistent matrix, and then it employs the nonnegative matrix factorization (NMF)-based clustering method to detect reliable clustering results from the consistent matrix. The results of various extensive experiments on artificial benchmark networks and real-world networks showed that the proposed CDMEC framework is superior to the existing state-of-the-art community detection methods and has great potential in solving the community detection problems.

[1]  Weixiong Zhang,et al.  Joint Identification of Network Communities and Semantics via Integrative Modeling of Network Topologies and Node Contents , 2017, AAAI.

[2]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[3]  P. Choler,et al.  Assessment of Microbial Communities by Graph Partitioning in a Study of Soil Fungi in Two Alpine Meadows , 2009, Applied and Environmental Microbiology.

[4]  Gábor J. Székely,et al.  Hierarchical Clustering via Joint Between-Within Distances: Extending Ward's Minimum Variance Method , 2005, J. Classif..

[5]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Yousef Saad,et al.  Dense Subgraph Extraction with Application to Community Detection , 2012, IEEE Transactions on Knowledge and Data Engineering.

[7]  Andrea Tagarelli,et al.  Ensemble-based community detection in multilayer networks , 2017, Data Mining and Knowledge Discovery.

[8]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[9]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Qinbao Song,et al.  Revealing Density-Based Clustering Structure from the Core-Connected Tree of a Network , 2013, IEEE Transactions on Knowledge and Data Engineering.

[11]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[12]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[13]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Francisco Herrera,et al.  Consensus reaching in social network DeGroot Model: The roles of the Self-confidence and node degree , 2019, Inf. Sci..

[15]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[16]  Michael K. Ng,et al.  Robust and Non-Negative Collective Matrix Factorization for Text-to-Image Transfer Learning , 2015, IEEE Transactions on Image Processing.

[17]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[18]  Jian Liu,et al.  Comparative Analysis for k-Means Algorithms in Network Community Detection , 2010, ISICA.

[19]  Jing Hua,et al.  Pedestrian recognition in multi-camera networks based on deep transfer learning and feature visualization , 2018, Neurocomputing.

[20]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[22]  Mahdi Vasighi,et al.  Community Detection in Complex Networks by Detecting and Expanding Core Nodes Through Extended Local Similarity of Nodes , 2018, IEEE Transactions on Computational Social Systems.

[23]  Carsten Hahn,et al.  DeepMoVIPS: Visual indoor positioning using transfer learning , 2016, 2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN).

[24]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[25]  Nguyen Xuan Hoai,et al.  An efficient genetic algorithm for maximizing area coverage in wireless sensor networks , 2019, Inf. Sci..

[26]  Tanmoy Chakraborty,et al.  Ensemble-based Overlapping Community Detection using Disjoint Community Structures , 2018, Knowl. Based Syst..

[27]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[29]  Bhaskar DasGupta,et al.  On analyzing and evaluating privacy measures for social networks under active attack , 2018, Inf. Sci..

[30]  Jianwu Dang,et al.  Combined node and link partitions method for finding overlapping communities in complex networks , 2015, Scientific Reports.

[31]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[32]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[33]  Timo Hämäläinen,et al.  Revealing community structures by ensemble clustering using group diffusion , 2018, Inf. Fusion.

[34]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[35]  Arif Mahmood,et al.  Subspace Based Network Community Detection Using Sparse Linear Coding , 2016, IEEE Trans. Knowl. Data Eng..

[36]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[37]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[38]  Rushed Kanawati,et al.  YASCA: An Ensemble-Based Approach for Community Detection in Complex Networks , 2014, COCOON.

[39]  Jianwu Dang,et al.  Incorporating network structure with node contents for community detection on large networks using deep learning , 2018, Neurocomputing.

[40]  Xiaochun Cao,et al.  Modularity Based Community Detection with Deep Learning , 2016, IJCAI.