A model framework for the enhancement of community detection in complex networks

Community detection is an important data analysis problem in many different areas, and how to enhance the quality of community detection in complicated real applications is still a challenge. Current community detection enhancement methods often take the enhancement as a preprocess of community detection. They mainly focus on how to design the suitable topological similarity of nodes to adjust the original network, but did not consider how to make use of this topological similarity more effectively. In order to better utilize the similarity information, we propose a model framework which integrates the enhancement into the whole community detection procedure. First, we calculate the structural similarity of nodes based on network topology. Second, we present a stochastic model to describe the community memberships of nodes; we then model the strong constraint based on structural similarity, i.e., we make each node have the same community membership distribution with its most similar neighbors; and then we model the weak constraint, i.e., if two nodes have a high similarity we will make their community membership distributions close, otherwise we will make them not close. Finally, we present a nonnegative matrix factorization approach to learn the model parameters. We evaluate our method on both synthetic and real-world networks with ground-truths, and compare it with five comparable methods. The experimental results demonstrate the superior performance of our new method over the competing ones for community detection and enhancement.

[1]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[2]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[3]  Mark E. J. Newman,et al.  An efficient and principled method for detecting communities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Wei Ren,et al.  Simple probabilistic algorithm for detecting community structure. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Fei Wang,et al.  Community discovery using nonnegative matrix factorization , 2011, Data Mining and Knowledge Discovery.

[6]  Weixiong Zhang,et al.  Modeling with Node Degree Preservation Can Accurately Find Communities , 2015, AAAI.

[7]  Shuigeng Zhou,et al.  CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets , 2014, Genom. Proteom. Bioinform..

[8]  Jonathan W. Berry,et al.  Tolerating the community detection resolution limit with edge weighting. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Dit-Yan Yeung,et al.  Overlapping community detection via bounded nonnegative matrix tri-factorization , 2012, KDD.

[10]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[11]  M. Hasler,et al.  Network community-detection enhancement by proper weighting. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Hongtao Lu,et al.  Enhanced modularity-based community detection by random walk network preprocessing. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Pasquale De Meo,et al.  Enhancing community detection using a network weighting strategy , 2013, Inf. Sci..

[14]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[15]  Stephen Roberts,et al.  Overlapping community detection using Bayesian non-negative matrix factorization. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[17]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockstructures , 2001 .

[18]  Mason A. Porter,et al.  Social Structure of Facebook Networks , 2011, ArXiv.

[19]  Xueqi Cheng,et al.  Exploring the structural regularities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Peng Gang Sun,et al.  Weighting links based on edge centrality for community detection , 2014 .

[21]  Chao Liu,et al.  Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce , 2010, WWW '10.

[22]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[23]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Norikazu Takahashi,et al.  Global convergence of modified multiplicative updates for nonnegative matrix factorization , 2013, Computational Optimization and Applications.

[25]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Michal Laclavik,et al.  On community detection in real-world networks and the importance of degree assortativity , 2013, KDD.

[28]  Jure Leskovec,et al.  Overlapping community detection at scale: a nonnegative matrix factorization approach , 2013, WSDM.

[29]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.