An Effective Algorithm for Optimizing Surprise in Network Community Detection

Many methods have been proposed to detect communities/modules in various networks such as biological molecular networks and disease networks, while optimizing statistical measures for community structures is one of the most popular ways for community detection. Surprise, which is a statistical measure of interest for community detection, has good performance in many networks, but it still encounters the resolution limit in some cases and it is hard to be optimized due to its strong nonlinearity. Here, we discussed the resolution limit of Surprise by a phase diagram in community-partition transition, and then proposed an improved algorithm for Surprise optimization by introducing three effective strategies: a pre-processing of topological structure based on local random walks (Pre_TS), a pre-processing of community partition (Pre_CS), and a post-processing of community partition (Post_CS). By a series of experimental tests in various networks, we show that Pre_TS can effectively enhance the resolution of Surprise, Pre_CS and Post_CS can improve the optimization performance in different aspects, and as expected, the combination of these strategies can more effectively enhance the ability of Surprise to detect communities in complex networks. Finally, we displayed the effectiveness of the improved algorithm for Surprise optimization in several real-world networks, and applied the algorithm to the analysis of disease-related networks in computational biology.

[1]  Angelo Bifone,et al.  Community detection in weighted brain connectivity networks beyond the resolution limit , 2016, NeuroImage.

[2]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[3]  Chris H. Q. Ding,et al.  Determining modular organization of protein interaction networks by maximizing modularity density , 2010, BMC Systems Biology.

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Bin Wang,et al.  Overlapping Community Detection Based on Information Dynamics , 2018, IEEE Access.

[6]  Angelo Bifone,et al.  Modular structure of brain functional networks: breaking the resolution limit by Surprise , 2016, Scientific Reports.

[7]  Jian Yu,et al.  An efficient community detection algorithm using greedy surprise maximization , 2014 .

[8]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[9]  Jianrui Chen,et al.  A dynamic evolutionary clustering perspective: Community detection in signed networks by reconstructing neighbor sets , 2016 .

[10]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Li Wang,et al.  Community Detection Based on Local Information and Dynamic Expansion , 2019, IEEE Access.

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Yong Wang,et al.  Overlapping Community Detection in Complex Networks using Symmetric Binary Matrix Factorization , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Charo I. Del Genio,et al.  Finding network communities using modularity density , 2016, ArXiv.

[15]  Shu Yan,et al.  Global and local targeted immunization in networks with community structure , 2015 .

[16]  Yan Zhang,et al.  Multi-resolution community detection based on generalized self-loop rescaling strategy , 2014, Physica A: Statistical Mechanics and its Applications.

[17]  Zheng Wang,et al.  Community Detection in Signed Networks Based on the Signed Stochastic Block Model and Exact ICL , 2019, IEEE Access.

[18]  胡涛 Analysis and perturbation of degree correlation in complex networks , 2015 .

[19]  Michele Leone,et al.  (Un)detectable cluster structure in sparse networks. , 2007, Physical review letters.

[20]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  C. Nardini,et al.  Correlation enhanced modularity-based belief propagation method for community detection in networks , 2016 .

[22]  Ju Xiang,et al.  Phase transition of Surprise optimization in community detection , 2018 .

[23]  Vincent A. Traag,et al.  Detecting communities using asymptotical Surprise , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Sidney Redner,et al.  Community structure of the physical review citation network , 2009, J. Informetrics.

[25]  Albert-László Barabási,et al.  A DIseAse MOdule Detection (DIAMOnD) Algorithm Derived from a Systematic Analysis of Connectivity Patterns of Disease Proteins in the Human Interactome , 2015, PLoS Comput. Biol..

[26]  A. Barabasi,et al.  Human symptoms–disease network , 2014, Nature Communications.

[27]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[28]  Ke Hu,et al.  Predicting disease-related genes by path-based similarity and community structure in protein-protein interaction network , 2017 .

[29]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[30]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[31]  Cedric E. Ginestet,et al.  Cognitive relevance of the community structure of the human brain functional coactivation network , 2013, Proceedings of the National Academy of Sciences.

[32]  Tao Hu,et al.  Local modularity for community detection in complex networks , 2016 .

[33]  Athina P. Petropulu,et al.  Detecting community structure using label propagation with consensus weight in complex network , 2014 .

[34]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Dongxiao He,et al.  A model framework for the enhancement of community detection in complex networks , 2016 .

[36]  Jie Cao,et al.  Weighted modularity optimization for crisp and fuzzy community detection in large-scale networks , 2016 .

[37]  Zhihai Rong,et al.  Impact of heterogeneous activity and community structure on the evolutionary success of cooperators in social networks. , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  M. Barber,et al.  Detecting network communities by propagating labels under constraints. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Hongtao Lu,et al.  Enhanced modularity-based community detection by random walk network preprocessing. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Fang-Xiang Wu,et al.  Identification of Prognostic and Heterogeneous Breast Cancer Biomarkers Based on Fusion Network and Multiple Scoring Strategies , 2019, ICIC.

[41]  Liang Tang,et al.  Enhancing community detection by using local structural information , 2016, ArXiv.

[42]  Lei Chen,et al.  Parallel Community Detection Based on Distance Dynamics for Large-Scale Network , 2018, IEEE Access.

[43]  Haixia Wang,et al.  Parallelizing and optimizing overlapping community detection with speaker-listener Label Propagation Algorithm on multi-core architecture , 2017, 2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).

[44]  Vincent A. Traag,et al.  Significant Scales in Community Structure , 2013, Scientific Reports.

[45]  Mingwei Leng,et al.  A divisive spectral method for network community detection , 2015, ArXiv.

[46]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Licheng Jiao,et al.  Density shrinking algorithm for community detection with path based similarity , 2015 .

[48]  Daniel Marbach,et al.  Assessment of network module identification across complex diseases , 2019, Nature Methods.

[49]  Raj Rao Nadakuditi,et al.  Graph spectra and the detectability of community structure in networks , 2012, Physical review letters.

[50]  Jean-Charles Delvenne,et al.  Stability of graph communities across time scales , 2008, Proceedings of the National Academy of Sciences.

[51]  Jinlong Ma,et al.  Traffic dynamics on two-layer complex networks with limited delivering capacity , 2016 .

[52]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[53]  Yu Xin,et al.  The adaptive dynamic community detection algorithm based on the non-homogeneous random walking , 2016 .

[54]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  Ronghua Shang,et al.  A multiobjective evolutionary algorithm to find community structures based on affinity propagation , 2016 .

[56]  Tao You,et al.  Community Detection in Complex Networks Using Density-based Clustering Algorithm , 2015, ArXiv.

[57]  Ignacio Marín,et al.  Surprise maximization reveals the community structure of complex networks , 2013, Scientific Reports.

[58]  A. Barabasi,et al.  Uncovering disease-disease relationships through the incomplete interactome , 2015, Science.

[59]  Dayou Liu,et al.  A Markov random walk under constraint for discovering overlapping communities in complex networks , 2011, ArXiv.

[60]  Yvan Saeys,et al.  A comprehensive evaluation of module detection methods for gene expression data , 2018, Nature Communications.

[61]  Yan Zhang,et al.  Community detection based on significance optimization in complex networks , 2017, ArXiv.

[62]  Lei Chen,et al.  A Modified Distance Dynamics Model for Improvement of Community Detection , 2018, IEEE Access.