Estimating a network from multiple noisy realizations

Complex interactions between entities are often represented as edges in a network. In practice, the network is often constructed from noisy measurements and inevitably contains some errors. In this paper we consider the problem of estimating a network from multiple noisy observations where edges of the original network are recorded with both false positives and false negatives. This problem is motivated by neuroimaging applications where brain networks of a group of patients with a particular brain condition could be viewed as noisy versions of an unobserved true network corresponding to the disease. The key to optimally leveraging these multiple observations is to take advantage of network structure, and here we focus on the case where the true network contains communities. Communities are common in real networks in general and in particular are believed to be presented in brain networks. Under a community structure assumption on the truth, we derive an efficient method to estimate the noise levels and the original network, with theoretical guarantees on the convergence of our estimates. We show on synthetic networks that the performance of our method is close to an oracle method using the true parameter values, and apply our method to fMRI brain data, demonstrating that it constructs stable and plausible estimates of the population network.

[1]  Chao Gao,et al.  Achieving Optimal Misclassification Proportion in Stochastic Block Models , 2015, J. Mach. Learn. Res..

[2]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Can M. Le,et al.  Sparse random graphs: regularization and concentration of the Laplacian , 2015, ArXiv.

[4]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Olaf Sporns,et al.  Complex network measures of brain connectivity: Uses and interpretations , 2010, NeuroImage.

[6]  Patrick J. Wolfe,et al.  Network histograms and universality of blockmodel approximation , 2013, Proceedings of the National Academy of Sciences.

[7]  Timothy O. Laumann,et al.  Functional Network Organization of the Human Brain , 2011, Neuron.

[8]  Genevera I. Allen,et al.  Two Sample Inference for Populations of Graphical Models with Applications to Functional Connectivity , 2015, 1502.03853.

[9]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[10]  Jing Lei,et al.  Network Cross-Validation for Determining the Number of Communities in Network Data , 2014, 1411.1715.

[11]  Can M. Le,et al.  Estimating the number of communities in networks by spectral methods , 2015, ArXiv.

[12]  Zhengwu Zhang,et al.  Common and individual structure of brain networks , 2017, The Annals of Applied Statistics.

[13]  YU BIN,et al.  IMPACT OF REGULARIZATION ON SPECTRAL CLUSTERING , 2016 .

[14]  P. Bickel,et al.  Likelihood-based model selection for stochastic block models , 2015, 1502.02069.

[15]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Martin J. Wainwright,et al.  Statistical guarantees for the EM algorithm: From population to sample-based analysis , 2014, ArXiv.

[17]  Patrick J. Wolfe,et al.  Co-clustering separately exchangeable network data , 2012, ArXiv.

[18]  Daniel S. Margulies,et al.  Connectome Smoothing via Low-Rank Approximations , 2016, IEEE Transactions on Medical Imaging.

[19]  C. Priebe,et al.  Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding , 2013, 1310.0532.

[20]  Emmanuel Abbe,et al.  Community detection and stochastic block models: recent developments , 2017, Found. Trends Commun. Inf. Theory.

[21]  S. Taylor,et al.  Social appraisal in chronic psychosis: role of medial frontal and occipital networks. , 2011, Journal of psychiatric research.

[22]  Danielle S. Bassett,et al.  Resolving Structural Variability in Network Models and the Brain , 2013, PLoS Comput. Biol..

[23]  R. Preston McAfee,et al.  Who moderates the moderators?: crowdsourcing abuse detection in user-generated content , 2011, EC '11.

[24]  Peter J. Bickel,et al.  Fitting community models to large sparse networks , 2012, ArXiv.

[25]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[26]  Yong He,et al.  BrainNet Viewer: A Network Visualization Tool for Human Brain Connectomics , 2013, PloS one.

[27]  Bryon A. Mueller,et al.  Altered resting state complexity in schizophrenia , 2012, NeuroImage.

[28]  P. Bickel,et al.  A nonparametric view of network models and Newman–Girvan and other modularities , 2009, Proceedings of the National Academy of Sciences.

[29]  K. Phan,et al.  Abnormal GABAergic Function and Negative Affect in Schizophrenia , 2014, Neuropsychopharmacology.

[30]  Elchanan Mossel,et al.  Spectral redemption in clustering sparse networks , 2013, Proceedings of the National Academy of Sciences.

[31]  Can M. Le,et al.  Concentration and regularization of random graphs , 2015, Random Struct. Algorithms.

[32]  Joel A. Tropp,et al.  User-Friendly Tail Bounds for Sums of Random Matrices , 2010, Found. Comput. Math..

[33]  A. P. Dawid,et al.  Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm , 1979 .

[34]  Keith A. Johnson,et al.  Cortical Hubs Revealed by Intrinsic Functional Connectivity: Mapping, Assessment of Stability, and Relation to Alzheimer's Disease , 2009, The Journal of Neuroscience.

[35]  Law of Large Graphs , 2015 .

[36]  V. Koltchinskii,et al.  Oracle inequalities in empirical risk minimization and sparse recovery problems , 2011 .

[37]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[38]  Genevera I. Allen,et al.  Mixed Effects Models for Resampled Network Statistics Improves Statistical Power to Find Differences in Multi-Subject Functional Connectivity , 2016, bioRxiv.

[39]  Anup Rao,et al.  Stochastic Block Model and Community Detection in Sparse Graphs: A spectral algorithm with optimal rate of recovery , 2015, COLT.

[40]  C. Priebe,et al.  A central limit theorem for an omnibus embedding of random dot product graphs , 2017, 1705.09355.

[41]  David B. Dunson,et al.  Common and Individual Structure of Multiple Networks , 2017 .

[42]  S. M. Hadi Hosseini,et al.  Comparing connectivity pattern and small-world organization between structural correlation and resting-state networks in healthy adults , 2013, NeuroImage.

[43]  Gábor Lugosi,et al.  Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[44]  E. Abbe,et al.  Community detection and the stochastic block model , 2016 .

[45]  M. Talagrand,et al.  Probability in Banach Spaces: Isoperimetry and Processes , 1991 .

[46]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..