Testing for Differences in Gaussian Graphical Models: Applications to Brain Connectivity

Functional brain networks are well described and estimated from data with Gaussian Graphical Models (GGMs), e.g. using sparse inverse covariance estimators. Comparing functional connectivity of subjects in two population calls for comparing these estimated GGMs. We study the problem of identifying differences in Gaussian Graphical Models (GGMs) known to have similar structure. We aim to characterize the uncertainty of differences with confidence intervals obtained using a para-metric distribution on parameters of a sparse estimator. Sparse penalties enable statistical guarantees and interpretable models even in high-dimensional and low-number-of-samples settings. Quantifying the uncertainty of the parameters selected by the sparse penalty is an important question in applications such as neuroimaging or bioinformatics. Indeed, selected variables can be interpreted to build theoretical understanding or to make therapeutic decisions. Characterizing the distributions of sparse regression models is inherently challenging since the penalties produce a biased estimator. Recent work has shown how one can invoke the sparsity assumptions to effectively remove the bias from a sparse estimator such as the lasso. These distributions can be used to give us confidence intervals on edges in GGMs, and by extension their differences. However, in the case of comparing GGMs, these estimators do not make use of any assumed joint structure among the GGMs. Inspired by priors from brain functional connectivity we focus on deriving the distribution of parameter differences under a joint penalty when parameters are known to be sparse in the difference. This leads us to introduce the debiased multi-task fused lasso. We show that we can debias and characterize the distribution in an efficient manner. We then go on to show how the debiased lasso and multi-task fused lasso can be used to obtain confidence intervals on edge differences in Gaussian graphical models. We validate the techniques proposed on a set of synthetic examples as well as neuro-imaging dataset created for the study of autism.

[1]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[2]  Daniel P. Kennedy,et al.  The Autism Brain Imaging Data Exchange: Towards Large-Scale Evaluation of the Intrinsic Brain Architecture in Autism , 2013, Molecular Psychiatry.

[3]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[4]  Arindam Banerjee,et al.  Generalized Direct Change Estimation in Ising Model Structure , 2016, ICML.

[5]  Gaël Varoquaux,et al.  Multi-subject Dictionary Learning to Segment an Atlas of Brain Spontaneous Activity , 2011, IPMI.

[6]  N. Meinshausen,et al.  High-Dimensional Inference: Confidence Intervals, $p$-Values and R-Software hdi , 2014, 1408.4026.

[7]  R. Cameron Craddock,et al.  Clinical applications of the functional connectome , 2013, NeuroImage.

[8]  T. Cai,et al.  Direct estimation of differential networks. , 2014, Biometrika.

[9]  Wolfgang Polonik,et al.  Local Neighborhood Fusion in Locally Constant Gaussian Graphical Models , 2014, 1410.8766.

[10]  O Sporns,et al.  Predicting human resting-state functional connectivity from structural connectivity , 2009, Proceedings of the National Academy of Sciences.

[11]  Dimitri Van De Ville,et al.  Decoding brain states from fMRI connectivity graphs , 2011, NeuroImage.

[12]  Genevera I. Allen,et al.  Mixed Effects Models to Find Differences in Multi-Subject Functional Connectivity , 2015 .

[13]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[14]  B. Biswal,et al.  Characterizing variation in the functional connectome: promise and pitfalls , 2012, Trends in Cognitive Sciences.

[15]  Brian A. Nosek,et al.  Power failure: why small sample size undermines the reliability of neuroscience , 2013, Nature Reviews Neuroscience.

[16]  Mark W. Woolrich,et al.  Network modelling methods for FMRI , 2011, NeuroImage.

[17]  Xi Chen,et al.  Smoothing Proximal Gradient Method for General Structured Sparse Learning , 2011, UAI.

[18]  Gaël Varoquaux,et al.  Learning and comparing functional connectomes across subjects , 2013, NeuroImage.

[19]  Seungyeop Han,et al.  Structured Learning of Gaussian Graphical Models , 2012, NIPS.

[20]  Jean-Baptiste Poline,et al.  Brain covariance selection: better individual functional connectivity models using population prior , 2010, NIPS.

[21]  Sara van de Geer,et al.  Statistics for High-Dimensional Data , 2011 .

[22]  G. Marsaglia CONDITIONAL MEANS AND COVARIANCES OF NORMAL VARIABLES WITH SINGULAR COVARIANCE MATRIX , 1964 .

[23]  R. Tibshirani,et al.  A SIGNIFICANCE TEST FOR THE LASSO. , 2013, Annals of statistics.

[24]  Henry Kennedy,et al.  Cortical High-Density Counterstream Architectures , 2013, Science.

[25]  M. Lindquist The Statistical Analysis of fMRI Data. , 2008, 0906.3662.

[26]  R. Tibshirani,et al.  Adaptive testing for the graphical lasso , 2013, 1307.4765.

[27]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[28]  Adel Javanmard,et al.  Confidence intervals and hypothesis testing for high-dimensional regression , 2013, J. Mach. Learn. Res..

[29]  S. Geer,et al.  Confidence intervals for high-dimensional inverse covariance estimation , 2014, 1403.6752.

[30]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[31]  Dimitris Samaras,et al.  Multi-Task Learning of Gaussian Graphical Models , 2010, ICML.

[32]  M. Greicius,et al.  Decoding subject-driven cognitive states with whole-brain connectivity patterns. , 2012, Cerebral cortex.

[33]  Vincent Frouin,et al.  Randomized parcellation based inference , 2014, NeuroImage.

[34]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.