A Fast and Scalable Joint Estimator for Integrating Additional Knowledge in Learning Multiple Related Sparse Gaussian Graphical Models

We consider the problem of including additional knowledge in estimating sparse Gaussian graphical models (sGGMs) from aggregated samples, arising often in bioinformatics and neuroimaging applications. Previous joint sGGM estimators either fail to use existing knowledge or cannot scale-up to many tasks (large $K$) under a high-dimensional (large $p$) situation. In this paper, we propose a novel \underline{J}oint \underline{E}lementary \underline{E}stimator incorporating additional \underline{K}nowledge (JEEK) to infer multiple related sparse Gaussian Graphical models from large-scale heterogeneous data. Using domain knowledge as weights, we design a novel hybrid norm as the minimization objective to enforce the superposition of two weighted sparsity constraints, one on the shared interactions and the other on the task-specific structural patterns. This enables JEEK to elegantly consider various forms of existing knowledge based on the domain at hand and avoid the need to design knowledge-specific optimization. JEEK is solved through a fast and entry-wise parallelizable solution that largely improves the computational efficiency of the state-of-the-art $O(p^5K^4)$ to $O(p^2K^4)$. We conduct a rigorous statistical analysis showing that JEEK achieves the same convergence rate $O(\log(Kp)/n_{tot})$ as the state-of-the-art estimators that are much harder to compute. Empirically, on multiple synthetic datasets and two real-world data, JEEK outperforms the speed of the state-of-arts significantly while achieving the same level of prediction accuracy. Available as R tool @ this http URL

[1]  Christophe Ambroise,et al.  Inferring multiple graphical structures , 2009, Stat. Comput..

[2]  Robert J. Vanderbei,et al.  The fastclime package for linear programming and large-scale precision matrix estimation in R , 2014, J. Mach. Learn. Res..

[3]  Christoforos Anagnostopoulos,et al.  Learning population and subject-specific brain connectivity networks via Mixed Neighborhood Selection , 2015, 1512.01947.

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Daniel P. Kennedy,et al.  The Autism Brain Imaging Data Exchange: Towards Large-Scale Evaluation of the Intrinsic Brain Architecture in Autism , 2013, Molecular Psychiatry.

[6]  B. Biswal,et al.  Characterizing variation in the functional connectome: promise and pitfalls , 2012, Trends in Cognitive Sciences.

[7]  Rafael C. Jimenez,et al.  The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases , 2013, Nucleic Acids Res..

[8]  Pradeep Ravikumar,et al.  Elementary Estimators for Graphical Models , 2014, NIPS.

[9]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[10]  Su-In Lee,et al.  Node-based learning of multiple Gaussian graphical models , 2013, J. Mach. Learn. Res..

[11]  Jonathan D. Power,et al.  Prediction of Individual Brain Maturity Using fMRI , 2010, Science.

[12]  Bin Yu,et al.  High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence , 2008, 0811.3628.

[13]  Satoru Miyano,et al.  Weighted lasso in graphical Gaussian modeling for large gene network estimation based on microarray data. , 2007, Genome informatics. International Conference on Genome Informatics.

[14]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[15]  Yunqi Bu,et al.  Integrating additional knowledge into the estimation of graphical models , 2017, The international journal of biostatistics.

[16]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[17]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[18]  Martin J. Wainwright,et al.  A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers , 2009, NIPS.

[19]  Beilun Wang,et al.  A constrained $$\ell $$ℓ1 minimization approach for estimating multiple sparse Gaussian or nonparanormal graphical models , 2016, Machine Learning.

[20]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[21]  Li Qingyang,et al.  Towards Automated Analysis of Connectomes: The Configurable Pipeline for the Analysis of Connectomes (C-PAC) , 2013 .

[22]  Alexandre d'Aspremont,et al.  Model Selection Through Sparse Max Likelihood Estimation Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data , 2022 .

[23]  Michael I. Jordan Graphical Models , 2003 .

[24]  Beilun Wang,et al.  A Constrained, Weighted-L1 Minimization Approach for Joint Discovery of Heterogeneous Neural Connectivity Graphs , 2017, ArXiv.

[25]  Adam J. Rothman,et al.  Sparse permutation invariant covariance estimation , 2008, 0801.4837.

[26]  Matthew N. McCall,et al.  The Gene Expression Barcode: leveraging public data repositories to begin cataloging the human and murine transcriptomes , 2010, Nucleic Acids Res..

[27]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[28]  Charles J. Lynch,et al.  Salience network-based classification and prediction of symptom severity in children with autism. , 2013, JAMA psychiatry.

[29]  Russell A. Poldrack,et al.  Guidelines for reporting an fMRI study , 2008, NeuroImage.

[30]  NeuroData,et al.  Towards Automated Analysis of Connectomes: The Configurable Pipeline for the Analysis of Connectomes , 2015 .

[31]  Yue Joseph Wang,et al.  Learning Structural Changes of Gaussian Graphical Models in Controlled Experiments , 2010, UAI.

[32]  Pradeep Ravikumar,et al.  Elementary Estimators for Sparse Covariance Matrices and other Structured Moments , 2014, ICML.

[33]  Adam J. Rothman,et al.  Generalized Thresholding of Large Covariance Matrices , 2009 .

[34]  Jean-Baptiste Poline,et al.  Brain covariance selection: better individual functional connectivity models using population prior , 2010, NIPS.

[35]  Oluwasanmi Koyejo,et al.  Toward open sharing of task-based fMRI data: the OpenfMRI project , 2013, Front. Neuroinform..

[36]  Jeff G. Schneider,et al.  Learning Multiple Tasks with a Sparse Matrix-Normal Penalty , 2010, NIPS.

[37]  Pradeep Ravikumar,et al.  Elementary Estimators for High-Dimensional Linear Regression , 2014, ICML.

[38]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[39]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[40]  Dimitris Samaras,et al.  Multi-Task Learning of Gaussian Graphical Models , 2010, ICML.

[41]  T. Ideker,et al.  Differential network biology , 2012, Molecular systems biology.

[42]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[43]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[44]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[45]  Natalie Wilson Human Protein Reference Database , 2004, Nature Reviews Genetics.

[46]  Essa Yacoub,et al.  The WU-Minn Human Connectome Project: An overview , 2013, NeuroImage.

[47]  Michael I. Jordan,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[48]  Beilun Wang,et al.  A Fast and Scalable Joint Estimator for Learning Multiple Related Sparse Gaussian Graphical Models , 2017, AISTATS.

[49]  P. Bickel,et al.  Covariance regularization by thresholding , 2009, 0901.3079.

[50]  Xiaotong Shen,et al.  Structural Pursuit Over Multiple Undirected Graphs , 2014, Journal of the American Statistical Association.