Bayesian Graphical Regression

ABSTRACT We consider the problem of modeling conditional independence structures in heterogenous data in the presence of additional subject-level covariates—termed graphical regression. We propose a novel specification of a conditional (in)dependence function of covariates—which allows the structure of a directed graph to vary flexibly with the covariates; imposes sparsity in both edge and covariate selection; produces both subject-specific and predictive graphs; and is computationally tractable. We provide theoretical justifications of our modeling endeavor, in terms of graphical model selection consistency. We demonstrate the performance of our method through rigorous simulation studies. We illustrate our approach in a cancer genomics-based precision medicine paradigm, where-in we explore gene regulatory networks in multiple myeloma taking prognostic clinical factors into account to obtain both population-level and subject-level gene regulatory networks. Supplementary materials for this article are available online.

[1]  David Heckerman,et al.  Knowledge Representation and Inference in Similarity Networks and Bayesian Multinets , 1996, Artif. Intell..

[2]  Sourav Bandyopadhyay,et al.  Rewiring of Genetic Networks in Response to DNA Damage , 2010, Science.

[3]  G. Morgan,et al.  Novel drugs in myeloma: harnessing tumour biology to treat myeloma. , 2011, Recent results in cancer research. Fortschritte der Krebsforschung. Progres dans les recherches sur le cancer.

[4]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[5]  D. Chauhan,et al.  Biologic sequelae of interleukin-6 induced PI3-K/Akt signaling in multiple myeloma , 2001, Oncogene.

[6]  Le Song,et al.  Estimating time-varying networks , 2008, ISMB 2008.

[7]  A. Gelman,et al.  Using Redundant Parameterizations to Fit Hierarchical Models , 2008 .

[8]  Xi Chen,et al.  Graph-Valued Regression , 2010, NIPS.

[9]  S. Lang,et al.  Bayesian P-Splines , 2004 .

[10]  James G. Scott,et al.  Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem , 2010, 1011.2333.

[11]  J. Crowley,et al.  International staging system for multiple myeloma. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[12]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[13]  J. Gera,et al.  Downstream effectors of oncogenic ras in multiple myeloma cells. , 2003, Blood.

[14]  Donatello Telesca,et al.  Nonlocal Priors for High-Dimensional Estimation , 2014, Journal of the American Statistical Association.

[15]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[16]  Jie Cheng,et al.  A sparse Ising model with covariates. , 2014, Biometrics.

[17]  Giovanni M. Marchetti,et al.  Efficient local updates for undirected graphical models , 2015, Stat. Comput..

[18]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[19]  Alexander J. Hartemink,et al.  Learning Non-Stationary Dynamic Bayesian Networks , 2010, J. Mach. Learn. Res..

[20]  Le Song,et al.  Time-Varying Dynamic Bayesian Networks , 2009, NIPS.

[21]  Jianqing Fan,et al.  Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models , 2014, Journal of the American Statistical Association.

[22]  Dirk Husmeier,et al.  Non-homogeneous dynamic Bayesian networks with Bayesian regularization for inferring gene regulatory networks with gradually time-varying structure , 2012, Machine Learning.

[23]  Guido Consonni,et al.  Objective Bayesian Search of Gaussian Directed Acyclic Graphical Models for Ordered Variables with Non‐Local Priors , 2013, Biometrics.

[24]  P. Loehrer,et al.  International Staging System for Multiple Myeloma , 2006 .

[25]  Giusi Moffa,et al.  Partition MCMC for Inference on Acyclic Digraphs , 2015, 1504.05006.

[26]  C. Der,et al.  Targeting the Raf-MEK-ERK mitogen-activated protein kinase cascade for the treatment of cancer , 2007, Oncogene.

[27]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[28]  Jukka Corander,et al.  Stratified Graphical Models - Context-Specific Independence in Graphical Models , 2013, 1309.6415.

[29]  Larry A. Wasserman,et al.  Time varying undirected graphs , 2008, Machine Learning.

[30]  A. McKenna,et al.  Widespread genetic heterogeneity in multiple myeloma: implications for targeted therapy. , 2014, Cancer cell.

[31]  Marco Grzegorczyk,et al.  Non-homogeneous dynamic Bayesian networks for continuous data , 2011, Machine Learning.

[32]  V. Johnson,et al.  On the use of non‐local prior densities in Bayesian hypothesis tests , 2010 .

[33]  Ali Shojaie,et al.  Penalized likelihood methods for estimation of sparse high-dimensional directed acyclic graphs. , 2009, Biometrika.

[34]  Minsuk Shin,et al.  Scalable Bayesian Variable Selection Using Nonlocal Prior Densities in Ultrahigh-dimensional Settings. , 2015, Statistica Sinica.

[35]  J. Silke,et al.  Regulation of TNFRSF and innate immune signalling complexes by TRAFs and cIAPs , 2010, Cell Death and Differentiation.

[36]  R. Fonseca,et al.  Expression of VEGF and its receptors by myeloma cells , 2003, Leukemia.

[37]  Hong-Ye Gao,et al.  Wavelet Shrinkage Denoising Using the Non-Negative Garrote , 1998 .

[38]  Zoubin Ghahramani,et al.  Learning Dynamic Bayesian Networks , 1997, Summer School on Neural Networks.

[39]  Peter Müller,et al.  Detecting differential patterns of interaction in molecular pathways. , 2015, Biostatistics.

[40]  L. Fahrmeir,et al.  Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models , 2011, 1105.5250.

[41]  Alan Ashworth,et al.  Translating cancer research into targeted therapeutics , 2010, Nature.

[42]  Francesco C Stingo,et al.  Bayesian nonlinear model selection for gene regulatory networks , 2015, Biometrics.

[43]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[44]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[45]  Joe W. Gray,et al.  Joint estimation of multiple networks from time course data , 2013 .

[46]  V. Johnson,et al.  Bayesian Model Selection in High-Dimensional Settings , 2012, Journal of the American Statistical Association.

[47]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[48]  G. Heppner,et al.  Tumor heterogeneity: biological implications and therapeutic consequences , 2004, Cancer and Metastasis Reviews.

[49]  Veronika Rockova,et al.  EMVS: The EM Approach to Bayesian Variable Selection , 2014 .

[50]  Michael C. Horsch,et al.  Dynamic Bayesian networks , 1990 .

[51]  J. S. Rao,et al.  Spike and slab variable selection: Frequentist and Bayesian strategies , 2005, math/0505633.

[52]  S. Gerondakis,et al.  MEK inhibitors as a chemotherapeutic intervention in multiple myeloma , 2013, Blood Cancer Journal.

[53]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[54]  David W. Scott,et al.  The Curse of Dimensionality and Dimension Reduction , 2008 .

[55]  Eric P. Xing,et al.  On Sparse Nonparametric Conditional Covariance Selection , 2010, ICML.

[56]  Christine B Peterson,et al.  Bayesian Inference of Multiple Gaussian Graphical Models , 2015, Journal of the American Statistical Association.

[57]  M. Gerstein,et al.  Genomic analysis of regulatory network dynamics reveals large topological changes , 2004, Nature.