Joint conditional Gaussian graphical models with multiple sources of genomic data

It is challenging to identify meaningful gene networks because biological interactions are often condition-specific and confounded with external factors. It is necessary to integrate multiple sources of genomic data to facilitate network inference. For example, one can jointly model expression datasets measured from multiple tissues with molecular marker data in so-called genetical genomic studies. In this paper, we propose a joint conditional Gaussian graphical model (JCGGM) that aims for modeling biological processes based on multiple sources of data. This approach is able to integrate multiple sources of information by adopting conditional models combined with joint sparsity regularization. We apply our approach to a real dataset measuring gene expression in four tissues (kidney, liver, heart, and fat) from recombinant inbred rats. Our approach reveals that the liver tissue has the highest level of tissue-specific gene regulations among genes involved in insulin responsive facilitative sugar transporter mediated glucose transport pathway, followed by heart and fat tissues, and this finding can only be attained from our JCGGM approach.

[1]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[2]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[3]  Hongyu Zhao,et al.  Sparse Estimation of Conditional Graphical Models With Application to Gene Networks , 2012, Journal of the American Statistical Association.

[4]  Michael I. Jordan Graphical Models , 2003 .

[5]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[6]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[7]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[8]  Enrico Petretto,et al.  Heritability and Tissue Specificity of Expression Quantitative Trait Loci , 2006, PLoS genetics.

[9]  Makoto Kanzaki,et al.  Regulated membrane trafficking of the insulin-responsive glucose transporter 4 in adipocytes. , 2004, Endocrine reviews.

[10]  Gilbert D. Brum,et al.  Biology: Exploring Life , 1989 .

[11]  E. Petretto,et al.  Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease , 2005, Nature Genetics.

[12]  Hongzhe Li,et al.  Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks. , 2006, Biostatistics.

[13]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[14]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[15]  J. Nap,et al.  Genetical genomics : the added value from segregation , 2001 .

[16]  Hongzhe Li,et al.  A SPARSE CONDITIONAL GAUSSIAN GRAPHICAL MODEL FOR ANALYSIS OF GENETICAL GENOMICS DATA. , 2011, The annals of applied statistics.

[17]  C. O. A. D. P. R. M. A. E. Stimation Covariate Adjusted Precision Matrix Estimation with an Application in Genetical Genomics , 2011 .

[18]  Rainer Breitling,et al.  Expression Quantitative Trait Loci Are Highly Sensitive to Cellular Differentiation State , 2009, PLoS genetics.