Gene Regulation Network Inference With Joint Sparse Gaussian Graphical Models

Revealing biological networks is one key objective in systems biology. With microarrays, researchers now routinely measure expression profiles at the genome level under various conditions, and such data may be used to statistically infer gene regulation networks. Gaussian graphical models (GGMs) have proven useful for this purpose by modeling the Markovian dependence among genes. However, a single GGM may not be adequate to describe the potentially differing networks across various conditions, and hence it is more natural to infer multiple GGMs from such data. In this article we propose a class of nonconvex penalty functions aiming at the estimation of multiple GGMs with a flexible joint sparsity constraint. We illustrate the property of our proposed nonconvex penalty functions by simulation study. We then apply the method to a gene expression dataset from the GenCord Project, and show that our method can identify prominent pathways across different conditions. Supplementary materials for this article are available online.

[1]  B. Chait,et al.  Retinol is essential for growth of activated human B cells , 1990, The Journal of experimental medicine.

[2]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[3]  Jianqing Fan,et al.  Sparsistency and Rates of Convergence in Large Covariance Matrix Estimation. , 2007, Annals of statistics.

[4]  Christophe Ambroise,et al.  Inferring Multiple Graphical Models , 2009 .

[5]  Adam J. Rothman,et al.  Sparse permutation invariant covariance estimation , 2008, 0801.4837.

[6]  Christophe Ambroise,et al.  Inferring multiple graphical structures , 2009, Stat. Comput..

[7]  E. Barta,et al.  PPARgamma regulates the function of human dendritic cells primarily by altering lipid metabolism. , 2007, Blood.

[8]  Marco Grzegorczyk,et al.  Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks , 2006, Bioinform..

[9]  T. Hastie,et al.  SparseNet: Coordinate Descent With Nonconvex Penalties , 2011, Journal of the American Statistical Association.

[10]  H. Zou,et al.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models. , 2008, Annals of statistics.

[11]  D. Altieri,et al.  A Survivin-Ran Complex Regulates Spindle Formation in Tumor Cells , 2008, Molecular and Cellular Biology.

[12]  Cun-Hui Zhang,et al.  A group bridge approach for variable selection , 2009, Biometrika.

[13]  P. Deloukas,et al.  Common Regulatory Variation Impacts Gene Expression in a Cell Type–Dependent Manner , 2009, Science.

[14]  H. Zou,et al.  Composite quantile regression and the oracle Model Selection Theory , 2008, 0806.2905.

[15]  Robert N. Taylor,et al.  PPAR Action in Human Placental Development and Pregnancy and Its Complications , 2007, PPAR research.

[16]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[17]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[18]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[19]  T. Cai,et al.  A Constrained ℓ1 Minimization Approach to Sparse Precision Matrix Estimation , 2011, 1102.2233.

[20]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[21]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[22]  J.A. Sweetkind-Singer,et al.  Log-penalized linear regression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..

[23]  Ming Yuan,et al.  High Dimensional Inverse Covariance Matrix Estimation via Linear Programming , 2010, J. Mach. Learn. Res..

[24]  Pei Wang,et al.  Partial Correlation Estimation by Joint Sparse Regression Models , 2008, Journal of the American Statistical Association.

[25]  N. Meinshausen,et al.  Consistent neighbourhood selection for sparse high-dimensional graphs with the Lasso , 2004 .

[26]  Claudio Altafini,et al.  Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data , 2007, Bioinform..

[27]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.