Tests for Gaussian graphical models

Gaussian graphical models are promising tools for analysing genetic networks. In many applications, biologists have some knowledge of the genetic network and may want to assess the quality of their model using gene expression data. This is why one introduces a novel procedure for testing the neighborhoods of a Gaussian graphical model. It is based on the connection between the local Markov property and conditional regression of a Gaussian random variable. Adapting recent results on tests for high-dimensional Gaussian linear models, one proves that the testing procedure inherits appealing theoretical properties. Besides, it applies and is computationally feasible in a high-dimensional setting: the number of nodes may be much larger than the number of observations. A large part of the study is devoted to illustrating and discussing applications to simulated data and to biological data.

[1]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[2]  H Kishino,et al.  Correspondence analysis of genes and tissue types and finding genetic links from microarray data. , 2000, Genome informatics. Workshop on Genome Informatics.

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[5]  Jianhua Z. Huang,et al.  Covariance matrix selection and estimation via penalised normal likelihood , 2006 .

[6]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[7]  P. Bühlmann,et al.  Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana , 2004, Genome Biology.

[8]  M. Drton,et al.  Multiple Testing and Error Control in Gaussian Graphical Model Selection , 2005, math/0508267.

[9]  Franck Picard,et al.  A mixture model for random graphs , 2008, Stat. Comput..

[10]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[11]  D. Edwards Introduction to graphical modelling , 1995 .

[12]  P. Bühlmann,et al.  Statistical Applications in Genetics and Molecular Biology Low-Order Conditional Independence Graphs for Inferring Genetic Networks , 2011 .

[13]  Hiroyuki Toh,et al.  Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling , 2002, Bioinform..

[14]  Mathias Drton,et al.  A SINful approach to Gaussian graphical model selection , 2005 .

[15]  Kalpathi R. Subramanian,et al.  Interactive Analysis of Gene Interactions Using Graphical gaussian model , 2003, BIOKDD.

[16]  Fanny Villers,et al.  Goodness-of-fit tests for high-dimensional Gaussian linear models , 2007, 0711.2119.

[17]  Korbinian Strimmer,et al.  An empirical Bayes approach to inferring large-scale gene association networks , 2005, Bioinform..