Bayesian Covariance Selection ∗

We present a novel structural learning method called HdBCS that performs covariance selection in a Bayesian framework for datasets with tens of thousands of variables. HdBCS is based on the intrinsic connection between graphical models on undirected graphs and graphical models on directed acyclic graphs (Bayesian networks). We show how to produce and explore the corresponding association networks by Bayesian model averaging across the models identified. We illustrate the use of HdBCS with an example from a large-scale gene expression study of breast cancer.

[1]  S. E. Hills,et al.  Illustration of Bayesian Inference in Normal Data Models Using Gibbs Sampling , 1990 .

[2]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[3]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[4]  David Maxwell Chickering,et al.  A Transformational Characterization of Equivalent Bayesian Network Structures , 1995, UAI.

[5]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[6]  D. Madigan,et al.  Bayesian model averaging and model selection for markov equivalence classes of acyclic digraphs , 1996 .

[7]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[8]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[9]  E. George,et al.  APPROACHES FOR BAYESIAN VARIABLE SELECTION , 1997 .

[10]  Michael I. Jordan Graphical Models , 2003 .

[11]  David Heckerman,et al.  Parameter Priors for Directed Acyclic Graphical Models and the Characteriration of Several Probability Distributions , 1999, UAI.

[12]  David Maxwell Chickering,et al.  Dependency Networks for Inference, Collaborative Filtering, and Data Visualization , 2000, J. Mach. Learn. Res..

[13]  Edward I. George,et al.  The Practical Implementation of Bayesian Model Selection , 2001 .

[14]  W. Wong,et al.  Transitive functional annotation by shortest-path analysis of gene expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  A. Roverato Hyper Inverse Wishart Distribution for Non-decomposable Graphs and its Application to Bayesian Inference for Gaussian Graphical Models , 2002 .

[16]  M. West,et al.  Gene expression predictors of breast cancer outcomes , 2003, The Lancet.

[17]  G. Roberts,et al.  Bayesian Inference For Nondecomposable Graphical Gaussian Models. , 2003 .

[18]  A. Atay-Kayis,et al.  A Monte Carlo method to compute the marginal likelihood in non decomposable graphical Gaussian models , 2003 .

[19]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[20]  Michael A. West,et al.  Archival Version including Appendicies : Experiments in Stochastic Computation for High-Dimensional Graphical Models , 2005 .

[21]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[22]  G Leclercq,et al.  About GATA3, HNF3A, and XBP1, three genes co-expressed with the oestrogen receptor-α gene (ESR1) in breast cancer , 2004, Molecular and Cellular Endocrinology.

[23]  M. West,et al.  Covariance decomposition in multivariate analysis , 2004 .

[24]  M. West,et al.  Sparse graphical models for exploring gene expression data , 2004 .

[25]  Mike West,et al.  DIG - a system for gene annotation and functional discovery , 2005, Bioinform..