Wishart distributions for decomposable covariance graph models

Gaussian covariance graph models encode marginal independence among the components of a multivariate random vector by means of a graph G. These models are distinctly different from the traditional concentration graph models (often also referred to as Gaussian graphical models or covariance selection models) since the zeros in the parameter are now reflected in the covariance matrix Σ, as compared to the concentration matrix Ω = Σ ―1 . The parameter space of interest for covariance graph models is the cone P G of positive definite matrices with fixed zeros corresponding to the missing edges of G. As in Letac and Massam [Ann. Statist. 35 (2007) 1278―1323], we consider the case where G is decomposable. In this paper, we construct on the cone P G a family of Wishart distributions which serve a similar purpose in the covariance graph setting as those constructed by Letac and Massam [Ann. Statist. 35 (2007) 1278―1323] and Dawid and Lauritzen [Ann. Statist. 21 (1993) 1272―1317] do in the concentration graph setting. We proceed to undertake a rigorous study of these "covariance" Wishart distributions and derive several deep and useful properties of this class. First, they form a rich conjugate family of priors with multiple shape parameters for covariance graph models. Second, we show how to sample from these distributions by using a block Gibbs sampling algorithm and prove convergence of this block Gibbs sampler. Development of this class of distributions enables Bayesian inference, which, in turn, allows for the estimation of E, even in the case when the sample size is less than the dimension of the data (i.e., when "n < p"), otherwise not generally possible in the maximum likelihood framework. Third, we prove that when G is a homogeneous graph, our covariance priors correspond to standard conjugate priors for appropriate directed acyclic graph (DAG) models. This correspondence enables closed form expressions for normalizing constants and expected values, and also establishes hyper-Markov properties for our class of priors. We also note that when G is homogeneous, the family IW QG of Letac and Massam [Ann. Statist. 35 (2007) 1278―1323] is a special case of our covariance Wishart distributions. Fourth, and finally, we illustrate the use of our family of conjugate priors on real and simulated data.

[1]  A. Butte,et al.  Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[2]  D. Edwards Introduction to graphical modelling , 1995 .

[3]  Kshitij Khare,et al.  Wishart Distributions for Covariance Graph Models , 2009 .

[4]  A. Roverato Cholesky decomposition of a hyper inverse Wishart matrix , 2000 .

[5]  Brendan J. Frey,et al.  Convolutional Factor Graphs as Probabilistic Models , 2004, UAI.

[6]  G. Kauermann On a dualization of graphical Gaussian models , 1996 .

[7]  R. Muirhead Aspects of Multivariate Statistical Theory , 1982, Wiley Series in Probability and Statistics.

[8]  Guido Consonni,et al.  Enriched conjugate and reference priors for the Wishart family on symmetric cones , 2003 .

[9]  N. Wermuth Linear Recursive Equations, Covariance Selection, and Path Analysis , 1980 .

[10]  N. Wermuth,et al.  Linear Dependencies Represented by Chain Graphs , 1993 .

[11]  Michel Grzebyk,et al.  On identification of multi-factor models with correlated residuals , 2004 .

[12]  Steen A. Andersson,et al.  Wishart Distributions on Homogeneous Cones , 2004 .

[13]  G'erard Letac,et al.  Wishart distributions for decomposable graphs , 2007, 0708.2380.

[14]  Zoubin Ghahramani,et al.  The Hidden Life of Latent Variables: Bayesian Learning with Mixed Graph Models , 2009, J. Mach. Learn. Res..

[15]  K. Athreya,et al.  ON THE CONVERGENCE OF THE MARKOV CHAIN SIMULATION METHOD , 1996 .

[16]  T. Richardson,et al.  Estimation of a covariance matrix with zeros , 2005, math/0508268.

[17]  H. Massam,et al.  FLEXIBLE COVARIANCE ESTIMATION IN GRAPHICAL , 2008 .

[18]  P. Diaconis,et al.  Conjugate Priors for Exponential Families , 1979 .

[19]  A. Roverato Hyper Inverse Wishart Distribution for Non-decomposable Graphs and its Application to Bayesian Inference for Gaussian Graphical Models , 2002 .

[20]  Jianhua Z. Huang,et al.  Covariance matrix selection and estimation via penalised normal likelihood , 2006 .

[21]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[22]  M. Pourahmadi Cholesky Decompositions and Estimation of A Covariance Matrix: Orthogonality of Variance--Correlation Parameters , 2007 .

[23]  Charles R. Johnson,et al.  Positive definite completions of partial Hermitian matrices , 1984 .

[24]  Nanny Wermuth,et al.  Multivariate Dependencies: Models, Analysis and Interpretation , 1996 .

[25]  Vern I. Paulsen,et al.  Schur Products and Matrix Completions , 1989 .

[26]  Thomas S. Richardson,et al.  Graphical Methods for Efficient Likelihood Inference in Gaussian Covariance Models , 2007, J. Mach. Learn. Res..

[27]  M. Pourahmadi,et al.  Bayesian analysis of covariance matrices and dynamic models for longitudinal data , 2002 .

[28]  A. Dawid,et al.  Hyper Markov Laws in the Statistical Analysis of Decomposable Graphical Models , 1993 .