Gaussian Graphical Models

This chapter describes graphical models for multivariate continuous data based on the Gaussian (normal) distribution. We gently introduce the undirected models by examining the partial correlation structure of two datasets, one relating to the meat composition of pig carcasses and the other to body fat measurements. We then give a concise exposition of the model theory, covering topics such as maximum likelihood estimation using the IPS algorithm, hypothesis testing, and decomposability. We also explain the close relation between these models and linear regression models. We describe various approaches to model selection, including stepwise selection, the glasso algorithm, and the SIN algorithm, and apply these to the example datasets. We then turn to directed Gaussian graphical models that can be represented as DAGs. We explain a key concept, Markov equivalence, and describe how certain mixed graphs called PDAGs and essential graphs are used to represent equivalence classes of models. We describe various model selection algorithms for directed Gaussian models, including the PC algorithm, the hill-climbing algorithm, and the max-min hill-climbing algorithm, and apply them to the example datasets. Finally, we briefly describe Gaussian chain graph models and illustrate the use of a model selection algorithm for these models.
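As a minimal sketch of the partial correlation structure mentioned above, the following Python snippet computes partial correlations from the inverse covariance (concentration) matrix, using the standard identity that the partial correlation of two variables given all the others is the negated, rescaled off-diagonal concentration. The function name `partial_correlations` and the 3x3 covariance matrix are hypothetical illustrations and are not taken from the chapter or its datasets.

```python
import numpy as np

def partial_correlations(cov):
    """Partial correlation of each pair of variables given all the others."""
    k = np.linalg.inv(cov)            # concentration (precision) matrix K
    d = np.sqrt(np.diag(k))
    pcor = -k / np.outer(d, d)        # rho_ij|rest = -k_ij / sqrt(k_ii * k_jj)
    np.fill_diagonal(pcor, 1.0)
    return pcor

# Hypothetical covariance for a chain X1 - X2 - X3 (assumed for illustration):
# X1 and X3 are marginally correlated but conditionally independent given X2.
cov = np.array([[1.00, 0.50, 0.25],
                [0.50, 1.00, 0.50],
                [0.25, 0.50, 1.00]])

pcor = partial_correlations(cov)
print(np.round(pcor, 3))
```

In this assumed example, the partial correlation between the first and third variables is zero up to rounding, which corresponds to a missing edge between them in the undirected graph.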
