Strategies for Graphical Model Selection

We consider the problem of model selection for Bayesian graphical models, and embed it in the larger context of accounting for model uncertainty. Data analysts typically select a single model from some class of models, and then condition all subsequent inference on this model. However, this approach ignores model uncertainty, leading to poorly calibrated predictions: it will often be seen in retrospect that one’s uncertainty bands were not wide enough. The Bayesian analyst solves this problem by averaging over all plausible models when making inferences about quantities of interest. In many applications, however, because of the size of the model space and awkward integrals, this averaging will not be a practical proposition, and approximations are required. Here we examine the predictive performance of two recently proposed model averaging schemes. In the examples considered, both schemes outperform any single model that might reasonably have been selected.

[1]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[2]  Norman E. Breslow,et al.  Biostatistics and Bayes , 1990 .

[3]  R. Almond,et al.  KS-3000 : An Application of DDUCKS to Bone-Marrow Transplant Patient Support , 1992 .

[4]  D. Rubin,et al.  A Research Agenda for Assessment and Propagation of Model Uncertainty , 1987 .

[5]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[6]  David Madigan,et al.  Bayesian Methods for Estimating the Size of a Closed Population , 1992 .

[7]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[8]  D. Edwards,et al.  A fast procedure for model search in multidimensional contingency tables , 1985 .

[9]  James S. Hodges,et al.  Uncertainty, Policy Analysis and Statistics , 1987 .

[10]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[11]  J. A. Mercer Geoacoustic parameters and the depth-dependent Green's function for a silty sediment at 100 Hz in shallow water , 1988 .

[12]  R. Regal,et al.  The effects of model selection on confidence intervals for the size of a closed population. , 1991, Statistics in medicine.

[13]  Graham J. G. Upton,et al.  The Exploratory Analysis of Survey Data Using Log‐Linear Models , 1991 .

[14]  Adrian E. Raftery,et al.  Approximate Bayes factors for generalized linear models , 1988 .

[15]  E. Fowlkes,et al.  Evaluating Logistic Models for Large Contingency Tables , 1988 .

[16]  Peter C. Cheeseman,et al.  Bayesian prediction for artificial intelligence , 1987, Int. J. Approx. Reason..

[17]  L. Tierney Markov Chains for Exploring Posterior Distributions , 1994 .