Assessment and Propagation of Model Uncertainty

Assessment and Propagation of Model Uncertainty By DAVID DRAPER* University of Bath, UK SUMMARY In most examples of inference and prediction, the expression of uncertainty about unknown quantities y on the basis of known quantities x is based on a model M that formalizes assumptions about how x and y are related. M will typically have two parts: structural assumptions S, such as the form of the link function and the choice of error distribution in a generalized linear model, and parameters 6 whose meaning is specific to a given choice of S. It is common in statistical theory and practice to acknowledge parametric uncertainty about 9 given a particular assumed structure S; it is less common to acknowledge structural uncertainty about S itself. A widely used approach, in fact, involves enlisting the aid of x to specify a plausible single best choice S* for S, and then proceeding as if S* were known to be correct. In general this approach fails to fully assess and propagate structural uncertainty, and may lead to miscali- brated uncertainty assessments about y given x. When miscalibration occurs it will often be in the direction of understatement of inferential or predictive uncertainty about y, leading to inaccurate scientific summaries and overconfi- dent decisions that do not incorporate sufficient hedging against uncertainty. In this paper I discuss a Bayesian approach to solving this problem that has long been available in principle but is only now becoming routinely feasible, by virtue of recent computational advances, and examine its implementation in examples that involve forecasting the price of oil and estimating the chance of catastrophic failure of the U.S. Space Shuttle. Keywords: BAYES FACTORS; CALIBRATION; FORECASTING; HIERARCHICAL MODELS; INFERENCE; MODEL SPECIFICATION; OVER-FITTING; PREDICTION; ROBUSTNESS; SENSITIVITY ANALYSIS; UNCERTAINTY ASSESSMENT 1. INTRODUCTION The general framework of problems in inference and prediction involves two sets of ingredients: unknown (s) y—such as the causal effect of a treatment in inference, or the price of something next year in prediction—and known (s) x, which will typically include both data and context. The desire is usually to express uncertainty about y in light of x, for instance through a probability specification of the form p(y\x). Specifications of this type that involve conditioning only on things that are known are rare, even in comparatively simple settings (e.g., Lindley, 1982); instead one typically appeals to a model M that formalizes judgments about how x and y are related. 1.1. Structural Uncertainty The model may be expressed (e.g., Draper et al., 1987; Hodges, 1987) in two parts as M = (S, 9), where S represents one or more sets of structural assumptions— such as a particular link function in a generalized linear model, or a particular form of heteroscedasticity or time dependence with non-IID data—and 9 represents parameters whose meaning is specific to the chosen structure(s). (It will often be possible to express a given model M in more than one way using this notation, but that does not affect the discussion that follows.) Once S is chosen, 9 typically follows ' Address for correspondence: Statistics Group, School of Mathematical Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK (d.draper@maths.bath.ac.uk).

[1]  Rick L. Edgeman,et al.  LISP-STAT: An Object-Oriented Environment for Statistical Computing and Dynamic Graphics , 1992 .

[2]  Alan J. Miller Subset Selection in Regression , 1992 .

[3]  S. Fienberg,et al.  The Geometry of a Two by Two Contingency Table , 1970 .

[4]  Peter E. Rossi,et al.  Bayesian analysis of dichotomous quantal response models , 1984 .

[5]  J. Rissanen Stochastic Complexity and Modeling , 1986 .

[6]  David Lindley The Bayesian Approach to Statistics. , 1980 .

[7]  Michael Lavine,et al.  Problems in Extrapolation Illustrated with Space Shuttle O-Ring Data , 1991 .

[8]  Adrian F. M. Smith,et al.  Bayesian computation via the gibbs sampler and related markov chain monte carlo methods (with discus , 1993 .

[9]  Peter C. Cheeseman,et al.  Bayesian prediction for artificial intelligence , 1987, Int. J. Approx. Reason..

[10]  J. Faraway On the Cost of Data Analysis , 1992 .

[11]  S. Geisser,et al.  A Predictive Approach to Model Selection , 1979 .

[12]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[13]  H. F. Martz,et al.  The risk of catastrophic failure of the solid rocket boosters on the space shuttle , 1992 .

[14]  Roger Koenker,et al.  [Rank-Based Robust Analysis of Linear Models. I. Exposition and Review]: Comment , 1988 .

[15]  James S. Hodges,et al.  Uncertainty, Policy Analysis and Statistics , 1987 .

[16]  D. P. Gaver,et al.  Combining Information: Statistical Issues and Opportunities for Research , 1993 .

[17]  Leland Stewart,et al.  Hierarchical Bayesian Analysis using Monte Carlo Integration: Computing Posterior Distributions when , 1987 .

[18]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[19]  David Lindley,et al.  Plural analysis: Multiple approaches to quantitative research , 1986 .

[20]  Adrian E. Raftery,et al.  Approximate Bayes factors for generalized linear models , 1988 .

[21]  D. Lindley The Use of Prior Probability Distributions in Statistical Inference and Decisions , 1961 .

[22]  Jeremy M. G. Taylor The Cost of Generalizing Logistic Regression , 1988 .

[23]  J. E. H. Shaw,et al.  Bayesian modelling and sensitivity analysis , 1986 .

[24]  James M. Dickey,et al.  Scientific Reporting and Personal Probabilities: Student's Hypothesis , 1973 .

[25]  Leland Stewart,et al.  Bayesian Posterior Distributions Over Sets of Possible Models with Inferences Computed by Monte Carlo Integration , 1986 .

[26]  W. H. Williams,et al.  A Simple Method for the Construction of Empirical Confidence Limits for Economic Forecasts , 1971 .

[27]  B. M. Pötscher Effects of Model Selection on Inference , 1991, Econometric Theory.

[28]  F. J. Schuurmann,et al.  On a two stage shrinkage testimator of the mean of a normal distribution , 1984 .

[29]  Jeffrey E. Harris,et al.  Bayes Methods for Combining the Results of Cancer Studies in Humans and other Species , 1983 .

[30]  S Greenland,et al.  Methods for epidemiologic analyses of multiple exposures: a review and comparative study of maximum-likelihood, preliminary-testing, and empirical-Bayes regression. , 1993, Statistics in medicine.

[31]  Henry E. Kyburg,et al.  Probability, Induction and Statistics. Bruno de Finetti , 1973 .

[32]  D. Spiegelhalter,et al.  Bayes Factors for Linear and Log‐Linear Models with Vague Prior Information , 1982 .

[33]  M. Seltzer,et al.  Sensitivity Analysis for Fixed Effects in the Hierarchical Model: A Gibbs Sampling Approach , 1993 .

[34]  A. P. Dawid,et al.  Present position and potential developments: some personal views , 1984 .

[35]  S. Weisberg Applied Linear Regression , 1981 .

[36]  D. Ruppert,et al.  Transformation and Weighting in Regression , 1988 .

[37]  B. Fischhoff,et al.  Assessing uncertainty in physical constants , 1986 .

[38]  B. Efron,et al.  Stein's Estimation Rule and Its Competitors- An Empirical Bayes Approach , 1973 .

[39]  George E. P. Box,et al.  Sampling and Bayes' inference in scientific modelling and robustness , 1980 .

[40]  Rangasami L. Kashyap,et al.  Optimal Choice of AR and MA Parts in Autoregressive Moving Average Models , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  J M Taylor,et al.  Models for the HIV infection and AIDS epidemic in the United States. , 1989, Statistics in medicine.

[42]  Adrian E. Raftery,et al.  Bayes factors and model uncertainty , 1995 .

[43]  D. Cox Tests of Separate Families of Hypotheses , 1961 .

[44]  A. Zellner An Introduction to Bayesian Inference in Econometrics , 1971 .

[45]  G. Y. Wong,et al.  The Hierarchical Logistic Regression Model for Multilevel Analysis , 1985 .

[46]  A. D. Sandoval,et al.  Outlook for world oil prices. [1982-1995] , 1982 .

[47]  D. Freedman,et al.  On the Impact of Variable Selection in Fitting Regression Equations , 1988 .

[48]  Arnold Zellner,et al.  To combine or not to combine? Issues of combining forecasts , 1992 .

[49]  Frederick Mosteller,et al.  Data Analysis and Regression , 1978 .

[50]  J. I The Design of Experiments , 1936, Nature.

[51]  Mitchell J. Mergenthaler Nonparametrics: Statistical Methods Based on Ranks , 1979 .

[52]  William W. Davis,et al.  Approximate Bayesian Predictive Distributions and Model Selection , 1979 .

[53]  C. Morris,et al.  Non-Optimality of Preliminary-Test Estimators for the Mean of a Multivariate Normal Distribution , 1972 .

[54]  A F Smith,et al.  Monitoring renal transplants: an application of the multiprocess Kalman filter. , 1983, Biometrics.

[55]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[56]  J. Geweke,et al.  Bayesian Inference in Econometric Models Using Monte Carlo Integration , 1989 .

[57]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[58]  B. Efron,et al.  A Leisurely Look at the Bootstrap, the Jackknife, and , 1983 .

[59]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[60]  R. Dennis Cook,et al.  Cross-Validation of Regression Models , 1984 .

[61]  Kenneth W. Wachter,et al.  The Future of Meta-Analysis , 1991 .

[62]  G. C. Tiao,et al.  A Further Look at Robustness via Bayes's Theorem , 1962 .

[63]  M. Geisel Bayesian Comparisons of Simple Macroeconomic Models , 1973 .

[64]  J. Friedman Multivariate adaptive regression splines , 1990 .

[65]  A. F. M. Smith,et al.  Progress with numerical and graphical methods for practical Bayesian statistics , 1987 .

[66]  L. M. Berliner,et al.  Robust Bayes and Empirical Bayes Analysis with $_\epsilon$-Contaminated Priors , 1986 .

[67]  D. Cox,et al.  Applied Statistics: Principles and Examples , 1982 .

[68]  D. Freedman,et al.  On the consistency of Bayes estimates , 1986 .

[69]  A. F. M. Smith,et al.  Present Position and Potential Developments: Some Personal Views Bayesian Statistics , 1984 .

[70]  Jerome H. Friedman Multivariate adaptive regression splines (with discussion) , 1991 .

[71]  U. Hjorth On model selection in the computer age , 1989 .

[72]  M. Lavine More Aspects of Polya Tree Distributions for Statistical Modelling , 1992 .

[73]  Edward B. Fowlkes,et al.  Risk analysis of the space shuttle: Pre-Challenger prediction of failure , 1989 .

[74]  D. Rubin,et al.  A Research Agenda for Assessment and Propagation of Model Uncertainty , 1987 .

[75]  A. F. Smith,et al.  Bayesian Methods in Practice: Experiences in the Pharmaceutical Industry , 1986 .

[76]  Brent R. Moulton A Bayesian approach to regression selection and estimation, with application to a price index for radio services , 1991 .

[77]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[78]  C. Mallows,et al.  Exchangeability and data analysis , 1993 .

[79]  A. Dawid Calibration-Based Empirical Probability , 1985 .

[80]  Daniel M. Kammen,et al.  Sea-level rise or fall? , 1992, Nature.