Model choice: A minimum posterior predictive loss approach

SUMMARY Model choice is a fundamental and much-discussed activity in the analysis of datasets. Nonnested hierarchical models introducing random effects may not be handled by classical methods. Bayesian approaches using predictive distributions can be used, though the formal solution, which includes Bayes factors as a special case, can be criticised. We propose a predictive criterion where the goal is good prediction of a replicate of the observed data, tempered by fidelity to the observed values. We obtain this criterion by minimising posterior loss for a given model and then, among the models under consideration, selecting the one that minimises the criterion. For a broad range of losses, the criterion takes a form partitioned into a goodness-of-fit term and a penalty term. We illustrate its performance with an application to a large dataset involving residential property transactions.
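
To make the partition into a goodness-of-fit term and a penalty term concrete, the following is a minimal sketch for one special case only: squared error loss. The summary does not state this formula, so it is an assumption here. In that case the criterion is commonly written as D_k = P + (k/(k+1)) G, where G sums the squared differences between the posterior predictive means and the observed values, P sums the posterior predictive variances, and k >= 0 weights fidelity to the observed values. The function name `predictive_loss_criterion` and the array `y_rep` of replicate draws (one row per posterior draw, one column per observation) are hypothetical names chosen for illustration.

```python
import numpy as np


def predictive_loss_criterion(y_obs, y_rep, k=np.inf):
    """Sketch of a squared-error predictive loss criterion (assumed form).

    y_obs : observed values, shape (n,)
    y_rep : posterior predictive replicate draws, shape (draws, n)
    k     : weight on fidelity to the observed values; k = inf gives G + P

    Returns (D, G, P): the criterion, the goodness-of-fit term, and the
    penalty term.
    """
    y_obs = np.asarray(y_obs, dtype=float)
    y_rep = np.asarray(y_rep, dtype=float)
    if y_rep.ndim != 2 or y_rep.shape[1] != y_obs.shape[0]:
        raise ValueError("y_rep must have shape (draws, len(y_obs))")
    if y_rep.shape[0] < 2:
        raise ValueError("need at least two replicate draws")

    # Monte Carlo estimates of the predictive mean and variance
    # for each observation.
    mu = y_rep.mean(axis=0)
    var = y_rep.var(axis=0, ddof=1)

    # Goodness of fit: squared distance from predictive means to the data.
    G = float(np.sum((mu - y_obs) ** 2))
    # Penalty: total predictive variance, which grows for both
    # underfitted and overfitted models.
    P = float(np.sum(var))

    weight = 1.0 if np.isinf(k) else k / (k + 1.0)
    return weight * G + P, G, P
```

Under these assumptions, one would compute the criterion for each candidate model from its own replicate draws and prefer the model with the smallest value; reporting G and P separately shows whether a model loses on fit or on predictive variability.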
