Methods and Criteria for Model Selection

Model selection is an important part of any statistical analysis and, indeed, is central to the pursuit of science in general. Many authors have examined the question of model selection from both frequentist and Bayesian perspectives, and many tools for selecting the “best model” have been suggested in the literature. This paper considers the various proposals from a Bayesian decision–theoretic perspective.

[1]  L. M. M.-T. Theory of Probability , 1929, Nature.

[2]  I. Good,et al.  Probability and the Weighting of Evidence. , 1951 .

[3]  D. Lindley A STATISTICAL PARADOX , 1957 .

[4]  N. Draper,et al.  Applied Regression Analysis , 1966 .

[5]  D. Lindley The Choice of Variables in Multiple Regression , 1968 .

[6]  Robert W. Kennard,et al.  A Note on the Cp Statistic , 1971 .

[7]  Robert W. Kennard,et al.  A Note on the Cp Statistic , 1971 .

[8]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[9]  C. L. Mallows Some comments on C_p , 1973 .

[10]  C. L. Mallows Some Comments onCp , 1973 .

[11]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[12]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[13]  David M. Allen,et al.  The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction , 1974 .

[14]  David Lindley,et al.  A Class of Utility Functions , 1976 .

[15]  Franklin A. Graybill,et al.  Theory and Application of the Linear Model , 1976 .

[16]  M. Stone,et al.  Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[17]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[18]  J. Lawless,et al.  Efficient Screening of Nonnormal Regression Models , 1978 .

[19]  S. Geisser,et al.  A Predictive Approach to Model Selection , 1979 .

[20]  D. Spiegelhalter,et al.  Bayes Factors and Choice Criteria for Linear Models , 1980 .

[21]  Wayne S. Smith,et al.  Interactive Elicitation of Opinion for a Normal Linear Model , 1980 .

[22]  George E. P. Box,et al.  Sampling and Bayes' inference in scientific modelling and robustness , 1980 .

[23]  J. Dickey,et al.  Bayesian Decision Theory and the Simplification of Models , 1980 .

[24]  John Geweke,et al.  Estimating regression models of finite but unknown order , 1981 .

[25]  Norman R. Draper,et al.  Applied regression analysis (2. ed.) , 1981, Wiley series in probability and mathematical statistics.

[26]  T. Hassard,et al.  Applied Linear Regression , 2005 .

[27]  R. Katz On Some Criteria for Estimating the Order of a Markov Chain , 1981 .

[28]  G. Shafer Lindley's Paradox , 1982 .

[29]  Fulvio Spezzaferri,et al.  A Predictive Model Selection Criterion , 1984 .

[30]  More comments , 1984 .

[31]  S. Weisberg,et al.  Applied Linear Regression (2nd ed.). , 1986 .

[32]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[33]  T. J. Mitchell,et al.  Bayesian Variable Selection in Linear Regression , 1988 .

[34]  A. Koehler,et al.  A Comparison of the Akaike and Schwarz Criteria for Selecting Model Order , 1988 .

[35]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[36]  J. N. R. Jeffers,et al.  Graphical Models in Applied Multivariate Statistics. , 1990 .

[37]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[38]  M. Aitkin Posterior Bayes Factors , 1991 .

[39]  Nicholas G. Polson,et al.  Inference for nonconjugate Bayesian Models using the Gibbs sampler , 1991 .

[40]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[41]  Paul H. Garthwaite,et al.  Elicitation of Prior Distributions for Variable-Selection Problems in Regression , 1992 .

[42]  A. Atkinson Subset Selection in Regression , 1992 .

[43]  Alan J. Miller Subset Selection in Regression , 1992 .

[44]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[45]  L. Breiman The Little Bootstrap and other Methods for Dimensionality Selection in Regression: X-Fixed Prediction Error , 1992 .

[46]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[47]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[48]  A. Gelfand,et al.  Bayesian Model Choice: Asymptotics and Exact Calculations , 1994 .

[49]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[50]  Dean P. Foster,et al.  The risk inflation criterion for multiple regression , 1994 .

[51]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[52]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[53]  B. Carlin,et al.  Bayesian Model Choice Via Markov Chain Monte Carlo Methods , 1995 .

[54]  Purushottam W. Laud,et al.  Predictive Model Selection , 1995 .

[55]  A. O'Hagan,et al.  Fractional Bayes factors for model comparison , 1995 .

[56]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[57]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[58]  C. Mallows More comments on C p , 1995 .

[59]  S. Chib Marginal Likelihood from the Gibbs Output , 1995 .

[60]  J. Berger,et al.  The Intrinsic Bayes Factor for Model Selection and Prediction , 1996 .

[61]  Xiao-Li Meng,et al.  SIMULATING RATIOS OF NORMALIZING CONSTANTS VIA A SIMPLE IDENTITY: A THEORETICAL EXPLORATION , 1996 .

[62]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[63]  D. S. Sivia,et al.  Data Analysis , 1996, Encyclopedia of Evolutionary Psychological Science.

[64]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[65]  P. Green,et al.  On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion) , 1997 .

[66]  David Lindley,et al.  Some comments on Bayes factors , 1997 .

[67]  Discussion on "Choosing among models when none of them are true", by Key, J.T., Pericchi, L.R. and Smith, A.F.M. , 1997 .

[68]  Dean Phillips Foster,et al.  Calibration and Empirical Bayes Variable Selection , 1997 .

[69]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[70]  A. O’Hagan,et al.  Properties of intrinsic and fractional Bayes factors , 1997 .

[71]  E. George,et al.  APPROACHES FOR BAYESIAN VARIABLE SELECTION , 1997 .

[72]  J. Kadane,et al.  Experiences in elicitation , 1998 .

[73]  Xiao-Li Meng,et al.  Simulating Normalizing Constants: From Importance Sampling to Bridge Sampling to Path Sampling , 1998 .

[74]  T. Fearn,et al.  Multivariate Bayesian variable selection and prediction , 1998 .

[75]  Anthony O'Hagan,et al.  Eliciting expert beliefs in substantial practical applications , 1998 .

[76]  C. H. Oh,et al.  Some comments on , 1998 .

[77]  T. Fearn,et al.  The choice of variables in multivariate regression: a non-conjugate Bayesian decision theory approach , 1999 .

[78]  Alan E. Gelfand,et al.  Model choice: A minimum posterior predictive loss approach , 1998, AISTATS.

[79]  R. Tibshirani,et al.  The Covariance Inflation Criterion for Adaptive Model Selection , 1999 .

[80]  E. George The Variable Selection Problem , 2000 .

[81]  Robert W. Wilson,et al.  Regressions by Leaps and Bounds , 2000, Technometrics.

[82]  Paul H. Garthwaite,et al.  Non‐conjugate prior distribution assessment for multivariate normal sampling , 2001 .

[83]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo Methods for Computing Bayes Factors , 2001 .

[84]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[85]  Luis R. Pericchi,et al.  ACCURATE AND STABLE BAYESIAN MODEL SELECTION: THE MEDIAN INTRINSIC BAYES FACTOR* , 2002 .

[86]  J. Bernardo,et al.  Bayesian Hypothesis Testing: a Reference Approach , 2002 .

[87]  K. Roeder,et al.  Journal of the American Statistical Association: Comment , 2006 .

[88]  Peter Congdon,et al.  Bayesian model choice based on Monte Carlo estimates of posterior model probabilities , 2006, Comput. Stat. Data Anal..