Multimodel Inference

The model selection literature has been generally poor at reflecting the deep foundations of the Akaike information criterion (AIC) and at making appropriate comparisons to the Bayesian information criterion (BIC). There is a clear philosophy, a sound criterion based in information theory, and a rigorous statistical foundation for AIC. AIC can be justified as Bayesian using a “savvy” prior on models that is a function of sample size and the number of model parameters. Furthermore, BIC can be derived as a non-Bayesian result. Therefore, the choice between AIC and BIC for model selection cannot be argued from a Bayesian versus frequentist perspective. Instead, the philosophical context, meaning what is assumed about reality, the role of approximating models, and the intent of model-based inference, should determine whether AIC or BIC is used. Various facets of such multimodel inference are presented here, particularly methods of model averaging.
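As a rough illustration of the quantities discussed above, the following sketch uses the standard definitions AIC = -2 log L + 2k and BIC = -2 log L + k log n, then converts criterion values into model weights proportional to exp(-Δ/2), where Δ is the difference from the smallest value in the set. The log-likelihoods, parameter counts, sample size, and per-model estimates are hypothetical numbers chosen only for demonstration, and the function names are illustrative rather than taken from any particular library.

```python
import math

def aic(log_lik, k):
    """AIC = -2 log L + 2k, with k the number of estimated parameters."""
    return -2.0 * log_lik + 2.0 * k

def bic(log_lik, k, n):
    """BIC = -2 log L + k log n, with n the sample size."""
    return -2.0 * log_lik + k * math.log(n)

def model_weights(scores):
    """Weights proportional to exp(-delta/2), where delta is each
    score minus the minimum score in the candidate set."""
    best = min(scores)
    rel = [math.exp(-0.5 * (s - best)) for s in scores]
    total = sum(rel)
    return [r / total for r in rel]

def model_average(estimates, weights):
    """Weighted average of per-model estimates of the same quantity."""
    return sum(w * e for w, e in zip(weights, estimates))

# Hypothetical candidate set: (maximized log-likelihood, parameter count).
candidates = [(-100.0, 2), (-98.0, 3), (-97.5, 4)]
n = 50  # hypothetical sample size

aic_scores = [aic(ll, k) for ll, k in candidates]
bic_scores = [bic(ll, k, n) for ll, k in candidates]
aic_w = model_weights(aic_scores)
bic_w = model_weights(bic_scores)

# Hypothetical estimates of one shared quantity under each model.
estimates = [1.10, 1.25, 1.31]
print("AIC weights:", [round(w, 3) for w in aic_w])
print("BIC weights:", [round(w, 3) for w in bic_w])
print("AIC-averaged estimate:", round(model_average(estimates, aic_w), 3))
```

The same weighting function is applied to both criteria, so any difference between the two sets of weights comes only from the penalty term (2k versus k log n).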
