Frequentist Model Average Estimators

The traditional use of model selection methods in practice is to proceed as if the final selected model had been chosen in advance, without acknowledging the additional uncertainty introduced by model selection. This often means underreporting of variability and too optimistic confidence intervals. We build a general large-sample likelihood apparatus in which limiting distributions and risk properties of estimators post-selection as well as of model average estimators are precisely described, also explicitly taking modeling bias into account. This allows a drastic reduction in complexity, as competing model averaging schemes may be developed, discussed, and compared inside a statistical prototype experiment where only a few crucial quantities matter. In particular, we offer a frequentist view on Bayesian model averaging methods and give a link to generalized ridge estimators. Our work also leads to new model selection criteria. The methods are illustrated with real data applications.

[1]  N. Hjort,et al.  The Focused Information Criterion , 2003 .

[2]  M. Peruggia Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach (2nd ed.) , 2003 .

[3]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[4]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[5]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[6]  Yuhong Yang Adaptive Regression by Mixing , 2001 .

[7]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[8]  Hannes Leeb,et al.  The Finite-Sample Distribution of Post-Model-Selection Estimators, and Uniform Versus Non-Uniform Approximations , 2000 .

[9]  P. Green,et al.  Trans-dimensional Markov chain Monte Carlo , 2000 .

[10]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[11]  Peter Bühlmann,et al.  Efficient and adaptive post-model-selection estimators , 1999 .

[12]  Paul Kabaila,et al.  VALID CONFIDENCE INTERVALS IN REGRESSION AFTER VARIABLE SELECTION , 1998, Econometric Theory.

[13]  K. Burnham,et al.  Model selection: An integral part of inference , 1997 .

[14]  Robert Tibshirani,et al.  The out-of-bootstrap method for model averaging and selection , 1997 .

[15]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[16]  Paul Kabaila,et al.  The Effect of Model Selection on Confidence Regions and Prediction Regions , 1995, Econometric Theory.

[17]  C. Chatfield Model uncertainty, data mining and statistical inference , 1995 .

[18]  Dean P. Foster,et al.  The risk inflation criterion for multiple regression , 1994 .

[19]  D. Pollard,et al.  Asymptotics for minimisers of convex processes , 2011, 1107.3806.

[20]  B. M. Pötscher Effects of Model Selection on Inference , 1991, Econometric Theory.

[21]  Clifford M. Hurvich,et al.  The impact of model selection on inference in linear regression , 1990 .

[22]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[23]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[24]  D. Cox,et al.  Asymptotic techniques for use in statistics , 1989 .

[25]  P. Sen,et al.  On preliminary test and shrinkage m-estimation in linear models , 1987 .

[26]  E. George Combining Minimax Shrinkage Estimators , 1986 .

[27]  E. George Minimax Multiple Shrinkage Estimation , 1986 .

[28]  N. Hjort Bayes estimators and asymptotic efficiency in parametric counting process models , 1986 .

[29]  P. Bickel Parametric Robustness: Small Biases can be Worthwhile , 1984 .

[30]  P. Bickel MINIMAX ESTIMATION OF THE MEAN OF A NORMAL DISTRIBUTION SUBJECT TO DOING WELL AT A POINT , 1983 .

[31]  P. Bickel Minimax Estimation of the Mean of a Normal Distribution when the Parameter Space is Restricted , 1981 .

[32]  R. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[33]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[34]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[35]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .