Least squares model averaging by Mallows criterion

This paper is in response to a recent paper by Hansen (2007) who proposed an optimal model average estimator with weights selected by minimizing a Mallows criterion. The main contribution of Hansen's paper is a demonstration that the Mallows criterion is asymptotically equivalent to the squared error, so the model average estimator that minimizes the Mallows criterion also minimizes the squared error in large samples. We are concerned with two assumptions that accompany Hansen's approach. The first is the assumption that the approximating models are strictly nested in a way that depends on the ordering of regressors. Often there is no clear basis for the ordering and the approach does not permit non-nested models which are more realistic from a practical viewpoint. Second, for the optimality result to hold the model weights are required to lie within a special discrete set. In fact, Hansen noted both difficulties and called for extensions of the proof techniques. We provide an alternative proof which shows that the result on the optimality of the Mallows criterion in fact holds for continuous model weights and under a non-nested set-up that allows any linear combination of regressors in the approximating models that make up the model average estimator. These results provide a stronger theoretical basis for the use of the Mallows criterion in model averaging by strengthening existing findings.

[1]  K. Burnham,et al.  Model selection: An integral part of inference , 1997 .

[2]  B. M. Pötscher,et al.  CAN ONE ESTIMATE THE UNCONDITIONAL DISTRIBUTION OF POST-MODEL-SELECTION ESTIMATORS? , 2007, Econometric Theory.

[3]  Yuhong Yang Adaptive Regression by Mixing , 2001 .

[4]  V. Chernozhukov,et al.  Bayesian Econometrics , 2007 .

[5]  Andrew R. Barron,et al.  Information Theory and Mixing Least-Squares Regressions , 2006, IEEE Transactions on Information Theory.

[6]  C. L. Mallows Some comments on C_p , 1973 .

[7]  W. Newey,et al.  Convergence rates and asymptotic normality for series estimators , 1997 .

[8]  R. Shibata Asymptotically Efficient Selection of the Order of the Model for Estimating Parameters of a Linear Process , 1980 .

[9]  Bruce E. Hansen,et al.  Averaging estimators for autoregressions with a near unit root , 2010 .

[10]  J. Shao AN ASYMPTOTIC THEORY FOR LINEAR MODEL SELECTION , 1997 .

[11]  B. Hansen Least Squares Model Averaging , 2007 .

[12]  Yuhong Yang,et al.  Combining linear regression models: when and how , 2005 .

[13]  N. Hjort,et al.  Frequentist Model Average Estimators , 2003 .

[14]  Hannes Leeb,et al.  The Finite-Sample Distribution of Post-Model-Selection Estimators, and Uniform Versus Non-Uniform Approximations , 2000 .

[15]  C. Mallows Some Comments on Cp , 2000, Technometrics.

[16]  Benedikt M. Potscher,et al.  The Distribution of Model Averaging Estimators and an Impossibility Result Regarding Its Estimation , 2006, math/0702781.

[17]  P. Whittle,et al.  Bounds for the Moments of Linear and Quadratic Forms in Independent Variables , 1960 .

[18]  Bruce E. Hansen,et al.  Least-squares forecast averaging , 2008 .

[19]  R. Shibata An optimal selection of regression variables , 1981 .

[20]  H. Leeb,et al.  CAN ONE ESTIMATE THE UNCONDITIONAL DISTRIBUTION OF POST-MODEL-SELECTION ESTIMATORS? , 2003, Econometric Theory.

[21]  Paul Kabaila,et al.  ON VARIABLE SELECTION IN LINEAR REGRESSION , 2002, Econometric Theory.

[22]  Ker-Chau Li,et al.  Asymptotic Optimality for $C_p, C_L$, Cross-Validation and Generalized Cross-Validation: Discrete Index Set , 1987 .

[23]  Bruce E. Hansen,et al.  AVERAGING ESTIMATORS FOR REGRESSIONS WITH A POSSIBLE STRUCTURAL BREAK , 2009, Econometric Theory.

[24]  B. M. Pötscher,et al.  MODEL SELECTION AND INFERENCE: FACTS AND FICTION , 2005, Econometric Theory.