论文信息 - Greedy Model Averaging

Greedy Model Averaging

This paper considers the problem of combining multiple models to achieve a prediction accuracy not much worse than that of the best single model for least squares regression. It is known that if the models are mis-specified, model averaging is superior to model selection. Specifically, let n be the sample size, then the worst case regret of the former decays at the rate of O(1/n) while the worst case regret of the latter decays at the rate of O(1/√n). In the literature, the most important and widely studied model averaging method that achieves the optimal O(1/n) average regret is the exponential weighted model averaging (EWMA) algorithm. However this method suffers from several limitations. The purpose of this paper is to present a new greedy model averaging procedure that improves EWMA. We prove strong theoretical guarantees for the new procedure and illustrate our theoretical results with empirical examples.

Tong Zhang | Dong Dai | Tong Zhang | Dong Dai

[1] Yuhong Yang. Adaptive Regression by Mixing , 2001 .

[2] Arnak S. Dalalyan,et al. Optimal aggregation of affine estimators , 2011, COLT.

[3] L. Jones. A Simple Lemma on Greedy Approximation in Hilbert Space and Convergence Rates for Projection Pursuit Regression and Neural Network Training , 1992 .

[4] Olivier Catoni,et al. Statistical learning theory and stochastic optimization , 2004 .

[5] Jean-Yves Audibert,et al. Progressive mixture rules are deviation suboptimal , 2007, NIPS.

[6] A. Tsybakov,et al. Exponential Screening and optimal rates of sparse estimation , 2010, 1003.2654.

[7] A. Juditsky,et al. Learning by mirror averaging , 2005, math/0511468.

[8] Philippe Rigollet,et al. Kullback-Leibler aggregation and misspecified generalized linear models , 2009, 0911.2919.

[9] Andrew R. Barron,et al. Information Theory and Mixing Least-Squares Regressions , 2006, IEEE Transactions on Information Theory.