An Improvement of AdaBoost to Avoid Overfitting

Recent work has shown that combining multiple versions of weak classifiers such as decision trees or neural networks results in reduced test set error. To study this in greater detail, we analyze the asymptotic behavior of AdaBoost. The theoretical analysis establishes the relation between the distribution of margins of the training examples and the generated voting classification rule. The paper presents asymptotic experimental results with RBF networks for the binary classification case, underlining the theoretical findings. Our experiments show that AdaBoost does indeed overfit. To avoid this and to obtain better generalization performance, we propose a regularized, improved version of AdaBoost, which we call AdaBoost_reg. We demonstrate the usefulness of this improvement in numerical simulations.
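For reference, the margin mentioned above can be stated in the standard notation of the boosting literature (the symbols here are illustrative and not necessarily those used later in the paper): for a voting classifier built from base hypotheses $h_t(x) \in \{-1,+1\}$ with non-negative weights $\alpha_t$, the margin of a training example $(x_i, y_i)$ with label $y_i \in \{-1,+1\}$ is

$$
\rho(x_i, y_i) \;=\; \frac{y_i \sum_{t=1}^{T} \alpha_t\, h_t(x_i)}{\sum_{t=1}^{T} \alpha_t} \;\in\; [-1, 1],
$$

so a positive margin means the weighted vote classifies the example correctly, and a larger margin indicates a more confident vote.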