论文信息 - Why averaging classifiers can protect against overfitting

Why averaging classifiers can protect against overfitting

We study a simple learning algorithm for binary classification. Instead of predicting with the best hypothesis in the hypothesis class, this algorithm predicts with a weighted average of all hypotheses, weighted exponentially with respect to their training error. We show that the prediction of this algorithm is much more stable than the prediction of an algorithm that predicts with the best hypothesis. By allowing the algorithm to abstain from predicting on some examples, we show that the predictions it makes when it does not abstain are very reliable. Finally, we show that the probability that the algorithm abstains is at most about twice the generalization error of the best hypothesis in the class.

[1] László Györfi,et al. A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[2] D. Mackay,et al. Bayesian methods for adaptive models , 1992 .

[3] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[4] John Shawe-Taylor,et al. A PAC analysis of a Bayesian estimator , 1997, COLT '97.

[5] Yoav Freund,et al. Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[6] David Haussler,et al. How to use expert advice , 1993, STOC.

[7] Manfred K. Warmuth,et al. The Weighted Majority Algorithm , 1994, Inf. Comput..

[8] Journal of the Association for Computing Machinery , 1961, Nature.

[9] David A. McAllester. Some PAC-Bayesian Theorems , 1998, COLT' 98.

[10] Vladimir Vapnik,et al. Statistical learning theory , 1998 .

[11] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables , 1963 .

[12] Colin McDiarmid,et al. Surveys in Combinatorics, 1989: On the method of bounded differences , 1989 .

[13] David Haussler,et al. Occam's Razor , 1987, Inf. Process. Lett..

[14] Yoram Singer,et al. Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..