Multiclass learning, boosting, and error-correcting codes

We focus on methods for solving multiclass learning problems using only simple and efficient binary learners. We investigate the approach of Dietterich and Bakiri [2] based on error-correcting codes (which we call ECC). We identify error correlation as one of the key parameters influencing the performance of the ECC approach, and prove upper and lower bounds on the training error of the final hypothesis in terms of the error correlation among the various binary hypotheses. Boosting is a powerful and well-studied learning technique that appears to avoid the disadvantages of error correlation by cleverly weighting training examples and hypotheses. ADABOOST.OC [12] combines boosting with the ECC approach, yielding an algorithm that enjoys the performance advantages of boosting while relying only on simple binary weak learners. We propose a variant of this algorithm, which we call ADABOOST.ECC, that improves on the performance of ADABOOST.OC, both theoretically and experimentally, by using a different weighting of the votes of the weak hypotheses. In addition, ADABOOST.ECC is arguably a more direct reduction of multiclass learning to binary learning than previous multiclass boosting algorithms.
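To make the ECC approach concrete, the following is a minimal sketch of error-correcting output coding in the style of [2]: each of the k classes is assigned a binary codeword, one binary hypothesis is trained per code bit, and a new example is labeled with the class whose codeword is nearest in Hamming distance to the vector of binary predictions. The random code matrix, the depth-1 tree learner, and the function names here are illustrative assumptions, not the setup used in [2] or in our experiments.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def train_ecoc(X, y, code):
    """Train one binary hypothesis per column (bit) of the code matrix.

    code: (num_classes, num_bits) 0/1 array; row k is the codeword
    assigned to class k. (Illustrative binary learner: a decision stump.)
    """
    learners = []
    for bit in range(code.shape[1]):
        binary_labels = code[y, bit]  # relabel the training data for this bit
        learners.append(DecisionTreeClassifier(max_depth=1).fit(X, binary_labels))
    return learners

def predict_ecoc(X, learners, code):
    """Decode: output the class whose codeword is nearest in Hamming distance."""
    bits = np.column_stack([h.predict(X) for h in learners])          # (n, num_bits)
    hamming = (bits[:, None, :] != code[None, :, :]).sum(axis=2)      # (n, num_classes)
    return hamming.argmin(axis=1)

# Toy usage: 4 classes encoded with random 7-bit codewords.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = rng.integers(0, 4, size=200)
code = rng.integers(0, 2, size=(4, 7))
preds = predict_ecoc(X, train_ecoc(X, y, code), code)
```

If the binary hypotheses made independent errors, a code with large minimum Hamming distance would correct many of them; the bounds we prove quantify how correlated errors erode this guarantee, and the reweighting performed by boosting is what counteracts that correlation.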