Paper information - Online policy adaptation for ensemble classifiers

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper, the idea of using an adaptive policy for training and combining the base classifiers is put forward. The effectiveness of this approach for online learning is demonstrated by experimental results on several UCI benchmark databases.

Samy Bengio | Christos Dimitrakakis
