Online policy adaptation for ensemble classifiers
暂无分享,去创建一个
[1] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.
[4] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.
[5] Yi Li,et al. The Relaxed Online Maximum Margin Algorithm , 1999, Machine Learning.
[6] Charles W. Anderson,et al. Reinforcement Learning with Modular Neural Networks for Control , 1994 .
[7] Peter L. Bartlett,et al. Reinforcement Learning in POMDP's via Direct Gradient Ascent , 2000, ICML.
[8] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.
[9] Peter L. Bartlett,et al. Improved Generalization Through Explicit Optimization of Margins , 2000, Machine Learning.
[10] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .
[11] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[12] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.
[13] Yoav Freund,et al. Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.
[14] Alex M. Andrew,et al. Reinforcement Learning: : An Introduction , 1998 .
[15] L. Breiman. Arcing the edge , 1997 .