Learning Classifier System with Convergence and Generalization
暂无分享,去创建一个
Osamu Katai | Keiki Takadama | Katsunori Shimohara | Atsushi Wada | O. Katai | K. Takadama | K. Shimohara | A. Wada
[1] John N. Tsitsiklis,et al. Feature-based methods for large scale dynamic programming , 2004, Machine Learning.
[2] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[3] Tommi S. Jaakkola,et al. Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.
[4] M. Pelikán,et al. Analyzing the evolutionary pressures in XCS , 2001 .
[5] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.
[6] Martin V. Butz,et al. Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems , 2005, IEEE Transactions on Evolutionary Computation.
[7] D. Goldberg,et al. Bounding Learning Time in XCS , 2004, GECCO.
[8] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming , 1995, ICML.
[9] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .
[10] L. Baird. Reinforcement Learning Through Gradient Descent , 1999 .
[11] Robert E. Smith,et al. The Fighter Aircraft LCS: A Case of Different LCS Goals and Techniques , 1999, Learning Classifier Systems.
[12] Stewart W. Wilson. Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.
[13] John H. Holland,et al. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .
[14] Artur Merke,et al. Convergence of synchronous reinforcement learning with linear function approximation , 2004, ICML '04.
[15] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[16] Stewart W. Wilson. ZCS: A Zeroth Level Classifier System , 1994, Evolutionary Computation.
[17] Richard S. Sutton,et al. Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .
[18] Pier Luca Lanzi,et al. An Analysis of Generalization in the XCS Classifier System , 1999, Evolutionary Computation.
[19] Martin V. Butz,et al. An algorithmic description of XCS , 2000, Soft Comput..
[20] Pier Luca Lanzi,et al. Learning classifier systems from a reinforcement learning perspective , 2002, Soft Comput..
[21] Stewart W. Wilson. Get Real! XCS with Continuous-Valued Inputs , 1999, Learning Classifier Systems.
[22] Dave Cliff,et al. Adding Temporary Memory to ZCS , 1994, Adapt. Behav..
[23] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[24] Marco Dorigo,et al. A comparison of Q-learning and classifier systems , 1994 .
[25] Michael I. Jordan,et al. Reinforcement Learning with Soft State Aggregation , 1994, NIPS.
[26] Rick L. Riolo,et al. Lookahead planning and latent learning in a classifier system , 1991 .