Paper Information - Learning Classifier System with Convergence and Generalization - 字舞流文

Learning Classifier System with Convergence and Generalization

Osamu Katai | Keiki Takadama | Katsunori Shimohara | Atsushi Wada

[1] John N. Tsitsiklis, et al. Feature-based methods for large scale dynamic programming, 2004, Machine Learning.

[2] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.

[3] Tommi S. Jaakkola, et al. Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms, 2000, Machine Learning.

[4] M. Pelikán, et al. Analyzing the evolutionary pressures in XCS, 2001.

[5] Leemon C. Baird, et al. Residual Algorithms: Reinforcement Learning with Function Approximation, 1995, ICML.

[6] Martin V. Butz, et al. Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems, 2005, IEEE Transactions on Evolutionary Computation.

[7] D. Goldberg, et al. Bounding Learning Time in XCS, 2004, GECCO.

[8] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming, 1995, ICML.

[9] David E. Goldberg, et al. Genetic Algorithms in Search, Optimization and Machine Learning, 1988.

[10] L. Baird. Reinforcement Learning Through Gradient Descent, 1999.

[11] Robert E. Smith, et al. The Fighter Aircraft LCS: A Case of Different LCS Goals and Techniques, 1999, Learning Classifier Systems.

[12] Stewart W. Wilson. Classifier Fitness Based on Accuracy, 1995, Evolutionary Computation.

[13] John H. Holland, et al. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence, 1992.

[14] Artur Merke, et al. Convergence of synchronous reinforcement learning with linear function approximation, 2004, ICML '04.

[15] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.

[16] Stewart W. Wilson. ZCS: A Zeroth Level Classifier System, 1994, Evolutionary Computation.

[17] Richard S. Sutton, et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding, 1996.

[18] Pier Luca Lanzi, et al. An Analysis of Generalization in the XCS Classifier System, 1999, Evolutionary Computation.

[19] Martin V. Butz, et al. An algorithmic description of XCS, 2000, Soft Comput.

[20] Pier Luca Lanzi, et al. Learning classifier systems from a reinforcement learning perspective, 2002, Soft Comput.

[21] Stewart W. Wilson. Get Real! XCS with Continuous-Valued Inputs, 1999, Learning Classifier Systems.

[22] Dave Cliff, et al. Adding Temporary Memory to ZCS, 1994, Adapt. Behav.

[23] Chris Watkins, et al. Learning from delayed rewards, 1989.

[24] Marco Dorigo, et al. A comparison of Q-learning and classifier systems, 1994.

[25] Michael I. Jordan, et al. Reinforcement Learning with Soft State Aggregation, 1994, NIPS.

[26] Rick L. Riolo, et al. Lookahead planning and latent learning in a classifier system, 1991.