Learning Classifier System with Convergence and Generalization

[1]  John N. Tsitsiklis,et al.  Feature-based methods for large scale dynamic programming , 2004, Machine Learning.

[2]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[3]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[4]  M. Pelikán,et al.  Analyzing the evolutionary pressures in XCS , 2001 .

[5]  Leemon C. Baird,et al.  Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[6]  Martin V. Butz,et al.  Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems , 2005, IEEE Transactions on Evolutionary Computation.

[7]  D. Goldberg,et al.  Bounding Learning Time in XCS , 2004, GECCO.

[8]  Geoffrey J. Gordon Stable Function Approximation in Dynamic Programming , 1995, ICML.

[9]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[10]  L. Baird Reinforcement Learning Through Gradient Descent , 1999 .

[11]  Robert E. Smith,et al.  The Fighter Aircraft LCS: A Case of Different LCS Goals and Techniques , 1999, Learning Classifier Systems.

[12]  Stewart W. Wilson Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[13]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[14]  Artur Merke,et al.  Convergence of synchronous reinforcement learning with linear function approximation , 2004, ICML '04.

[15]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[16]  Stewart W. Wilson ZCS: A Zeroth Level Classifier System , 1994, Evolutionary Computation.

[17]  Richard S. Sutton,et al.  Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .

[18]  Pier Luca Lanzi,et al.  An Analysis of Generalization in the XCS Classifier System , 1999, Evolutionary Computation.

[19]  Martin V. Butz,et al.  An algorithmic description of XCS , 2000, Soft Comput..

[20]  Pier Luca Lanzi,et al.  Learning classifier systems from a reinforcement learning perspective , 2002, Soft Comput..

[21]  Stewart W. Wilson Get Real! XCS with Continuous-Valued Inputs , 1999, Learning Classifier Systems.

[22]  Dave Cliff,et al.  Adding Temporary Memory to ZCS , 1994, Adapt. Behav..

[23]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[24]  Marco Dorigo,et al.  A comparison of Q-learning and classifier systems , 1994 .

[25]  Michael I. Jordan,et al.  Reinforcement Learning with Soft State Aggregation , 1994, NIPS.

[26]  Rick L. Riolo,et al.  Lookahead planning and latent learning in a classifier system , 1991 .