X-TCS: accuracy-based learning classifier system robotics

Most research in the held of learning classifier systems today concentrates on the accuracy-based XCS. This paper presents initial results from an extension of XCS that operates in continuous environments on a physical robot. This is compared with a similar extension based upon the simpler ZCS. The new system is shown to be capable of near optimal performance in a simple robotic task. To the best of our knowledge, this is the first application of an accuracy-based LCS to controlling a physical agent in the real world without a priori discretization.

[1]  Stewart W. Wilson Mining Oblique Data with XCS , 2000, IWLCS.

[2]  Inman Harvey,et al.  Noise and the Reality Gap: The Use of Simulation in Evolutionary Robotics , 1995, ECAL.

[3]  Seiji Yamada,et al.  Real Robot Learning with Human Teaching , 2002 .

[4]  Seiji Yamada,et al.  Interactive classifier system for real robot learning , 2000, Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499).

[5]  Sridhar Mahadevan,et al.  Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..

[6]  Stewart W. Wilson ZCS: A Zeroth Level Classifier System , 1994, Evolutionary Computation.

[7]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[8]  Stefano Nolfi,et al.  How to Evolve Autonomous Robots: Different Approaches in Evolutionary Robotics , 1994 .

[9]  Inman Harvey,et al.  Explorations in Evolutionary Robotics , 1993, Adapt. Behav..

[10]  Dave Cliff,et al.  Adding Temporary Memory to ZCS , 1994, Adapt. Behav..

[11]  Christopher Stone,et al.  For Real! XCS with Continuous-Valued Inputs , 2003, Evolutionary Computation.

[12]  Francesco Mondada,et al.  Evolution of homing navigation in a real mobile robot , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[13]  John H. Holland,et al.  COGNITIVE SYSTEMS BASED ON ADAPTIVE ALGORITHMS1 , 1978 .

[14]  Minoru Asada,et al.  Purposive Behavior Acquisition for a Real Robot by Vision-Based Reinforcement Learning , 2005, Machine Learning.

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  Stewart W. Wilson Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[17]  Ashwin Ram,et al.  Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[18]  Larry Bull,et al.  TCS Learning Classifier System Controller on a Real Robot , 2002, PPSN.

[19]  Larry Bull,et al.  ZCS Redux , 2002, Evolutionary Computation.

[20]  José M. Molina López,et al.  Genetic learning of fuzzy reactive controllers , 1998, Robotics Auton. Syst..

[21]  John H. Holland,et al.  Cognitive systems based on adaptive algorithms , 1977, SGAR.

[22]  Leslie Pack Kaelbling,et al.  Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[23]  Dario Floreano,et al.  Active vision and feature selection in evolutionary behavioral systems , 2002 .

[24]  Marco Colombetti,et al.  Robot Shaping: An Experiment in Behavior Engineering , 1997 .

[25]  Larry Bull,et al.  Learning Classifier Systems , 2002, Annual Conference on Genetic and Evolutionary Computation.

[26]  Andrea Bonarini,et al.  Learning to compose fuzzy behaviors for autonomous agents , 1997, Int. J. Approx. Reason..

[27]  Larry Bull,et al.  Using the XCS Classifier System for Multi-objective Reinforcement Learning Problems , 2007, Artificial Life.

[28]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[29]  Sebastian Thrun,et al.  Issues in Using Function Approximation for Reinforcement Learning , 1999 .

[30]  Larry Bull,et al.  Self-adaptive mutation in classifier system controllers , 2000 .

[31]  Stewart W. Wilson Get Real! XCS with Continuous-Valued Inputs , 1999, Learning Classifier Systems.