论文信息 - X-TCS: accuracy-based learning classifier system robotics

X-TCS: accuracy-based learning classifier system robotics

Most research in the held of learning classifier systems today concentrates on the accuracy-based XCS. This paper presents initial results from an extension of XCS that operates in continuous environments on a physical robot. This is compared with a similar extension based upon the simpler ZCS. The new system is shown to be capable of near optimal performance in a simple robotic task. To the best of our knowledge, this is the first application of an accuracy-based LCS to controlling a physical agent in the real world without a priori discretization.

Larry Bull | Matthew Studley | L. Bull | M. Studley

[1] Stewart W. Wilson. Mining Oblique Data with XCS , 2000, IWLCS.

[2] Inman Harvey,et al. Noise and the Reality Gap: The Use of Simulation in Evolutionary Robotics , 1995, ECAL.

[3] Seiji Yamada,et al. Real Robot Learning with Human Teaching , 2002 .

[4] Seiji Yamada,et al. Interactive classifier system for real robot learning , 2000, Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499).

[5] Sridhar Mahadevan,et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..

[6] Stewart W. Wilson. ZCS: A Zeroth Level Classifier System , 1994, Evolutionary Computation.

[7] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[8] Stefano Nolfi,et al. How to Evolve Autonomous Robots: Different Approaches in Evolutionary Robotics , 1994 .

[9] Inman Harvey,et al. Explorations in Evolutionary Robotics , 1993, Adapt. Behav..

[10] Dave Cliff,et al. Adding Temporary Memory to ZCS , 1994, Adapt. Behav..

[11] Christopher Stone,et al. For Real! XCS with Continuous-Valued Inputs , 2003, Evolutionary Computation.

[12] Francesco Mondada,et al. Evolution of homing navigation in a real mobile robot , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[13] John H. Holland,et al. COGNITIVE SYSTEMS BASED ON ADAPTIVE ALGORITHMS1 , 1978 .

[14] Minoru Asada,et al. Purposive Behavior Acquisition for a Real Robot by Vision-Based Reinforcement Learning , 2005, Machine Learning.

[15] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16] Stewart W. Wilson. Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[17] Ashwin Ram,et al. Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[18] Larry Bull,et al. TCS Learning Classifier System Controller on a Real Robot , 2002, PPSN.

[19] Larry Bull,et al. ZCS Redux , 2002, Evolutionary Computation.

[20] José M. Molina López,et al. Genetic learning of fuzzy reactive controllers , 1998, Robotics Auton. Syst..

[21] John H. Holland,et al. Cognitive systems based on adaptive algorithms , 1977, SGAR.

[22] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[23] Dario Floreano,et al. Active vision and feature selection in evolutionary behavioral systems , 2002 .

[24] Marco Colombetti,et al. Robot Shaping: An Experiment in Behavior Engineering , 1997 .

[25] Larry Bull,et al. Learning Classifier Systems , 2002, Annual Conference on Genetic and Evolutionary Computation.

[26] Andrea Bonarini,et al. Learning to compose fuzzy behaviors for autonomous agents , 1997, Int. J. Approx. Reason..

[27] Larry Bull,et al. Using the XCS Classifier System for Multi-objective Reinforcement Learning Problems , 2007, Artificial Life.

[28] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[29] Sebastian Thrun,et al. Issues in Using Function Approximation for Reinforcement Learning , 1999 .

[30] Larry Bull,et al. Self-adaptive mutation in classifier system controllers , 2000 .

[31] Stewart W. Wilson. Get Real! XCS with Continuous-Valued Inputs , 1999, Learning Classifier Systems.