Robot reinforcement learning accuracy-based learning classifier systems with Fuzzy Policy Gradient descent(XCS-FPGRL)

This paper presented a novel approach XCS-FPGRL to research on robot reinforcement learning. XCS-FPGRL combines covering operator and genetic algorithm. The systems is responsible for adjusting precision and reducing search space according to some reward obtained from the environment, acts as an innovation discovery component which is responsible for discovering new better reinforcement learning rules. The experiment and simulation showed that robot reinforcement learning can achieved convergence very quickly.

[1]  Diane J. Cook,et al.  User-guided reinforcement learning of robot assistive tasks for an intelligent environment , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[2]  Petr Musílek,et al.  Enhanced learning classifier system for robot navigation , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Larry Bull,et al.  Towards distributed adaptive control for road traffic junction signals using learning classifier systems , 2004 .

[4]  Marcus Gemeinder,et al.  GA-based path planning for mobile robot systems employing an active search algorithm , 2003, Appl. Soft Comput..

[5]  Jie Shao,et al.  Research on Convergence of Robot Path Planning Based on LCS , 2009, 2009 Chinese Conference on Pattern Recognition.

[6]  Larry Bull A simple accuracy-based learning classifier system , 2003 .

[7]  Martin J. Oates,et al.  A Preliminary Investigation of Modified XCS as a Generic Data Mining Tool , 2001, IWLCS.

[8]  Yang Jing-yu,et al.  Research on Cnvergence of Multi-Robots Path Planning Based on Learning Classifier System , 2010 .

[9]  Tim Kovacs,et al.  Foundations of learning classifier systems: An introduction , 2005 .

[10]  Matthew Studley,et al.  Learning Classifier System Ensembles With Rule-Sharing , 2007, IEEE Transactions on Evolutionary Computation.

[11]  Abdullah Zawawi Talib,et al.  Learning Process Enhancement for Robot Behaviors , 2007 .

[12]  R.A. Salam,et al.  Applying steady state in genetic algorithm for robot behaviors , 2008, 2008 International Conference on Electronic Design.

[13]  Liu Shirong Research on Multi-Robot System Inspired by Biological Swarm Intelligence , 2007 .

[14]  John S. Bay Learning classifier systems for single and multiple mobile robots in unstructured environments , 1995, Other Conferences.