Multi-controller fusion in multi-layered reinforcement learning

Proposes multi-controller fusion in multi-layered reinforcement learning based on which an autonomous robot learns from lower level behaviors to higher level ones through its life. In our previous work (2000), we proposed a method that enables the behavior learning system to acquire knowledges/policies, to assign sub-tasks to learning modules by itself, to organize its own hierarchical structure, and to simplify the whole system by using only one kind of learning mechanism in all learning modules. However, it has a few drawbacks. The system cannot handle the change of the state variables. It is easily caught by a curse of dimension, if number of the state variables is large. In this paper, we propose an approach of decomposing the large state space at the bottom level into several subspaces and merge those subspaces at the higher level. This allows the system to reuse the policies learned before, to learn the policy against the new features, and therefore to avoid the curse of dimensionality. To show the validity of the proposed method, we apply it to a simple soccer situation in the context of RoboCup, and show the experimental results.

[1]  Minoru Asada,et al.  Observation strategy for decision making based on information criterion , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[2]  Stefano Nolfi,et al.  Self-Organization of Modules and Their Hierarchy in Robot Learning Problems: A Dynamical Systems App , 1997 .

[3]  Takashi Minato,et al.  Environmental Change Adaptation for Mobile Robot Navigation , 2000 .

[4]  Minoru Asada,et al.  Continuous valued Q-learning for vision-guided behavior acquisition , 1999, Proceedings. 1999 IEEE/SICE/RSJ. International Conference on Multisensor Fusion and Integration for Intelligent Systems. MFI'99 (Cat. No.99TH8480).

[5]  Sebastian Thrun,et al.  A lifelong learning perspective for mobile robot control , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[6]  Minoru Asada,et al.  Vision-guided behavior acquisition of a mobile robot by multi-layered reinforcement learning , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[7]  M. Yamamura,et al.  An approach to Lifelong Reinforcement Learning through Multiple Environments , 1998 .

[8]  Minoru Asada,et al.  Cooperative Behavior Acquisition in a Multiple Mobile Robot Environment by Co-evolution , 1998, RoboCup.

[9]  Minoru Asada,et al.  Observation Strategy for Decision Making Based on Information Criterion , 2000, RoboCup.