论文信息 - Autonomous reconstruction of state space for learning of robot behavior

Autonomous reconstruction of state space for learning of robot behavior

When an autonomous robot is to learn its behavior, whether an appropriate state space is available or not is a critical issue for the flexibility and efficiency of the learning process. What is problematic is that it is usually very difficult to prepare such an ideal state space manually beforehand. We propose a new state space "reconstruction" method. With this, behavior-based robots can autonomously "rebuild" their state spaces after they accumulate behavior experience using initial state spaces. This reconstruction approach is more advantageous than the conventional state space construction methods or incremental state partitioning methods in that it achieves both the efficiency in the learning process and the optimality of the resultant behavior performance.

Shinichi Nakasuka | Takehisa Yairi | Koichi Hori

[1] Minoru Asada,et al. Action-based sensor space categorization for robot learning , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[2] Minoru Asada,et al. Reasonable performance in less learning time by real robot based on incremental state space segmentation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[3] J. Ross Quinlan,et al. Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[4] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[5] A. Meystel,et al. Multiresolutional intelligent controller for baby robot , 1995, Proceedings of Tenth International Symposium on Intelligent Control.

[6] Andrew W. Moore,et al. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces , 2004, Machine Learning.

[7] Shinichi Nakasuka,et al. Simultaneous learning of situation classification based on rewards and behavior selection based on the situation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.