Paper information - On Learning Robot Behaviors.

On Learning Robot Behaviors.

Reinforcement learning has recently been receiving increased attention as a method for robot learning with little or no a priori knowledge and a higher capability for reactive and adaptive behaviors. This paper presents a framework for reinforcement learning and several issues in applying the method to real robot tasks. Then, examples of real robot applications, especially vision-based reinforcement learning methods, are introduced to show how they cope with these issues.

Minoru Asada

[1] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[2] Rodney A. Brooks,et al. Learning to Coordinate Behaviors , 1990, AAAI.

[3] Gregory Z. Grudic,et al. Human-to-robot skill transfer using the SPORE approximation , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[4] Jonas Karlsson,et al. Learning Multiple Goal Behavior via Task Decomposition and Dynamic Policy Merging , 1993 .

[5] Minoru Asada,et al. Vision-based reinforcement learning for purposive behavior acquisition , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[6] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[7] Steven D. Whitehead,et al. A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning , 1991, AAAI.

[8] Sridhar Mahadevan,et al. Rapid Task Learning for Real Robots , 1993 .

[9] Minoru Asada,et al. Motion Sketch: Acquisition of Visual Motion Guided Behaviors , 1995, IJCAI.

[10] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.

[11] Patrick Reignier,et al. Learning to categorize perceptual space of a mobile robot using fuzzy-ART neural network , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[12] Rodney A. Brooks,et al. A Robust Layered Control System For A Mobile Robot , 1986 .

[13] Pattie Maes,et al. The Dynamics of Action Selection , 1989, IJCAI.

[14] Maja J. Matarić,et al. Learning to behave socially , 1994 .

[15] Minoru Asada,et al. Stereo sketch: stereo vision-based target reaching behavior acquisition with occlusion detection and avoidance , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[16] Rodney A. Brooks,et al. Elephants don't play chess , 1990, Robotics Auton. Syst..