Environmental change adaptation for mobile robot navigation

Most existing robot learning methods assume that the environment in which the robot operates remains unchanged, so the robot must learn from scratch whenever it encounters a new environment. This paper proposes a method that adapts a robot to environmental changes by efficiently transferring a policy learned in previous environments to a new one and effectively modifying it to cope with the changes. The resulting policy (a part of the state transition map) may not be optimal in any individual environment, but it can absorb the differences between multiple environments. We apply the method to a mobile robot navigation problem in which the task is to reach a target while avoiding obstacles, based on uninterpreted sonar and visual information. Experimental results demonstrate the validity of the method, and a discussion is given.
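The transfer idea described above, reusing a policy learned in one environment as the starting point for learning in a changed one, can be sketched with tabular Q-learning on a toy corridor world. Everything here (the chain environments, reward values, and function names such as `q_learning` and `make_chain`) is an illustrative assumption, not the paper's actual navigation setup or algorithm.

```python
import random

def q_learning(env_step, n_states, n_actions, q=None,
               episodes=300, alpha=0.5, gamma=0.9, eps=0.1):
    # Tabular Q-learning; `q` carries a policy transferred from a
    # previous environment, or None to learn from scratch.
    if q is None:
        q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(50):
            if random.random() < eps:
                a = random.randrange(n_actions)      # explore
            else:
                a = max(range(n_actions), key=lambda i: q[s][i])  # exploit
            s2, r, done = env_step(s, a)
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
            if done:
                break
    return q

def make_chain(goal, n=5):
    # A 1-D corridor of n states: action 1 moves right, action 0 moves
    # left; reaching `goal` yields reward 1 and ends the episode.
    def step(s, a):
        s2 = max(0, min(n - 1, s + (1 if a == 1 else -1)))
        done = (s2 == goal)
        return s2, (1.0 if done else 0.0), done
    return step

random.seed(0)
# Learn in the "previous" environment, then transfer the Q-table into
# a changed environment (goal moved) and continue learning there.
q_old = q_learning(make_chain(goal=4), n_states=5, n_actions=2)
q_new = q_learning(make_chain(goal=3), n_states=5, n_actions=2, q=q_old)
```

Warm-starting the value table this way lets the shared structure of the two environments survive the change, which loosely mirrors the paper's point that a single transferred-and-modified policy can absorb differences between multiple environments rather than being relearned per environment.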