Learning of Parameter-Adaptive Reactive Controllers for Robotic Navigation

Reactive controllers are widely used in mobile robots because they are able to achieve successful performance in real-time. However, the connguration of a reactive controller depends highly on the operating conditions of the robot and the environment; thus, a reactive controller con-gured for one class of environments may not perform adequately in another. This paper presents a formulation of parameter-adaptive reactive controllers. Parameter-adaptive reactive controllers inherit all the advantages of traditional reactive controllers, but in addition they are able to adjust themselves to the current operating conditions of the robot and the environment in order to improve task performance. Additionally, the paper describes a multistrategy learning algorithm that combines ideas from case-based reasoning and reinforcement learning to construct a mapping between the operating conditions of the mobile robot and the appropriate controller conngura-tion; this mapping is in turn used to adapt the controller connguration dynamically. The algorithm is implemented and evaluated in a robotic navigation system that controls a Denning MRV-III mobile robot.

[1]  R. Larson,et al.  A survey of dynamic programming computational procedures , 1967, IEEE Transactions on Automatic Control.

[2]  Ashwin Ram,et al.  Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[3]  Pattie Maes,et al.  Situated agents can have goals , 1990, Robotics Auton. Syst..

[4]  David W. Payton,et al.  An architecture for reflexive autonomous vehicle control , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[5]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[6]  Leslie Pack Kaelbling,et al.  Learning in embedded systems , 1993 .

[7]  Martial Hebert,et al.  A behavior-based system for off-road navigation , 1994, IEEE Trans. Robotics Autom..

[8]  Ronald C. Arkin,et al.  Motor Schema — Based Mobile Robot Navigation , 1989, Int. J. Robotics Res..

[9]  Russell J. Clark,et al.  Case-Based Reactive Navigation: A Cased Based Method for On-Line Selection and Adaptation of Reactive Control Parameters in Autonomous Robotics Systems , 1992 .

[10]  Sridhar Mahadevan,et al.  Scaling Reinforcement Learning to Robotics by Exploiting the Subsumption Architecture , 1991, ML.

[11]  David K. Smith,et al.  Dynamic Programming and Optimal Control. Volume 1 , 1996 .

[12]  Richard S. Sutton,et al.  Reinforcement Learning with Replacing Eligibility Traces , 2005, Machine Learning.

[13]  B. Habibi,et al.  Pengi : An Implementation of A Theory of Activity , 1998 .

[14]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[15]  Richard S. Sutton,et al.  Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .

[16]  Mahesan Niranjan,et al.  On-line Q-learning using connectionist systems , 1994 .