Learning emergent tasks for an autonomous mobile robot

We present an implementation of a reinforcement learning algorithm through the use of a special neural network topology, the AHC (adaptive heuristic critic). The AHC is used as a fusion supervisor of primitive behaviors in order to execute more complex robot behaviors, for example go to goal, surveillance or follow a path. The fusion supervisor is part of an architecture for the execution of mobile robot tasks which are composed of several primitive behaviors which act in a simultaneous or concurrent fashion. The architecture allows for learning to take place at the execution level, it incorporates the experience gained in executing primitive behaviors as well as the overall task. The implementation of this autonomous learning approach has been tested within OPMOR, a simulation environment for mobile robots and with our mobile platform, the UPM Robuter. Both, simulated and actual results are presented. The performance of the AHC neural network is adequate. Portions of this work has been implemented within the EEC ESPRIT 2483 PANORAMA Project.<<ETX>>

[1]  Ronald C. Arkin,et al.  Motor Schema — Based Mobile Robot Navigation , 1989, Int. J. Robotics Res..

[2]  Albert Jones,et al.  Generic architecture for intelligent control systems , 1992 .

[3]  Ben J. A. Kröse,et al.  Distributed adaptive control: The self-organization of structured behavior , 1992, Robotics Auton. Syst..

[4]  Walter Fritz World view and learning systems , 1992, Robotics Auton. Syst..

[5]  D. Gachet,et al.  A software architecture for behavioral control strategies of autonomous systems , 1992, Proceedings of the 1992 International Conference on Industrial Electronics, Control, Instrumentation, and Automation.

[6]  Herbert A. Simon,et al.  WHY SHOULD MACHINES LEARN , 1983 .

[7]  Luis Moreno,et al.  A neural network supervisor for behavioral primitives of autonomous systems , 1992, Proceedings of the 1992 International Conference on Industrial Electronics, Control, Instrumentation, and Automation.

[8]  Luis Moreno,et al.  A software development environment for autonomous mobile robots , 1994, Proceedings of IECON'94 - 20th Annual Conference of IEEE Industrial Electronics.

[9]  Fazel Famili,et al.  Integrating learning and decision-making in intelligent manufacturing systems , 1990, J. Intell. Robotic Syst..

[10]  E. A. Puente,et al.  OPMOR: optimization of motion control algorithms for mobile robots , 1992, Proceedings of the 1992 International Conference on Industrial Electronics, Control, Instrumentation, and Automation.

[11]  José del R. Millán,et al.  Learning To Avoid Obstacles Through Reinforcement: Noise-tolerance, Generalization And Dynamic Capabilities , 1992, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[13]  Grantham Pang A framework for intelligent control , 1991, J. Intell. Robotic Syst..

[14]  Ramón García-Martínez,et al.  The autonomous intelligent system , 1989, Robotics Auton. Syst..