Reactive navigation of a mobile robot using a hierarchical set of learning agents

Within the context of learning sequences of basic tasks to build a complex behavior, a method is proposed which uses a hierarchical set of incrementally learning agents. Each one has to respect a particular perceptive constraint. To do so, an agent must choose either to execute basic tasks or to call another agent in order to use its decision-making competency, according to its perception. The learning procedure of each agent is achieved by a reinforcement learning inspired algorithm based on a heuristic which does not need internal parameters. A validation of the method is given, using a simulated Khepera robot. A hierarchical set of 4 agents is created. Each one is dedicated to the exploitation of particular perceptive data. They use 5 basic tasks in order to achieve a goal-reaching behavior which is formulated by a high level strategy composed of logical rules using perceptive primitives.

[1]  Minoru Asada,et al.  Coordination of multiple behaviors acquired by a vision-based reinforcement learning , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[2]  Francesco Mondada,et al.  Mobile Robot Miniaturisation: A Tool for Investigation in Control Algorithms , 1993, ISER.

[3]  H. Bersini,et al.  Three connectionist implementations of dynamic programming for optimal control: a preliminary comparative analysis , 1996, Proceedings of International Workshop on Neural Networks for Identification, Control, Robotics and Signal/Image Processing.

[4]  Minoru Asada,et al.  Action-based sensor space categorization for robot learning , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[5]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .