Robot Awareness in Cooperative Mobile Robot Learning

Most of the straight-forward learning approaches in cooperative robotics imply for each learning robot a state space growth exponential in the number of team members. To remedy the exponentially large state space, we propose to investigate a less demanding cooperation mechanism—i.e., various levels of awareness—instead of communication. We define awareness as the perception of other robots locations and actions. We recognize four different levels (or degrees) of awareness which imply different amounts of additional information and therefore have different impacts on the search space size (Θ(0), Θ(1), Θ(N), o(N),1 where N is the number of robots in the team). There are trivial arguments in favor of avoiding binding the increase of the search space size to the number of team members. We advocate that, by studying the maximum number of neighbor robots in the application context, it is possible to tune the parameters associated with a Θ(1) increase of the search space size and allow good learning performance. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application to illustrate our method. We verify that awareness allows cooperation, that cooperation shows better performance than a purely collective behavior and that learned cooperation shows better results than learned collective behavior.

[1]  Juan Miguel Santos,et al.  Exploration tuned reinforcement function , 1999, Neurocomputing.

[2]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[3]  V. Braitenberg Vehicles, Experiments in Synthetic Psychology , 1984 .

[4]  Charles W. Anderson,et al.  Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[5]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[6]  David W. Aha,et al.  Lazy Learning , 1997, Springer Netherlands.

[7]  Maja J. Mataric,et al.  Learning social behavior , 1997, Robotics Auton. Syst..

[8]  M. Dorigo Introduction to the Special Issue on Learning Autonomous Robots , 1996 .

[9]  Norihiko Ono,et al.  A Modular Approach to Multi-Agent Reinforcement Learning , 1996, ECAI Workshop LDAIS / ICMAS Workshop LIOME.

[10]  Alex Fukunaga,et al.  Cooperative mobile robotics: antecedents and directions , 1995 .

[11]  Hong Zhang,et al.  Collective Robotics: From Social Insects to Robots , 1993, Adapt. Behav..

[12]  Jonas Karlsson,et al.  Learning Multiple Goal Behavior via Task Decomposition and Dynamic Policy Merging , 1993 .

[13]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[14]  Rodney A. Brooks,et al.  Intelligence Without Reason , 1991, IJCAI.

[15]  Maja J. Mataric,et al.  Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[16]  Andrew B. Kahng,et al.  Cooperative Mobile Robotics: Antecedents and Directions , 1997, Auton. Robots.

[17]  Lynne E. Parker The effect of action recognition and robot awareness in cooperative robotic teams , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[18]  T. Michael Knasel,et al.  Robotics and autonomous systems , 1988, Robotics Auton. Syst..

[19]  Claude F. Touzet,et al.  Reinforcement learning and neural reinforcement learning , 1994, ESANN.

[20]  A. F. Adams,et al.  The Survey , 2021, Dyslexia in Higher Education.

[21]  Claude F. Touzet,et al.  Neural reinforcement learning for behaviour synthesis , 1997, Robotics Auton. Syst..

[22]  Lynne E. Parker Cooperative motion control for multi-target observation , 1997, Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97.

[23]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[24]  Tucker R. Balch,et al.  Communication in reactive multiagent robotic systems , 1995, Auton. Robots.

[25]  Noel E. Sharkey,et al.  Learning subsumptions for an autonomous robot , 1996 .