Learning and Meta-Learning for Coordination of Autonomous Unmanned Vehicles - A Preliminary Analysis

We study models of coordination, negotiation and collaboration in multi-agent systems (MAS). More specifically, we investigate scalable models and protocols for various distributed consensus coordination problems in large-scale MAS. Examples of such problems include conflict avoidance, leader election and coalition formation. We are particularly interested in application domains where robotic or unmanned vehicle agents interact with each other in real-time, as they try to jointly complete various tasks in complex dynamic environments, and where decisions often need to be made “on the fly”. Such MAS applications, we argue, necessitate a multi-tiered approach to learning how to coordinate effectively. One such collaborative MAS application domain is ensembles of autonomous micro unmanned aerial vehicles (micro-UAVs). A large ensemble of micro-UAVs on a complex, multi-stage mission comprised of many diverse tasks with varying time and other resource requirements provides an excellent framework for studying multi-tiered learning how to better coordinate. A variety of tasks and their resource demands, complexity and unpredictability of the overall environment, types of coordination problems that the UAVs may encounter in the course of their mission, multiple time scales at which the overall system can use learning and adaptation in order to perform better in the future, and multiple logical and organizational levels at which large ensembles of micro-UAVs can be analyzed and optimized, all suggest the need for a multi-tiered approach to learning. We outline our theoretical and conceptual framework that integrates reinforcement learning and meta-learning, and discuss potential benefits that our framework could provide for enabling autonomous micro-UAVs (and other types of autonomous vehicles) to coordinate more effectively.

[1]  Victor R. Lesser,et al.  Coalitions Among Computationally Bounded Agents , 1997, Artif. Intell..

[2]  Sarit Kraus,et al.  Methods for Task Allocation via Agent Coalition Formation , 1998, Artif. Intell..

[3]  Ron Sun META-LEARNING PROCESSES IN MULTI-AGENT SYSTEMS , 2001 .

[4]  Craig Boutilier,et al.  Coordination in multiagent reinforcement learning: a Bayesian approach , 2003, AAMAS '03.

[5]  Lynne E. Parker,et al.  Multi-Robot Systems: From Swarms to Intelligent Automata , 2002, Springer Netherlands.

[6]  Myeong-Wuk Jang On efficient communication and service agent discovery in multi-agent systems , 2004, ICSE 2004.

[7]  Denise de Oliveira,et al.  Towards Joint Learning in Multiagent Systems Through Opportunistic , 2007 .

[8]  Michael Wooldridge,et al.  Introduction to multiagent systems , 2001 .

[9]  Craig Boutilier,et al.  Bayesian reinforcement learning for coalition formation under uncertainty , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[10]  Victor R. Lesser,et al.  Organization-based cooperative coalition formation , 2004, Proceedings. IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2004. (IAT 2004)..

[11]  Somesh Jha,et al.  Multi-Agent Coordination through Coalition Formation , 1997, ATAL.

[12]  Ricardo Vilalta,et al.  A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems , 2010, ICCS.

[13]  Gul Agha,et al.  AN ACTOR-BASED SIMULATION FOR STUDYING UAV COORDINATION , 2003 .

[14]  Lovekesh Vig,et al.  Issues in Multi-Robot Coalition Formation , 2005 .

[15]  Yoav Shoham,et al.  Multi-Agent Reinforcement Learning:a critical survey , 2003 .

[16]  Leen-Kiat Soh,et al.  Investigating reinforcement learning in multiagent coalition formation , 2004, AAAI 2004.

[17]  Xin Li,et al.  An Integrated Multilevel Learning Approach to Multiagent Coalition Formation , 2003, IJCAI.

[18]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[19]  Sarit Kraus,et al.  Task Allocation Via Coalition Formation Among Autonomous Agents , 1995, IJCAI.

[20]  Onn Shehory,et al.  Coalition structure generation with worst case guarantees , 2022 .

[21]  Barbara Messing,et al.  An Introduction to MultiAgent Systems , 2002, Künstliche Intell..

[22]  Ricardo Vilalta,et al.  Meta-Learning - Concepts and Techniques , 2010, Data Mining and Knowledge Discovery Handbook.

[23]  Gul A. Agha,et al.  Maximal Clique Based Distributed Coalition Formation for Task Allocation in Large-Scale Multi-agent Systems , 2004, MMAS.

[24]  Ricardo Vilalta,et al.  Metalearning - Applications to Data Mining , 2008, Cognitive Technologies.

[25]  Julie A. Adams,et al.  Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence , 2001, AI Mag..

[26]  Raffaello D'Andrea,et al.  Coordinating Hundreds of Cooperative, Autonomous Vehicles in Warehouses , 2007, AI Mag..

[27]  Smitha Reddy,et al.  Modeling A System Of UAVs On A , 2003 .