Accelerated K-Serial Stable Coalition for Dynamic Capture and Resource Defense

Coalition is an important mean of multi-robot systems to collaborate on common tasks. An effective and adaptive coalition strategy is essential for the online performance in dynamic and unknown environments. In this work, the problem of territory defense by large-scale heterogeneous robotic teams is considered. The tasks include surveillance, capture of dynamic targets, and perimeter defense over valuable resources. Since each robot can choose among many tasks, it remains a challenging problem to coordinate jointly these robots such that the overall utility is maximized. This work proposes a generic coalition strategy called K-serial stable coalition algorithm (KS-COAL). Different from centralized approaches, it is distributed and anytime, meaning that only local communication is required and a K-serial Nash-stable solution is ensured. Furthermore, to accelerate adaptation to dynamic targets and resource distribution that are only perceived online, a heterogeneous graph attention network (HGAN)-based heuristic is learned to select more appropriate parameters and promising initial solutions during local optimization. Compared with manual heuristics or end-to-end predictors, it is shown to both improve online adaptability and retain the quality guarantee. The proposed methods are validated rigorously via large-scale simulations with hundreds of robots, against several strong baselines including GreedyNE and FastMaxSum.

[1]  Vijay R. Kumar,et al.  Graph Neural Networks for Decentralized Multi-Robot Target Tracking , 2022, IEEE International Symposium on Safety, Security and Rescue Robotics.

[2]  Dimitra Panagou,et al.  Multiagent Planning and Control for Swarm Herding in 2-D Obstacle Environments Under Bounded Inputs , 2021, IEEE Transactions on Robotics.

[3]  Nicholas M. Stiffler,et al.  A Visibility Roadmap Sampling Approach for a Multi-Robot Visibility-Based Pursuit-Evasion Problem , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Akansel Cosgun,et al.  Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning , 2020, IEEE Robotics and Automation Letters.

[5]  Quoc Bao Vo,et al.  An Anytime Algorithm for Large-scale Heterogeneous Task Allocation , 2020, 2020 25th International Conference on Engineering of Complex Computer Systems (ICECCS).

[6]  Frank L. Lewis,et al.  Solutions for Multiagent Pursuit-Evasion Games on Communication Graphs: Finite-Time Capture and Asymptotic Behaviors , 2020, IEEE Transactions on Automatic Control.

[7]  Edison Pignaton de Freitas,et al.  Multi-UAV Based Crowd Monitoring System , 2020, IEEE Transactions on Aerospace and Electronic Systems.

[8]  Meir Pachter,et al.  An Introduction to Pursuit-evasion Differential Games , 2020, 2020 American Control Conference (ACC).

[9]  F. Heintz,et al.  An anytime algorithm for optimal simultaneous coalition structure generation and assignment , 2020, Autonomous Agents and Multi-Agent Systems.

[10]  Lihua Xie,et al.  Cooperative Pursuit With Multi-Pursuer and One Faster Free-Moving Evader , 2020, IEEE Transactions on Cybernetics.

[11]  Eloy Garcia,et al.  Multiple Pursuer Multiple Evader Differential Games , 2019, IEEE Transactions on Automatic Control.

[12]  Yanfang Ye,et al.  Heterogeneous Graph Attention Network , 2019, WWW.

[13]  Jan Eric Lenssen,et al.  Fast Graph Representation Learning with PyTorch Geometric , 2019, ArXiv.

[14]  Vijay Kumar,et al.  Decentralization of Multiagent Policies by Learning What to Communicate , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[15]  Eloy Garcia,et al.  Pursuit-evasion of an Evader by Multiple Pursuers , 2018, 2018 International Conference on Unmanned Aircraft Systems (ICUAS).

[16]  Philippe Ciblat,et al.  A Coalition Formation Game for Distributed Node Clustering in Mobile Ad Hoc Networks , 2017, IEEE Transactions on Wireless Communications.

[17]  Maria L. Gini Multi-Robot Allocation of Tasks with Temporal and Ordering Constraints , 2017, AAAI.

[18]  Nicholas R. Jennings,et al.  Coalition structure generation: A survey , 2015, Artif. Intell..

[19]  Nicholas R. Jennings,et al.  Deploying the max-sum algorithm for decentralised coordination and task allocation of unmanned aerial vehicles for live aerial imagery collection , 2012, 2012 IEEE International Conference on Robotics and Automation.

[20]  Geoffrey A. Hollinger,et al.  Search and pursuit-evasion in mobile robotics , 2011, Auton. Robots.

[21]  Tomoki Toda,et al.  Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Lovekesh Vig,et al.  Multi-robot coalition formation , 2006, IEEE Transactions on Robotics.

[23]  Calin Belta,et al.  Discrete abstractions for robot motion planning and control in polygonal environments , 2005, IEEE Transactions on Robotics.

[24]  Renato F. Werneck,et al.  Robust Branch-and-Cut-and-Price for the Capacitated Vehicle Routing Problem , 2004, Math. Program..

[25]  R. Jonker,et al.  Improving the Hungarian assignment algorithm , 1986 .

[26]  Bernhard Rinner,et al.  Cooperative Robots to Observe Moving Targets: Review , 2018, IEEE Transactions on Cybernetics.

[27]  Nicholas R. Jennings,et al.  A hybrid exact algorithm for complete set partitioning , 2016, Artif. Intell..