With Whom to Communicate: Learning Efficient Communication for Multi-Robot Collision Avoidance

Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions as a means to cope with the lack of a central system coordinating the efforts of all robots. Especially in complex dynamic environments, the coordination boost allowed by communication is critical to avoid collisions between cooperating robots. However, the risk of collision between a pair of robots fluctuates through their motion and communication is not always needed. Additionally, constant communication makes much of the still valuable information shared in previous time steps redundant. This paper presents an efficient communication method that solves the problem of "when" and with "whom" to communicate in multi-robot collision avoidance scenarios. In this approach, every robot learns to reason about other robots' states and considers the risk of future collisions before asking for the trajectory plans of other robots. We evaluate and verify the proposed communication strategy in simulation with four quadrotors and compare it with three baseline strategies: non-communicating, broadcasting and a distance-based method broadcasting information with quadrotors within a predefined distance.

[1]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[2]  Saptarshi Bandyopadhyay,et al.  Fast, On-line Collision Avoidance for Dynamic Vehicles Using Buffered Voronoi Cells , 2017, IEEE Robotics and Automation Letters.

[3]  Dinesh Manocha,et al.  DCAD: Decentralized Collision Avoidance With Dynamics Constraints for Agile Quadrotor Swarms , 2020, IEEE Robotics and Automation Letters.

[4]  Zongqing Lu,et al.  Learning Attentional Communication for Multi-Agent Cooperation , 2018, NeurIPS.

[5]  Roland Siegwart,et al.  Robust collision avoidance for multiple micro aerial vehicles using nonlinear model predictive control , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6]  Robert Fitch,et al.  Planning-Aware Communication for Decentralised Multi-Robot Coordination , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[8]  Rudy R. Negenborn,et al.  Coordination of Multiple Vessels Via Distributed Nonlinear Model Predictive Control , 2018, 2018 European Control Conference (ECC).

[9]  Jonathan P. How,et al.  Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  Rob Fergus,et al.  Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[11]  Joelle Pineau,et al.  TarMAC: Targeted Multi-Agent Communication , 2018, ICML.

[12]  Angela P. Schoellig,et al.  Online Trajectory Generation With Distributed Model Predictive Control for Multi-Robot Motion Planning , 2020, IEEE Robotics and Automation Letters.

[13]  Dinesh Manocha,et al.  Reciprocal n-Body Collision Avoidance , 2011, ISRR.

[14]  Manuela M. Veloso,et al.  Reasoning about joint beliefs for execution-time communication decisions , 2005, AAMAS '05.

[15]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[16]  Pieter Abbeel,et al.  Emergence of Grounded Compositional Language in Multi-Agent Populations , 2017, AAAI.

[17]  Jonathan P. How,et al.  Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning , 2019, IEEE Access.

[18]  Robert Fitch,et al.  Communication-efficient motion coordination and data fusion in information gathering teams , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[20]  Dinesh Manocha,et al.  Reciprocal Velocity Obstacles for real-time multi-agent navigation , 2008, 2008 IEEE International Conference on Robotics and Automation.

[21]  Roland Siegwart,et al.  Cooperative Collision Avoidance for Nonholonomic Robots , 2018, IEEE Transactions on Robotics.

[22]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[23]  Javier Alonso-Mora,et al.  Chance-Constrained Collision Avoidance for MAVs in Dynamic Environments , 2019, IEEE Robotics and Automation Letters.

[24]  Javier Alonso-Mora,et al.  B-UAVC: Buffered Uncertainty-Aware Voronoi Cells for Probabilistic Multi-Robot Collision Avoidance , 2019, 2019 International Symposium on Multi-Robot and Multi-Agent Systems (MRS).

[25]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[26]  Victor R. Lesser,et al.  Analyzing myopic approaches for multi-agent communication , 2005, IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[27]  Jur P. van den Berg,et al.  Generalized reciprocal collision avoidance , 2015, Int. J. Robotics Res..