Learning Connectivity for Data Distribution in Robot Teams

Many algorithms for the control of multi-robot teams assume that the low-latency, global state information needed to coordinate agent actions can be readily disseminated among the team. However, in harsh environments with no existing communication infrastructure, robots must form ad-hoc networks, forcing the team to operate in a distributed fashion. To overcome this challenge, we propose a task-agnostic, decentralized, low-latency method for data distribution in ad-hoc networks using Graph Neural Networks (GNNs). Our approach enables multi-agent algorithms that rely on global state information to function by ensuring that this information is available at each robot. To do this, agents glean information about the topology of the network from packet transmissions and feed it to a locally running GNN, which instructs the agent when and where to transmit the latest state information. We train the distributed GNN communication policies via reinforcement learning, using the average Age of Information as the reward function, and show that this reward improves training stability compared to task-specific reward functions. Our approach performs favorably compared to industry-standard methods for data distribution such as random flooding and round robin. We also show that the trained policies generalize to larger teams of both static and mobile agents.
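As an illustration of the reward signal mentioned above, the following is a minimal sketch of how an average Age of Information could be computed for a small team. It is not the authors' implementation: the data layout (`last_update[i][j]` holding the generation time of agent `i`'s copy of agent `j`'s state), the function names `average_aoi`, `aoi_reward`, and `receive`, and the choice to negate the age so that fresher data yields a larger reward are all assumptions made for this example.

```python
from typing import List


def average_aoi(last_update: List[List[float]], now: float) -> float:
    """Mean age over all ordered pairs (i, j) of agent i's copy of agent j's state.

    last_update[i][j] is the time at which the copy of agent j's state
    currently held by agent i was generated (hypothetical layout).
    """
    n = len(last_update)
    total = sum(now - last_update[i][j] for i in range(n) for j in range(n))
    return total / (n * n)


def aoi_reward(last_update: List[List[float]], now: float) -> float:
    # Assumption: negate the age so that fresher information gives a higher reward.
    return -average_aoi(last_update, now)


def receive(last_update: List[List[float]], receiver: int, sender: int) -> None:
    """Merge the sender's buffer into the receiver's, keeping the newer entry per agent."""
    for j in range(len(last_update)):
        last_update[receiver][j] = max(last_update[receiver][j],
                                       last_update[sender][j])


if __name__ == "__main__":
    # Three agents whose buffers were all last refreshed at time 0.
    buffers = [[0.0] * 3 for _ in range(3)]
    print(average_aoi(buffers, now=2.0))  # 2.0: every entry is two time units old

    # At time 2, agent 0 samples its own state and agent 1 receives agent 0's buffer.
    buffers[0][0] = 2.0
    receive(buffers, receiver=1, sender=0)
    print(round(average_aoi(buffers, now=2.0), 3))  # 1.556: two entries are now fresh
```

In this toy setting, a single successful transmission lowers the team-wide average age, which is the quantity a communication policy trained on this kind of reward would be driven to keep small regardless of the downstream task.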
