Scalable Perception-Action-Communication Loops With Convolutional and Graph Neural Networks

In this paper, we present a perception-action-communication loop design using Vision-based Graph Aggregation and Inference (VGAI). This multi-agent decentralized learning-to-control framework maps raw visual observations to agent actions, aided by local communication among neighboring agents. Our framework is implemented by a cascade of a convolutional and a graph neural network (CNN/GNN), addressing agent-level visual perception and feature learning, as well as swarm-level communication, local information aggregation and agent action inference, respectively. By jointly training the CNN and GNN, image features and communication messages are learned in conjunction to better address the specific task. We use imitation learning to train the VGAI controller in an offline phase, relying on a centralized expert controller. This results in a learned VGAI controller that can be deployed in a distributed manner for online execution. Additionally, the controller exhibits good scaling properties, with training in smaller teams and application in larger teams. Through a multi-agent flocking application, we demonstrate that VGAI yields performance comparable to or better than other decentralized controllers, using only the visual input modality and without accessing precise location or motion state information.

[1]  Tamás Vicsek,et al.  Outdoor flocking and formation flight with autonomous aerial robots , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Vijay Kumar,et al.  Minimum snap trajectory generation and control for quadrotors , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4]  Herbert G. Tanner Flocking with obstacle avoidance in switching networks of interconnected vehicles , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[5]  Carlos R. del-Blanco,et al.  DroNet: Learning to Fly by Driving , 2018, IEEE Robotics and Automation Letters.

[6]  Vijay Kumar,et al.  Towards a swarm of agile micro quadrotors , 2012, Autonomous Robots.

[7]  José M. F. Moura,et al.  Discrete Signal Processing on Graphs , 2012, IEEE Transactions on Signal Processing.

[8]  Libor Preucil,et al.  Low-cost embedded system for relative localization in robotic swarms , 2013, 2013 IEEE International Conference on Robotics and Automation.

[9]  Sergey Levine,et al.  (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.

[10]  Robert J. Wood,et al.  Science, technology and the future of small autonomous drones , 2015, Nature.

[11]  Martial Hebert,et al.  Learning monocular reactive UAV control in cluttered natural environments , 2012, 2013 IEEE International Conference on Robotics and Automation.

[12]  Libor Preucil,et al.  A Practical Multirobot Localization System , 2014, J. Intell. Robotic Syst..

[13]  Vijay Kumar,et al.  Visual Inertial Odometry Swarm: An Autonomous Swarm of Vision-Based Quadrotors , 2018, IEEE Robotics and Automation Letters.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Fernando Gama,et al.  Distributed Linear-Quadratic Control with Graph Neural Networks , 2021, ArXiv.

[16]  Gaurav S. Sukhatme,et al.  Crazyswarm: A large nano-quadcopter swarm , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[17]  J. Andrew Bagnell,et al.  Efficient Reductions for Imitation Learning , 2010, AISTATS.

[18]  Fernando Gama,et al.  Graphs, Convolutions, and Neural Networks: From Graph Filters to Graph Neural Networks , 2020, IEEE Signal Processing Magazine.

[19]  Pieter Abbeel,et al.  An Algorithmic Perspective on Imitation Learning , 2018, Found. Trends Robotics.

[20]  Tamás Vicsek,et al.  Optimized flocking of autonomous drones in confined environments , 2018, Science Robotics.

[21]  George J. Pappas,et al.  Stable flocking of mobile agents part I: dynamic topology , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[22]  Antonio G. Marques,et al.  Convolutional Neural Network Architectures for Signals Supported on Graphs , 2018, IEEE Transactions on Signal Processing.

[23]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[24]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[25]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[26]  Nikolai Smolyanskiy,et al.  Toward low-flying autonomous MAV trail navigation using deep neural networks for environmental awareness , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[27]  Dario Floreano,et al.  Learning Vision-Based Flight in Drone Swarms by Imitation , 2019, IEEE Robotics and Automation Letters.

[28]  Vijay Kumar,et al.  Decentralization of Multiagent Policies by Learning What to Communicate , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[29]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[30]  H. Witsenhausen A Counterexample in Stochastic Optimum Control , 1968 .

[31]  Fernando Gama,et al.  Graph Neural Networks: Architectures, Stability, and Transferability , 2020, Proceedings of the IEEE.

[32]  Fernando Gama,et al.  Stability Properties of Graph Neural Networks , 2019, IEEE Transactions on Signal Processing.

[33]  Vijay Kumar,et al.  Vision-based Multi-MAV Localization with Anonymous Relative Measurements Using Coupled Probabilistic Data Association Filter , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[35]  Vijay Kumar,et al.  Learning Decentralized Controllers for Robot Swarms with Graph Neural Networks , 2019, CoRL.

[36]  Pierre Vandergheynst,et al.  Graph Signal Processing: Overview, Challenges, and Applications , 2017, Proceedings of the IEEE.

[37]  Ke Sun,et al.  Feedback Enhanced Motion Planning for Autonomous Vehicles , 2020, ArXiv.

[38]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[39]  Jürgen Schmidhuber,et al.  A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots , 2016, IEEE Robotics and Automation Letters.

[40]  Giuseppe Loianno,et al.  Experimental Evaluation and Characterization of Radioactive Source Effects on Robot Visual Localization and Mapping , 2020, IEEE Robotics and Automation Letters.

[42]  Heesung Kwon,et al.  Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Fernando Gama,et al.  Discriminability of Single-Layer Graph Neural Networks , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[44]  Brian M. Sadler,et al.  VGAI: End-to-End Learning of Vision-Based Decentralized Controllers for Robot Swarms , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[45]  Luca Maria Gambardella,et al.  A simple and efficient approach for cooperative incremental learning in robot swarms , 2013, 2013 16th International Conference on Advanced Robotics (ICAR).