Learning to Cooperate via an Attention-Based Communication Neural Network in Decentralized Multi-Robot Exploration †

In a decentralized multi-robot exploration problem, the robots have to cooperate effectively to map a strange environment as soon as possible without a centralized controller. In the past few decades, a set of “human-designed” cooperation strategies have been proposed to address this problem, such as the well-known frontier-based approach. However, many real-world settings, especially the ones that are constantly changing, are too complex for humans to design efficient and decentralized strategies. This paper presents a novel approach, the Attention-based Communication neural network (CommAttn), to “learn” the cooperation strategies automatically in the decentralized multi-robot exploration problem. The communication neural network enables the robots to learn the cooperation strategies with explicit communication. Moreover, the attention mechanism we introduced additionally can precisely calculate whether the communication is necessary for each pair of agents by considering the relevance of each received message, which enables the robots to communicate only with the necessary partners. The empirical results on a simulated multi-robot disaster exploration scenario demonstrate that our proposal outperforms the traditional “human-designed” methods, as well as other competing “learning-based” methods in the exploration task.

[1]  Huaimin Wang,et al.  Deep Learning-based Cooperative Trail Following for Multi-Robot System , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[2]  Héctor H. González-Baños,et al.  Navigation Strategies for Exploring Indoor Environments , 2002, Int. J. Robotics Res..

[3]  Brian Yamauchi,et al.  Frontier-based exploration using multiple robots , 1998, AGENTS '98.

[4]  Rob Fergus,et al.  MazeBase: A Sandbox for Learning from Games , 2015, ArXiv.

[5]  Sanem Sariel,et al.  REAL TIME AUCTION BASED ALLOCATION OF TASKS FOR MULTI-ROBOT EXPLORATION PROBLEM IN DYNAMIC ENVIRONMENTS , 2005 .

[6]  Vijay Kumar,et al.  Information-theoretic mapping using Cauchy-Schwarz Quadratic Mutual Information , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Xing Zhou,et al.  Learning to Cooperate in Decentralized Multi-robot Exploration of Dynamic Environments , 2018, ICONIP.

[8]  Tiejian Luo,et al.  Learning to Communicate via Supervised Attentional Message Processing , 2018, CASA.

[9]  Jaime L. Bernstein,et al.  Current Status of Cost Utility Analyses in Total Joint Arthroplasty: A Systematic Review , 2015, Clinical orthopaedics and related research.

[10]  Jaime Valls Miró,et al.  Mutual information-based exploration on continuous occupancy maps , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  Long Wang,et al.  Recent Advances in Consensus of Multi-Agent Systems: A Brief Survey , 2017, IEEE Transactions on Industrial Electronics.

[12]  Timothy Patten,et al.  Decentralised Monte Carlo Tree Search for Active Perception , 2016, WAFR.

[13]  Ahmed M. Elmogy,et al.  Multi-robot Task Allocation: A Review of the State-of-the-Art , 2015, Advances in Social Media Analysis.

[14]  Jen Jen Chung,et al.  Implicit adaptive multi-robot coordination in dynamic environments , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Arturo Gil,et al.  A comparison of path planning strategies for autonomous exploration and mapping of unknown environments , 2012, Auton. Robots.

[16]  Shimon Whiteson,et al.  Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks , 2016, ArXiv.

[17]  Rob Fergus,et al.  Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[19]  Michail G. Lagoudakis,et al.  The Generation of Bidding Rules for Auction-Based Robot Coordination , 2005 .

[20]  Evangelos Markakis,et al.  Auction-Based Multi-Robot Routing , 2005, Robotics: Science and Systems.

[21]  Lynne E. Parker,et al.  Guest editorial advances in multirobot systems , 2002, IEEE Trans. Robotics Autom..

[22]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[23]  Jonathan P. How,et al.  Deep Decentralized Multi-task Multi-Agent RL under Partial Observability , 2017 .

[24]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[25]  Evangelos Markakis,et al.  The Power of Sequential Single-Item Auctions for Agent Coordination , 2006, AAAI.

[26]  L. Preucil,et al.  Balancing the information gain against the movement cost for multi-robot frontier exploration , 2007 .

[27]  Henrik I. Christensen,et al.  Coordination strategies for multi-robot exploration and mapping , 2014, Int. J. Robotics Res..

[28]  William Whittaker,et al.  Computationally efficient information-theoretic exploration of pits and caves , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[29]  Mingyang Geng,et al.  Sensor Fusion-Based Cooperative Trail Following for Autonomous Multi-Robot System † , 2019, Sensors.

[30]  Wolfram Burgard,et al.  A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation , 2016 .

[31]  Murray Shanahan,et al.  Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[32]  Jun Wang,et al.  Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games , 2017, ArXiv.

[33]  Yedid Hoshen,et al.  VAIN: Attentional Multi-agent Predictive Modeling , 2017, NIPS.

[34]  Shane T. Barratt Active Robotic Mapping through Deep Reinforcement Learning , 2017, ArXiv.

[35]  Risto Ritala,et al.  Planning for robotic exploration based on forward simulation , 2015, Robotics Auton. Syst..

[36]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[37]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[38]  Sven Koenig,et al.  Progress on Agent Coordination with Cooperative Auctions , 2010, AAAI.