Distance-Based Multi-Robot Coordination on Pocket Drones

We present a fully realised system demonstrating decentralised coordination of micro aerial vehicles (MAVs), or pocket drones, based on distance information. This entails the development of an ultra-lightweight hardware solution for measuring inter-drone distances, as well as a model for learning good control policies. The model combines a recurrent neural network with a Deep Q-Network (DQN): the recurrent network provides bearing information to the DQN, which in turn chooses movement actions to avoid collisions and reach a desired position. Overall, we provide a complete system that allows multiple drones to navigate in a confined space based only on UWB distance information and velocity input. We tackle the problem of neural networks facing real-world sensor noise by combining the network with a particle filter, and show that this combination outperforms the traditional particle filter in terms of convergence speed and robustness. A video is available at: https://youtu.be/yj6QqhOzpok.
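The pipeline described above can be sketched as follows. This is a hypothetical, heavily simplified illustration, not the authors' implementation: the layer sizes, random weights, `(sin, cos)` bearing encoding, and the four-action set are all assumptions made for clarity. A recurrent net folds a sequence of UWB distance and velocity readings into a bearing estimate, and a small Q-network maps that estimate to Q-values over movement actions.

```python
import numpy as np

rng = np.random.default_rng(0)

class TinyRNN:
    """Toy recurrent net: sequence of (distance, vx, vy) -> bearing estimate.

    Stands in for the paper's recurrent bearing network; weights are random
    here, whereas the real network would be trained on flight data.
    """
    def __init__(self, n_in=3, n_hidden=16):
        self.Wx = rng.normal(scale=0.1, size=(n_hidden, n_in))
        self.Wh = rng.normal(scale=0.1, size=(n_hidden, n_hidden))
        self.Wo = rng.normal(scale=0.1, size=(2, n_hidden))  # (sin, cos) head

    def forward(self, seq):
        h = np.zeros(self.Wh.shape[0])
        for x in seq:                       # x = [distance, vx, vy]
            h = np.tanh(self.Wx @ x + self.Wh @ h)
        s, c = self.Wo @ h
        return np.arctan2(s, c)             # bearing in radians, [-pi, pi]

class TinyDQN:
    """Toy Q-network: (bearing, distance) state -> Q-value per action."""
    ACTIONS = ["forward", "left", "right", "hover"]  # assumed action set

    def __init__(self, n_in=2, n_hidden=32):
        self.W1 = rng.normal(scale=0.1, size=(n_hidden, n_in))
        self.W2 = rng.normal(scale=0.1, size=(len(self.ACTIONS), n_hidden))

    def q_values(self, state):
        return self.W2 @ np.maximum(0.0, self.W1 @ state)  # ReLU MLP

# One forward pass: noisy distance/velocity history -> bearing -> greedy action.
rnn, dqn = TinyRNN(), TinyDQN()
seq = np.array([[1.2, 0.1, 0.0], [1.1, 0.1, 0.0], [1.0, 0.1, 0.0]])
bearing = rnn.forward(seq)
q = dqn.q_values(np.array([bearing, seq[-1, 0]]))
action = dqn.ACTIONS[int(np.argmax(q))]
```

In the paper's full system, the bearing estimate would additionally be fused with a particle filter to cope with real-world UWB noise before being handed to the policy.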