论文信息 - Obtaining Robust Control and Navigation Policies for Multi-robot Navigation via Deep Reinforcement Learning - 字舞流文

Obtaining Robust Control and Navigation Policies for Multi-robot Navigation via Deep Reinforcement Learning

H. Surmann | Oliver Urbann | Jonas Stenzel | Christian Jestel | Marius Brehler

[1] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[2] Wojciech M. Czarnecki,et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning , 2019, Nature.

[3] O. Purwin,et al. Path Planning by Negotiation for Decentralized Agents , 2007, 2007 American Control Conference.

[4] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[5] Jia Pan,et al. Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios , 2020, Int. J. Robotics Res..

[6] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[7] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.

[8] Jonas Stenzel,et al. Deep Reinforcement Learning for Mobile Robot Navigation , 2019, 2019 4th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS).

[9] Elman Mansimov,et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.

[10] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[11] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[12] Roni Stern,et al. Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks , 2019, SOCS.

[13] J. Schulman,et al. Leveraging Procedural Generation to Benchmark Reinforcement Learning , 2019, ICML.

[14] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.

[15] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[16] Michal Cáp,et al. Prioritized Planning Algorithms for Trajectory Coordination of Multiple Mobile Robots , 2014, IEEE Transactions on Automation Science and Engineering.

[17] Stefan Kohlbrecher,et al. A flexible and scalable SLAM system with full 3D motion estimation , 2011, 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics.

[18] Dinesh Manocha,et al. Reciprocal Velocity Obstacles for real-time multi-agent navigation , 2008, 2008 IEEE International Conference on Robotics and Automation.

[19] Igor Mordatch,et al. Emergent Tool Use From Multi-Agent Autocurricula , 2019, ICLR.

[20] Hui Cheng,et al. Connectivity Guaranteed Multi-robot Navigation via Deep Reinforcement Learning , 2019, CoRL.