Edward Grefenstette | Tim Rocktäschel | Heinrich Küttler | Viswanath Sivakumar | Nantas Nardelli | Thibaut Lavril | Marco Selvatici
[1] Martín Abadi, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, 2016, ArXiv.
[2] Tian Tian, et al. MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments, 2019.
[3] Shane Legg, et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures, 2018, ICML.
[4] Demis Hassabis, et al. Mastering the game of Go with deep neural networks and tree search, 2016, Nature.
[5] Marc G. Bellemare, et al. The Arcade Learning Environment: An Evaluation Platform for General Agents, 2012, J. Artif. Intell. Res.
[6] Joelle Pineau, et al. MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions, 2019, ArXiv.
[7] Nicolas Usunier, et al. High-Level Strategy Selection under Partial Observability in StarCraft: Brood War, 2018, ArXiv.
[8] Yi Wu, et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, 2017, NIPS.
[9] Alex Graves, et al. Asynchronous Methods for Deep Reinforcement Learning, 2016, ICML.
[10] Guy Lever, et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning, 2018, Science.
[11] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[12] Shimon Whiteson, et al. Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning, 2017, ICML.
[13] Demis Hassabis, et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play, 2018, Science.
[14] Luca Antiga, et al. Automatic differentiation in PyTorch, 2017.
[15] Sergey Levine, et al. End-to-End Training of Deep Visuomotor Policies, 2015, J. Mach. Learn. Res.