暂无分享,去创建一个
Tom Schaul | Georg Ostrovski | Diana Borsa | Iurii Kemaev | T. Schaul | Georg Ostrovski | Diana Borsa | Iurii Kemaev
[1] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[2] Pieter Abbeel,et al. Decoupling Representation Learning from Reinforcement Learning , 2020, ICML.
[3] John Thangarajah,et al. Adapting to Reward Progressivity via Spectral Reinforcement Learning , 2021, ICLR.
[4] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[5] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[6] Marlos C. Machado,et al. Generalization and Regularization in DQN , 2018, ArXiv.
[7] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[8] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[9] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[10] David Silver,et al. Learning values across many orders of magnitude , 2016, NIPS.
[11] Debadeepta Dey,et al. Learning Anytime Predictions in Neural Networks via Adaptive Loss Balancing , 2017, AAAI.
[12] David Held,et al. Adaptive Auxiliary Task Weighting for Reinforcement Learning , 2019, NeurIPS.
[13] Daniel Guo,et al. Agent57: Outperforming the Atari Human Benchmark , 2020, ICML.
[14] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[15] Rémi Munos,et al. Recurrent Experience Replay in Distributed Reinforcement Learning , 2018, ICLR.
[16] Rémi Munos,et al. Observe and Look Further: Achieving Consistent Performance on Atari , 2018, ArXiv.
[17] David Silver,et al. Meta-Gradient Reinforcement Learning , 2018, NeurIPS.
[18] Zhao Chen,et al. GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks , 2017, ICML.
[19] Tom Schaul,et al. Adapting Behaviour for Learning Progress , 2019, ArXiv.
[20] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[21] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[22] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[23] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[24] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[25] Wojciech Czarnecki,et al. Multi-task Deep Reinforcement Learning with PopArt , 2018, AAAI.
[26] Pierre-Yves Oudeyer,et al. CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning , 2018, ICML.
[27] Max Jaderberg,et al. Population Based Training of Neural Networks , 2017, ArXiv.
[28] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[29] Tom Schaul,et al. Learning from Demonstrations for Real World Reinforcement Learning , 2017, ArXiv.
[30] Roberto Cipolla,et al. Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[31] Tom Schaul,et al. Natural Value Approximators: Learning when to Trust Past Estimates , 2017, NIPS.