暂无分享,去创建一个
Tom Schaul | David Silver | Hado van Hasselt | Diana Borsa | John Quan | R'emi Munos | Andr'e Barreto | Daniel Mankowitz | T. Schaul | André Barreto | R. Munos | H. V. Hasselt | D. Mankowitz | John Quan | Diana Borsa | David Silver
[1] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.
[2] Demis Hassabis,et al. Grounded Language Learning in a Simulated 3D World , 2017, ArXiv.
[3] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[4] L. Kaelbling. "hierarchical Learning in Stochastic Domains" Hierarchical Learning in Stochastic Domains , 2009 .
[5] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[6] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[7] Alessandro Lazaric,et al. Transfer in Reinforcement Learning: A Framework and a Survey , 2012, Reinforcement Learning.
[8] Tom Schaul,et al. Unicorn: Continual Learning with a Universal, Off-policy Agent , 2018, ArXiv.
[9] Wolfram Burgard,et al. Deep reinforcement learning with successor features for navigation across similar environments , 2016, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[10] Dan Klein,et al. Modular Multitask Reinforcement Learning with Policy Sketches , 2016, ICML.
[11] Tom Schaul,et al. Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement , 2018, ICML.
[12] Samuel Gershman,et al. Deep Successor Reinforcement Learning , 2016, ArXiv.
[13] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[14] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[15] Yuval Tassa,et al. Learning and Transfer of Modulated Locomotor Controllers , 2016, ArXiv.
[16] Leslie Pack Kaelbling,et al. Learning to Achieve Goals , 1993, IJCAI.
[17] Yee Whye Teh,et al. Distral: Robust multitask reinforcement learning , 2017, NIPS.
[18] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[19] Yoshua Bengio,et al. Universal Successor Representations for Transfer Reinforcement Learning , 2018, ICLR.
[20] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[21] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[22] Honglak Lee,et al. Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning , 2017, ICML.
[23] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[24] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[25] Sergey Levine,et al. Learning modular neural network policies for multi-task and multi-robot transfer , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[26] Csaba Szepesvári,et al. Algorithms for Reinforcement Learning , 2010, Synthesis Lectures on Artificial Intelligence and Machine Learning.
[27] Juergen Schmidhuber,et al. A General Method For Incremental Self-Improvement And Multi-Agent Learning In Unrestricted Environme , 1999 .
[28] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..