暂无分享,去创建一个
Tom Schaul | Junhyuk Oh | David Silver | Matteo Hessel | Hado van Hasselt | André Barreto | Daniel J. Mankowitz | Dan Horgan | John Quan | Augustin Zídek | Junhyuk Oh | Dan Horgan | T. Schaul | D. Silver | Matteo Hessel | H. V. Hasselt | D. Mankowitz | Augustin Zídek | André Barreto | John Quan | David Silver | Augustin Žídek
[1] C. Watkins. Learning from delayed rewards , 1989 .
[2] Leslie Pack Kaelbling,et al. Learning to Achieve Goals , 1993, IJCAI.
[3] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.
[4] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[5] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[6] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[7] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[8] Sanjoy Dasgupta,et al. Off-Policy Temporal Difference Learning with Function Approximation , 2001, ICML.
[9] F. Volkmar. Adaptive skills. , 2003, Journal of autism and developmental disorders.
[10] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[11] Jing Peng,et al. Incremental multi-step Q-learning , 1994, Machine Learning.
[12] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[13] Richard S. Sutton,et al. A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation , 2008, NIPS.
[14] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[15] L. Kaelbling. "hierarchical Learning in Stochastic Domains" Hierarchical Learning in Stochastic Domains , 2009 .
[16] Shalabh Bhatnagar,et al. Toward Off-Policy Learning Control with Function Approximation , 2010, ICML.
[17] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[18] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[19] Michiel van de Panne,et al. Curriculum Learning for Motor Skills , 2012, Canadian Conference on AI.
[20] Shie Mannor,et al. Scaling Up Robust MDPs using Function Approximation , 2014, ICML.
[21] Lihong Li,et al. PAC-inspired Option Discovery in Lifelong Reinforcement Learning , 2014, ICML.
[22] Shie Mannor,et al. Optimizing the CVaR via Sampling , 2014, AAAI.
[23] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[24] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[25] Eric Eaton,et al. Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret , 2015, ICML.
[26] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[27] Shane Legg,et al. DeepMind Lab , 2016, ArXiv.
[28] Joshua B. Tenenbaum,et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.
[29] Marc G. Bellemare,et al. Safe and Efficient Off-Policy Reinforcement Learning , 2016, NIPS.
[30] Shie Mannor,et al. Adaptive Skills Adaptive Partitions (ASAP) , 2016, NIPS.
[31] Demis Hassabis,et al. Grounded Language Learning in a Simulated 3D World , 2017, ArXiv.
[32] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[33] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[34] Honglak Lee,et al. Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning , 2017, ICML.
[35] Alex Graves,et al. Automated Curriculum Learning for Neural Networks , 2017, ICML.
[36] Shie Mannor,et al. A Deep Hierarchical Approach to Lifelong Learning in Minecraft , 2016, AAAI.
[37] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[38] Pieter Abbeel,et al. Reverse Curriculum Generation for Reinforcement Learning , 2017, CoRL.
[39] Yuval Tassa,et al. Emergence of Locomotion Behaviours in Rich Environments , 2017, ArXiv.
[40] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.
[41] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.
[42] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[43] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[44] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[45] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[46] Shie Mannor,et al. Learning Robust Options , 2018, AAAI.
[47] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[48] Ilya Kostrikov,et al. Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play , 2017, ICLR.
[49] Stefan Wermter,et al. Continual Lifelong Learning with Neural Networks: A Review , 2018, Neural Networks.
[50] Ian Osband,et al. The Uncertainty Bellman Equation and Exploration , 2017, ICML.
[51] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[52] Richard S. Sutton,et al. Multi-step Reinforcement Learning: A Unifying Algorithm , 2017, AAAI.