Martha White | Tom Schaul | Marc G. Bellemare | Doina Precup | Adam White | Shibl Mourad | Joseph Modayil | Hado van Hasselt | Pierre-Luc Bacon | Jean Harb
[1] W. Abraham, et al. Memory retention – the synaptic stability versus plasticity dilemma, 2005, Trends in Neurosciences.
[2] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[3] Patrick M. Pilarski, et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction, 2011, AAMAS.
[4] Mark B. Ring. Continual learning in reinforcement environments, 1995, GMD-Bericht.
[5] Tom Schaul, et al. Reinforcement Learning with Unsupervised Auxiliary Tasks, 2016, ICLR.
[6] Shimon Whiteson, et al. Multi-Objective Decision Making, 2017, Synthesis Lectures on Artificial Intelligence and Machine Learning.
[7] Shakir Mohamed, et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, 2015, NIPS.
[8] Tom Schaul, et al. Universal Value Function Approximators, 2015, ICML.
[9] Thomas Degris, et al. Scaling-up Knowledge for a Cognizant Robot, 2012, AAAI Spring Symposium: Designing Intelligent Robots.
[10] Marc G. Bellemare, et al. Safe and Efficient Off-Policy Reinforcement Learning, 2016, NIPS.
[11] Rémi Munos, et al. Learning to Search with MCTSnets, 2018, ICML.
[12] Eric Eaton, et al. ELLA: An Efficient Lifelong Learning Algorithm, 2013, ICML.
[13] Tom Schaul, et al. The Predictron: End-To-End Learning and Planning, 2016, ICML.
[14] Richard S. Sutton, et al. Predictive Representations of State, 2001, NIPS.
[15] Tom Schaul, et al. Better Generalization with Forecasts, 2013, IJCAI.
[16] Jürgen Schmidhuber. A Possibility for Implementing Curiosity and Boredom in Model-Building Neural Controllers, 1991.
[17] Marc G. Bellemare, et al. A Distributional Perspective on Reinforcement Learning, 2017, ICML.