论文信息 - The Barbados 2018 List of Open Issues in Continual Learning

The Barbados 2018 List of Open Issues in Continual Learning

We want to make progress toward artificial general intelligence, namely general-purpose agents that autonomously learn how to competently act in complex environments. The purpose of this report is to sketch a research outline, share some of the most important open issues we are facing, and stimulate further discussion in the community. The content is based on some of our discussions during a week-long workshop held in Barbados in February 2018.

[1] W. Abraham,et al. Memory retention – the synaptic stability versus plasticity dilemma , 2005, Trends in Neurosciences.

[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[3] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.

[4] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.

[5] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.

[6] Shimon Whiteson,et al. Multi-Objective Decision Making , 2017, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[7] Shakir Mohamed,et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning , 2015, NIPS.

[8] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.

[9] Thomas Degris,et al. Scaling-up Knowledge for a Cognizant Robot , 2012, AAAI Spring Symposium: Designing Intelligent Robots.

[10] Marc G. Bellemare,et al. Safe and Efficient Off-Policy Reinforcement Learning , 2016, NIPS.

[11] Rémi Munos,et al. Learning to Search with MCTSnets , 2018, ICML.

[12] Eric Eaton,et al. ELLA: An Efficient Lifelong Learning Algorithm , 2013, ICML.

[13] Tom Schaul,et al. The Predictron: End-To-End Learning and Planning , 2016, ICML.

[14] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.

[15] Tom Schaul,et al. Better Generalization with Forecasts , 2013, IJCAI.

[16] Stewart W. Wilson,et al. A Possibility for Implementing Curiosity and Boredom in Model-Building Neural Controllers , 1991 .

[17] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.