[1] Wojciech Zaremba, et al. OpenAI Gym, 2016, ArXiv.
[2] James L. McClelland, et al. Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory, 1995, Psychological Review.
[3] Hado van Hasselt, et al. Double Q-learning, 2010, NIPS.
[4] Shi Zhong, et al. Efficient online spherical k-means clustering, 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.
[5] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching, 2004, Machine Learning.
[6] Maxim Sviridenko, et al. An Algorithm for Online K-Means Clustering, 2014, ALENEX.
[7] Marlos C. Machado, et al. A Laplacian Framework for Option Discovery in Reinforcement Learning, 2017, ICML.
[8] Demis Hassabis, et al. Neural Episodic Control, 2017, ICML.
[9] Marc G. Bellemare, et al. The Arcade Learning Environment: An Evaluation Platform for General Agents, 2012, J. Artif. Intell. Res.
[10] Joel Z. Leibo, et al. Model-Free Episodic Control, 2016, ArXiv.
[11] Daan Wierstra, et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models, 2014, ICML.
[12] Nathan D. Cahill, et al. Memory Efficient Experience Replay for Streaming Learning, 2018, 2019 International Conference on Robotics and Automation (ICRA).
[13] Angie King. Online k-Means Clustering of Nonstationary Data, 2012.
[14] Tom Schaul, et al. Dueling Network Architectures for Deep Reinforcement Learning, 2015, ICML.
[15] David Isele, et al. Selective Experience Replay for Lifelong Learning, 2018, AAAI.
[16] Joshua B. Tenenbaum, et al. Building machines that learn and think like people, 2016, Behavioral and Brain Sciences.
[17] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[18] Max Welling, et al. Auto-Encoding Variational Bayes, 2013, ICLR.
[19] John Langford, et al. Cover trees for nearest neighbor, 2006, ICML.
[20] A. Azzalini, et al. Statistical applications of the multivariate skew normal distribution, 2009, arXiv:0911.2093.
[21] W. B. Johnson, et al. Extensions of Lipschitz mappings into Hilbert space, 1984.
[22] W. Abraham, et al. Memory retention – the synaptic stability versus plasticity dilemma, 2005, Trends in Neurosciences.