暂无分享,去创建一个
[1] Yoshua Bengio,et al. On the Optimization of a Synaptic Learning Rule , 2007 .
[2] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.
[3] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[4] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[5] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[6] Sergey Levine,et al. One-Shot Visual Imitation Learning via Meta-Learning , 2017, CoRL.
[7] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[8] Yoshua Bengio,et al. Bayesian Model-Agnostic Meta-Learning , 2018, NeurIPS.
[9] Joel Z. Leibo,et al. Prefrontal cortex as a meta-reinforcement learning system , 2018, bioRxiv.
[10] M Botvinick,et al. Episodic Control as Meta-Reinforcement Learning , 2018, bioRxiv.
[11] Thomas G. Dietterich,et al. To transfer or not to transfer , 2005, NIPS 2005.
[12] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.
[13] Sergey Levine,et al. Probabilistic Model-Agnostic Meta-Learning , 2018, NeurIPS.
[14] Sebastian Thrun,et al. Lifelong Learning Algorithms , 1998, Learning to Learn.
[15] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[16] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[17] Mohammad Ghavamzadeh,et al. Algorithms for CVaR Optimization in MDPs , 2014, NIPS.
[18] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.