A Simple Neural Attentive Meta-Learner
暂无分享,去创建一个
Pieter Abbeel | Xi Chen | Nikhil Mishra | Mostafa Rohaninejad | P. Abbeel | Xi Chen | Nikhil Mishra | Mostafa Rohaninejad
[1] J. Gittins. Bandit processes and dynamic allocation indices , 1979 .
[2] Richard J. Mammone,et al. Meta-neural networks that learn by learning , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.
[3] Sebastian Thrun,et al. Learning to Learn: Introduction and Overview , 1998, Learning to Learn.
[4] Malcolm J. A. Strens,et al. A Bayesian Framework for Reinforcement Learning , 2000, ICML.
[5] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.
[6] Yoshua Bengio,et al. On the Optimization of a Synaptic Learning Rule , 2007 .
[7] Peter Auer,et al. Near-optimal Regret Bounds for Reinforcement Learning , 2008, J. Mach. Learn. Res..
[8] G. Evans,et al. Learning to Optimize , 2008 .
[9] Joshua B. Tenenbaum,et al. One shot learning of simple visual concepts , 2011, CogSci.
[10] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[11] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[12] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[13] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[14] Gregory R. Koch,et al. Siamese Neural Networks for One-Shot Image Recognition , 2015 .
[15] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[16] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.
[18] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[19] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[20] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[21] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[22] Bartunov Sergey,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016 .
[23] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[24] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[25] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[26] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.
[27] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Benjamin Van Roy,et al. Why is Posterior Sampling Better than Optimism for Reinforcement Learning? , 2016, ICML.
[29] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.