Paper information - Meta-SGD: Learning to Learn Quickly for Few Shot Learning

Few-shot learning is challenging for learning algorithms that learn each task in isolation and from scratch. In contrast, meta-learning learns from many related tasks a meta-learner that can learn a new task more accurately and faster with fewer examples, where the choice of meta-learners is crucial. In this paper, we develop Meta-SGD, an SGD-like, easily trainable meta-learner that can initialize and adapt any differentiable learner in just one step, on both supervised learning and reinforcement learning. Compared to the popular meta-learner LSTM, Meta-SGD is conceptually simpler, easier to implement, and can be learned more efficiently. Compared to the latest meta-learner MAML, Meta-SGD has a much higher capacity by learning to learn not just the learner initialization, but also the learner update direction and learning rate, all in a single meta-learning process. Meta-SGD shows highly competitive performance for few-shot learning on regression, classification, and reinforcement learning.
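The one-step adaptation described in the abstract can be illustrated with a minimal sketch. It assumes a linear regression learner with a mean squared error loss, and the names `loss_and_grad` and `meta_sgd_adapt`, as well as the toy data, are hypothetical and chosen only for illustration. The vector `alpha` has the same shape as the parameters `theta` and multiplies the gradient element-wise, so it sets both the update direction and the per-parameter learning rate. Here `theta` and `alpha` are hand-picked; in Meta-SGD both would be meta-learned across many tasks, and that outer loop is not shown.

```python
import numpy as np

def loss_and_grad(theta, X, y):
    """Mean squared error (with a 1/2 factor) of the linear model X @ theta, and its gradient."""
    residual = X @ theta - y
    loss = 0.5 * np.mean(residual ** 2)
    grad = X.T @ residual / len(y)
    return loss, grad

def meta_sgd_adapt(theta, alpha, X_train, y_train):
    """One adaptation step: theta' = theta - alpha * grad, with alpha applied element-wise."""
    _, grad = loss_and_grad(theta, X_train, y_train)
    return theta - alpha * grad

# Hypothetical toy task with two training examples (illustrative values only).
X = np.array([[1.0, 0.0], [0.0, 2.0]])
y = np.array([3.0, 4.0])

theta = np.zeros(2)            # initialization (would be meta-learned)
alpha = np.array([2.0, 0.5])   # per-parameter step sizes (would be meta-learned)

# Vector-valued alpha: each parameter gets its own step size.
theta_vector = meta_sgd_adapt(theta, alpha, X, y)
loss_vector, _ = loss_and_grad(theta_vector, X, y)

# Single scalar step size for comparison (one shared rate for all parameters).
theta_scalar = meta_sgd_adapt(theta, 0.5, X, y)
loss_scalar, _ = loss_and_grad(theta_scalar, X, y)

print(theta_vector, loss_vector)
print(theta_scalar, loss_scalar)
```

In this toy problem the hand-picked `alpha` matches the inverse curvature of each coordinate, so a single step fits the task exactly, whereas the shared scalar rate leaves the first parameter under-adapted. This only illustrates why a learned per-parameter `alpha` adds capacity over a single fixed rate; it does not reproduce the paper's experiments.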
