论文信息 - Reptile: a Scalable Metalearning Algorithm

Reptile: a Scalable Metalearning Algorithm

This paper considers metalearning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution. We present a remarkably simple metalearning algorithm called Reptile, which learns a parameter initialization that can be fine-tuned quickly on a new task. Reptile works by repeatedly sampling a task, training on it, and moving the initialization towards the trained weights on that task. Unlike MAML, which also learns an initialization, Reptile doesn't require differentiating through the optimization process, making it more suitable for optimization problems where many update steps are required. We show that Reptile performs well on some well-established benchmarks for few-shot classification. We provide some theoretical analysis aimed at understanding why Reptile works.

J. Schulman | Alex Nichol | John Schulman

[1] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.

[2] Nikolaus Hansen,et al. The CMA Evolution Strategy: A Comparing Review , 2006, Towards a New Evolutionary Computation.

[3] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Lauren A. Schmidt. Meaning and compositionality as statistical induction of categories and constraints , 2009 .

[5] Joshua B. Tenenbaum,et al. One shot learning of simple visual concepts , 2011, CogSci.

[6] Joshua B. Tenenbaum,et al. One-Shot Learning with a Hierarchical Nonparametric Bayesian Model , 2011, ICML Unsupervised and Transfer Learning.

[7] W. Marsden. I and J , 2012 .

[8] Trevor Darrell,et al. Part-Based R-CNNs for Fine-Grained Category Detection , 2014, ECCV.

[9] Joshua B. Tenenbaum,et al. Human-level concept learning through probabilistic program induction , 2015, Science.

[10] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.