Efficient Automatic Meta Optimization Search for Few-Shot Learning

Previous works on meta-learning either relied on elaborately hand-designed network structures or adapted specialized learning rules to particular domains. We propose a universal framework that optimizes the meta-learning process automatically by adopting neural architecture search (NAS). NAS automatically generates and evaluates the meta-learner's architecture for few-shot learning problems, while the meta-learner uses a meta-learning algorithm to optimize its parameters over a distribution of learning tasks. Parameter sharing and experience replay are adopted to accelerate the architecture search process, so it takes only 1-2 GPU days to find good architectures. Extensive experiments on Mini-ImageNet and Omniglot show that our algorithm excels at few-shot learning tasks. The best architecture found on Mini-ImageNet achieves competitive results when transferred to Omniglot, demonstrating the high transferability of architectures across different computer vision problems.
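To make the described pipeline concrete, below is a minimal sketch of the search loop under assumptions the abstract does not specify: a pool of three convolutional candidate operations with shared weights stands in for the search space, a Reptile-style first-order meta-update on synthetic 5-way episodes stands in for real Mini-ImageNet/Omniglot tasks, and a REINFORCE controller is updated from rewards drawn out of a small replay buffer. The operation pool, hyperparameters, and controller design are illustrative choices, not the authors' exact method.

# Minimal sketch (PyTorch) of NAS + meta-learning with parameter sharing and
# experience replay. All components here are illustrative assumptions.
import random
import torch
import torch.nn as nn
import torch.nn.functional as F

# Shared pool of candidate operations (parameter sharing): every sampled
# architecture reuses these weights instead of being trained from scratch.
OPS = nn.ModuleList([
    nn.Conv2d(1, 8, 3, padding=1),
    nn.Conv2d(1, 8, 5, padding=2),
    nn.Conv2d(1, 8, 1),
])
head = nn.Linear(8, 5)  # 5-way few-shot classification head

def forward(arch, x):
    # 'arch' is an index into the shared operation pool.
    h = F.relu(OPS[arch](x))
    h = F.adaptive_avg_pool2d(h, 1).flatten(1)
    return head(h)

def sample_task(n_way=5, k_shot=1, q=5):
    # Toy episode sampler standing in for Omniglot/Mini-ImageNet loaders.
    xs = torch.randn(n_way * k_shot, 1, 28, 28)
    ys = torch.arange(n_way).repeat_interleave(k_shot)
    xq = torch.randn(n_way * q, 1, 28, 28)
    yq = torch.arange(n_way).repeat_interleave(q)
    return (xs, ys), (xq, yq)

def meta_train_and_eval(arch, inner_steps=3, inner_lr=1e-2, meta_lr=0.5):
    # Reptile-style first-order meta-update on the shared weights, then
    # few-shot accuracy on a held-out query set serves as the reward.
    params = list(OPS[arch].parameters()) + list(head.parameters())
    init = [p.detach().clone() for p in params]
    (xs, ys), (xq, yq) = sample_task()
    opt = torch.optim.SGD(params, lr=inner_lr)
    for _ in range(inner_steps):
        opt.zero_grad()
        F.cross_entropy(forward(arch, xs), ys).backward()
        opt.step()
    with torch.no_grad():
        acc = (forward(arch, xq).argmax(1) == yq).float().mean().item()
        # Interpolate back toward the initial weights (Reptile outer step).
        for p, p0 in zip(params, init):
            p.copy_(p0 + meta_lr * (p - p0))
    return acc

# Controller: a categorical distribution over operations, updated by REINFORCE
# from rewards stored in a small replay buffer (experience replay).
logits = torch.zeros(len(OPS), requires_grad=True)
ctrl_opt = torch.optim.Adam([logits], lr=0.1)
replay = []

for step in range(50):
    arch = torch.distributions.Categorical(logits=logits).sample()
    reward = meta_train_and_eval(arch.item())
    replay.append((arch, reward))
    batch = random.sample(replay, min(8, len(replay)))
    baseline = sum(r for _, r in batch) / len(batch)
    loss = -sum(torch.distributions.Categorical(logits=logits).log_prob(a) * (r - baseline)
                for a, r in batch) / len(batch)
    ctrl_opt.zero_grad()
    loss.backward()
    ctrl_opt.step()

print("best op index:", int(logits.argmax()))

In this sketch, the replay buffer lets each architecture evaluation be reused across several controller updates, and the shared operation weights avoid retraining each sampled architecture from scratch; together these correspond to the acceleration the abstract attributes to parameter sharing and experience replay. A full system would replace the toy task sampler with episodic Mini-ImageNet/Omniglot loaders and a multi-layer search space.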
