Task Attended Meta-Learning for Few-Shot Learning

Meta-learning (ML) has emerged as a promising direction for learning models in resource-constrained settings such as few-shot learning. Popular approaches to ML either learn a generalizable initial model or a generic parametric optimizer through episodic training. The former leverage knowledge from a batch of tasks to learn an optimal prior. In this work, we study the importance of a batch of tasks for ML. Specifically, we first incorporate a batch episodic training regimen to improve the learning of the generic parametric optimizer. We then hypothesize that the common assumption in batch episodic training, namely that each task in a batch contributes equally to learning an optimal meta-model, need not hold. We propose to weight the tasks in a batch according to their “importance” in improving the meta-model’s learning. To this end, we introduce a training curriculum called task attended meta-training, motivated by selective focus in humans, that weights the tasks in a batch. Task attention is a standalone module that can be integrated with any batch episodic training regimen. Comparisons of task-attended models with their non-task-attended counterparts on complex datasets such as miniImageNet and tieredImageNet validate the effectiveness of task attention.
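
As a concrete illustration (a minimal sketch, not the paper's exact method), the PyTorch snippet below shows one task-attended meta-update over a batch of tasks: each task's query loss is weighted by a score produced by a small attention network before the meta-gradient step. The `TaskAttention` architecture, its input features (detached query loss and accuracy), the first-order MAML-style inner loop, and all hyperparameters are illustrative assumptions; PyTorch 2.x is assumed for `torch.func.functional_call`.

```python
import torch
import torch.nn as nn
from torch.func import functional_call

class TaskAttention(nn.Module):
    """Hypothetical attention module: maps per-task statistics to task weights."""
    def __init__(self, feat_dim=2, hidden=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(feat_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))

    def forward(self, task_feats):                  # (num_tasks, feat_dim)
        scores = self.net(task_feats).squeeze(-1)   # (num_tasks,)
        return torch.softmax(scores, dim=0)         # weights sum to 1 over the batch

def task_attended_meta_step(model, attention, tasks, meta_opt,
                            inner_lr=0.01, inner_steps=1):
    """One meta-update over a batch of tasks with attention-derived task weights.

    Each task is a dict with 'support' and 'query' entries holding (x, y) pairs.
    A first-order MAML-style inner loop is assumed purely for illustration.
    """
    loss_fn = nn.CrossEntropyLoss()
    query_losses, feats = [], []
    for task in tasks:
        params = dict(model.named_parameters())
        # Inner loop: adapt the parameters on the task's support set.
        for _ in range(inner_steps):
            xs, ys = task['support']
            support_loss = loss_fn(functional_call(model, params, (xs,)), ys)
            grads = torch.autograd.grad(support_loss, list(params.values()))
            params = {k: p - inner_lr * g
                      for (k, p), g in zip(params.items(), grads)}
        # Evaluate the adapted parameters on the task's query set.
        xq, yq = task['query']
        logits = functional_call(model, params, (xq,))
        q_loss = loss_fn(logits, yq)
        query_losses.append(q_loss)
        # Per-task statistics fed to the attention module (an assumed choice):
        acc = (logits.argmax(dim=-1) == yq).float().mean()
        feats.append(torch.stack([q_loss.detach(), acc]))

    # Weight each task's query loss by its attention score, then meta-update.
    weights = attention(torch.stack(feats))         # (num_tasks,)
    meta_loss = (weights * torch.stack(query_losses)).sum()
    meta_opt.zero_grad()
    meta_loss.backward()
    meta_opt.step()
    return meta_loss.item()
```

Here `meta_opt` is assumed to optimize both parameter sets, e.g. `torch.optim.Adam(list(model.parameters()) + list(attention.parameters()))`, so the attention module is trained jointly with the meta-model, mirroring the idea that task weights are learned rather than fixed.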
