论文信息 - Decision-Theoretic Meta-Learning: Versatile and Efficient Amortization of Few-Shot Learning

Decision-Theoretic Meta-Learning: Versatile and Efficient Amortization of Few-Shot Learning

This paper develops a general framework for data efficient and versatile deep learning. The new framework comprises three elements: 1) Discriminative probabilistic models from multi-task learning that leverage shared statistical information across tasks. 2) A novel Bayesian decision theoretic approach to meta-learning probabilistic inference across many tasks. 3) A fast, flexible, and simple to train amortization network that can automatically generalize and extrapolate to a wide range of settings. The VERSA algorithm, a particular instance of the framework, is evaluated on a suite of supervised few-shot learning tasks. VERSA achieves state-of-the-art performance in one-shot learning on Omniglot and miniImagenet, and produces compelling results on a one-shot ShapeNet view reconstruction challenge.

[1] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[2] Richard J. Mammone,et al. Meta-neural networks that learn by learning , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[3] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[4] A. Dawid. The geometry of proper scoring rules , 2007 .

[5] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.

[6] J. Berger. Statistical Decision Theory and Bayesian Analysis , 1988 .

[7] Joshua B. Tenenbaum,et al. One shot learning of simple visual concepts , 2011, CogSci.

[8] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9] Daan Wierstra,et al. One-Shot Generalization in Deep Generative Models , 2016, ICML.

[10] Alexander M. Rush,et al. Semi-Amortized Variational Autoencoders , 2018, ICML.

[11] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[12] Michael I. Jordan,et al. On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[13] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[14] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .