A Comprehensive Overview and Survey of Recent Advances in Meta-Learning

This article reviews meta-learning, also known as learning-to-learn, which seeks rapid and accurate adaptation of models to unseen tasks, with applications in strong AI, few-shot learning, natural language processing, and robotics. Unlike deep learning, meta-learning can be applied to few-shot, high-dimensional datasets and aims to further improve model generalization to unseen tasks. Whereas deep learning focuses on in-sample prediction, meta-learning concerns model adaptation for out-of-sample prediction. Meta-learning can perform continual self-improvement toward highly autonomous AI, and it may serve as an additional generalization block that complements the original deep learning model. Meta-learning seeks to adapt machine learning models to unseen tasks that differ substantially from the tasks seen during training. Meta-learning with coevolution between agent and environment provides solutions to complex tasks that cannot be solved by training from scratch. Meta-learning methodology draws on a wide range of ideas. We briefly summarize meta-learning methodologies into the following categories: black-box meta-learning, metric-based meta-learning, layered meta-learning, and Bayesian meta-learning frameworks. Recent applications concentrate on integrating meta-learning with other machine learning frameworks to provide feasible integrated solutions. We briefly present recent advances in meta-learning and discuss potential future research directions.
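
As an illustration of the rapid-adaptation idea described above, the following is a minimal, self-contained sketch of a first-order, gradient-based (MAML-style) meta-learning loop on a toy family of linear-regression tasks. The task distribution, linear model, and learning rates are illustrative assumptions chosen for brevity, not the method of any particular surveyed work.

    import numpy as np

    rng = np.random.default_rng(0)

    def sample_task():
        """A toy regression task: y = a*x + c with task-specific (a, c)."""
        a, c = rng.uniform(-2, 2), rng.uniform(-1, 1)
        def make_batch(n=10):
            x = rng.uniform(-5, 5, n)
            return x, a * x + c
        return make_batch

    def loss_and_grads(w, b, x, y):
        # Mean-squared error of the linear model and its gradients w.r.t. w and b.
        err = w * x + b - y
        return np.mean(err ** 2), 2 * np.mean(err * x), 2 * np.mean(err)

    # Meta-parameters shared across tasks.
    w, b = 0.0, 0.0
    inner_lr, outer_lr = 0.01, 0.001

    for step in range(2000):
        make_batch = sample_task()
        # Inner loop: adapt the shared parameters to the sampled task
        # with one gradient step on a small support set.
        xs, ys = make_batch()
        _, gw, gb = loss_and_grads(w, b, xs, ys)
        w_task, b_task = w - inner_lr * gw, b - inner_lr * gb
        # Outer loop (first-order approximation): evaluate the adapted
        # parameters on a query set and update the meta-parameters with
        # that gradient, so future adaptation becomes faster and more accurate.
        xq, yq = make_batch()
        loss_q, gw_q, gb_q = loss_and_grads(w_task, b_task, xq, yq)
        w, b = w - outer_lr * gw_q, b - outer_lr * gb_q
        if step % 500 == 0:
            print(f"step {step}: query loss after one adaptation step = {loss_q:.3f}")

The other methodological families summarized in this survey replace the inner gradient step in this sketch: black-box meta-learning uses a learned (e.g., recurrent) update, metric-based meta-learning compares examples in a learned embedding space, and Bayesian meta-learning performs posterior inference over task-specific parameters.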
