Paper Information - On Tiny Episodic Memories in Continual Learning

On Tiny Episodic Memories in Continual Learning

In continual learning (CL), an agent learns from a stream of tasks, leveraging prior experience to transfer knowledge to future tasks. It is an ideal framework for reducing the amount of supervision required by existing learning algorithms. But for successful knowledge transfer, the learner needs to remember how to perform previous tasks. One way to endow the learner with the ability to perform tasks seen in the past is to keep a small memory, dubbed episodic memory, that stores a few examples from previous tasks, and to replay these examples when training on future tasks. In this work, we empirically analyze the effectiveness of a very small episodic memory in a CL setup where each training example is seen only once. Surprisingly, across four rather different supervised learning benchmarks adapted to CL, a very simple baseline that jointly trains on examples from the current task and examples stored in the episodic memory significantly outperforms specifically designed CL approaches, with and without episodic memory. Interestingly, we find that repetitive training on even tiny memories of past tasks does not harm generalization; on the contrary, it improves it, with gains between 7% and 17% when the memory is populated with a single example per class.
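The replay baseline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `TinyEpisodicMemory`, the function `replay_stream`, the first-come admission rule that fills each class slot, and the batch sizes are all assumptions made for illustration. The sketch only builds the joint mini-batches (current-task examples plus replayed memory examples) in a single pass over the stream; a learner would take one update step on each yielded batch.

```python
import random
from collections import defaultdict


class TinyEpisodicMemory:
    """Stores at most `per_class` examples per class label.

    Assumption: examples are admitted first-come, first-served until a
    class slot is full. The abstract does not specify the write policy.
    """

    def __init__(self, per_class=1, seed=0):
        self.per_class = per_class
        self.slots = defaultdict(list)
        self.rng = random.Random(seed)

    def add(self, x, y):
        # Keep the example only if its class still has a free slot.
        if len(self.slots[y]) < self.per_class:
            self.slots[y].append((x, y))

    def sample(self, k):
        # Draw up to k stored examples uniformly without replacement.
        stored = [ex for items in self.slots.values() for ex in items]
        return self.rng.sample(stored, min(k, len(stored)))

    def __len__(self):
        return sum(len(items) for items in self.slots.values())


def replay_stream(tasks, memory, batch_size=2, replay_size=2):
    """Yield joint mini-batches over a stream of tasks.

    Each stream example appears exactly once as a current-task example;
    memory examples may be replayed many times.
    """
    for task in tasks:
        for start in range(0, len(task), batch_size):
            current = task[start:start + batch_size]
            replayed = memory.sample(replay_size)
            yield current + replayed
            # Write to memory only after the batch has been used.
            for x, y in current:
                memory.add(x, y)


if __name__ == "__main__":
    # Two toy tasks with (features, label) pairs; the values are made up.
    toy_tasks = [
        [([0.0], 0), ([1.0], 1), ([0.1], 0), ([0.9], 1)],
        [([2.0], 2), ([3.0], 3)],
    ]
    mem = TinyEpisodicMemory(per_class=1)
    for batch in replay_stream(toy_tasks, mem):
        print([label for _, label in batch])
```

With `per_class=1` the memory holds a single example per class, which is the smallest memory setting mentioned in the abstract. Writing to memory after the batch is yielded keeps replayed examples distinct from the current ones.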
