GRIm-RePR: Prioritising Generating Important Features for Pseudo-Rehearsal

Pseudo-rehearsal allows neural networks to learn a sequence of tasks without forgetting how to perform earlier tasks. Forgetting is prevented by introducing a generative network that produces data resembling previously seen tasks, so that this data can be rehearsed alongside learning the new task. This approach has been found effective in both supervised and reinforcement learning. Our current work aims to further reduce forgetting by encouraging the generator to accurately reproduce the features that matter most for task retention. More specifically, the generator is improved by adding a second discriminator to the Generative Adversarial Network, which learns to distinguish real from generated items based on the intermediate activation patterns they produce when fed through the continual learning agent. Using Atari 2600 games, we experimentally find that improving the generator in this way considerably reduces catastrophic forgetting compared to the standard pseudo-rehearsal methods used in deep reinforcement learning. Furthermore, we propose normalising the Q-values taught to the long-term system, as we observe that this substantially reduces catastrophic forgetting by minimising the interference between tasks' reward functions.
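
The abstract describes two mechanisms: a second, feature-level discriminator added to the GAN, and normalisation of the Q-values distilled into the long-term network. Below is a minimal PyTorch sketch of both ideas, assuming toy fully connected architectures, a single optimiser over both discriminators, and a simple per-item standardisation of Q-values; these specifics are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    def __init__(self, z_dim=64, out_dim=84 * 84):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(z_dim, 256), nn.ReLU(),
            nn.Linear(256, out_dim), nn.Tanh(),
        )

    def forward(self, z):
        return self.net(z)

class Discriminator(nn.Module):
    def __init__(self, in_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 256), nn.LeakyReLU(0.2),
            nn.Linear(256, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.net(x)

class Agent(nn.Module):
    """Stand-in for the continual learning agent (e.g. a DQN)."""
    def __init__(self, in_dim=84 * 84, feat_dim=128, n_actions=18):
        super().__init__()
        self.features = nn.Sequential(nn.Linear(in_dim, feat_dim), nn.ReLU())
        self.q_head = nn.Linear(feat_dim, n_actions)

    def forward(self, x):
        f = self.features(x)            # intermediate activations seen by D_feat
        return self.q_head(f), f

bce = nn.BCELoss()

def gan_step(G, D_img, D_feat, agent, real, opt_g, opt_d, z_dim=64):
    """One adversarial step. D_img sees raw items; D_feat sees the agent's
    intermediate activations for the same real/generated items, so the
    generator is also pushed to reproduce features the agent relies on."""
    n = real.size(0)
    ones, zeros = torch.ones(n, 1), torch.zeros(n, 1)

    # Discriminator update (one optimiser over both discriminators here).
    z = torch.randn(n, z_dim)
    fake = G(z).detach()
    with torch.no_grad():
        _, feat_real = agent(real)
        _, feat_fake = agent(fake)
    opt_d.zero_grad()
    d_loss = (bce(D_img(real), ones) + bce(D_img(fake), zeros)
              + bce(D_feat(feat_real), ones) + bce(D_feat(feat_fake), zeros))
    d_loss.backward()
    opt_d.step()

    # Generator update: fool both discriminators at once.
    z = torch.randn(n, z_dim)
    fake = G(z)
    _, feat_fake = agent(fake)          # gradients flow through the agent to G
    opt_g.zero_grad()
    g_loss = bce(D_img(fake), ones) + bce(D_feat(feat_fake), ones)
    g_loss.backward()
    opt_g.step()
    return d_loss.item(), g_loss.item()

def normalise_q(q, eps=1e-8):
    """Rescale Q-values per item before distilling them into the long-term
    network, so tasks with different reward scales interfere less (one simple
    choice of normalisation, shown only for illustration)."""
    return (q - q.mean(dim=1, keepdim=True)) / (q.std(dim=1, keepdim=True) + eps)
```

In the full method the generated items would be replayed alongside data from the current task, and the generator trained per task sequence; those pieces are omitted here to keep the sketch focused on the second discriminator and the Q-value normalisation.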
