Self-Paced Weight Consolidation for Continual Learning

Continual learning algorithms that keep the parameters of new tasks close to those of previous tasks are a popular way to prevent catastrophic forgetting in sequential task learning. However, 1) the performance of the new continual learner degrades when the contributions of previously learned tasks are not distinguished, and 2) the computational cost grows considerably with the number of tasks, since most existing algorithms regularize against all previous tasks when learning a new one. To address these challenges, we propose a self-paced Weight Consolidation (spWC) framework that attains robust continual learning by evaluating the discriminative contributions of previous tasks. Specifically, we develop a self-paced regularization that reflects the priorities of past tasks by measuring their difficulty with a key performance indicator (i.e., accuracy). When a new task arrives, all previous tasks are sorted from "difficult" to "easy" according to these priorities. The parameters of the new continual learner are then learned by selectively maintaining knowledge of the more difficult past tasks, which can overcome catastrophic forgetting at lower computational cost. We adopt an alternative convex search to iteratively update the model parameters and priority weights in the bi-convex formulation. The proposed spWC framework is plug-and-play and applicable to most continual learning algorithms (e.g., EWC, MAS and RCIL) across different settings (e.g., classification and segmentation). Experimental results on several public benchmark datasets demonstrate that the proposed framework effectively improves performance compared with other popular continual learning algorithms.
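
The selection step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the hard 0/1 priority weights, the `keep` cutoff, and the EWC-style quadratic penalty are all assumptions made for illustration. The abstract states only that past tasks are ranked by accuracy and that the more difficult ones are selectively consolidated.

```python
from typing import Dict, List, Sequence


def rank_tasks_by_difficulty(accuracies: Dict[str, float]) -> List[str]:
    """Sort previous tasks from 'difficult' to 'easy'.

    Assumption: difficulty is measured by accuracy, so the task with the
    lowest accuracy is treated as the most difficult.
    """
    return sorted(accuracies, key=lambda task: accuracies[task])


def priority_weights(accuracies: Dict[str, float], keep: int) -> Dict[str, float]:
    """Assign weight 1.0 to the `keep` most difficult tasks and 0.0 to the rest.

    Hypothetical hard-thresholding rule; the paper's self-paced regularizer
    may produce different (e.g. soft) weights.
    """
    selected = set(rank_tasks_by_difficulty(accuracies)[:keep])
    return {task: (1.0 if task in selected else 0.0) for task in accuracies}


def consolidation_penalty(
    theta: Sequence[float],
    anchors: Dict[str, Sequence[float]],
    importance: Dict[str, Sequence[float]],
    weights: Dict[str, float],
    lam: float = 1.0,
) -> float:
    """EWC-style quadratic penalty restricted to the selected past tasks.

    Tasks with zero priority weight are skipped entirely, which is where
    the computational saving in this sketch comes from.
    """
    total = 0.0
    for task, v in weights.items():
        if v == 0.0:
            continue  # easy task: not regularized in this sketch
        total += v * sum(
            f * (p - a) ** 2
            for p, a, f in zip(theta, anchors[task], importance[task])
        )
    return 0.5 * lam * total


# Toy example with three past tasks and a two-parameter model.
accuracies = {"task1": 0.95, "task2": 0.60, "task3": 0.80}
weights = priority_weights(accuracies, keep=2)

anchors = {"task1": [5.0, 5.0], "task2": [0.0, 2.0], "task3": [1.0, 1.0]}
importance = {"task1": [1.0, 1.0], "task2": [2.0, 1.0], "task3": [1.0, 4.0]}
penalty = consolidation_penalty([1.0, 2.0], anchors, importance, weights)
```

In an alternating scheme of the kind the abstract mentions, such a penalty would be added to the new task's loss while the weights are held fixed, and the weights would then be recomputed from fresh accuracy measurements while the model parameters are held fixed.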
