Co-Transport for Class-Incremental Learning

Traditional learning systems are trained in closed-world for a fixed number of classes, and need pre-collected datasets in advance. However, new classes often emerge in real-world applications and should be learned incrementally. For example, in electronic commerce, new types of products appear daily, and in a social media community, new topics emerge frequently. Under such circumstances, incremental models should learn several new classes at a time without forgetting. We find a strong correlation between old and new classes in incremental learning, which can be applied to relate and facilitate different learning stages mutually. As a result, we propose CO-transport for class Incremental Learning (COIL), which learns to relate across incremental tasks with the class-wise semantic relationship. In detail, co-transport has two aspects: prospective transport tries to augment the old classifier with optimal transported knowledge as fast model adaptation. Retrospective transport aims to transport new class classifiers backward as old ones to overcome forgetting. With these transports, COIL efficiently adapts to new tasks, and stably resists forgetting. Experiments on benchmark and real-world multimedia datasets validate the effectiveness of our proposed method.

[1]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[2]  Hui Wang,et al.  Complementary Incremental Hashing With Query-Adaptive Re-Ranking for Image Retrieval , 2020, IEEE Transactions on Multimedia.

[3]  De-Chuan Zhan,et al.  Learning Adaptive Classifiers Synthesis for Generalized Few-Shot Learning , 2019, International Journal of Computer Vision.

[4]  L. V. Kantorovich,et al.  Mathematical Methods of Organizing and Planning Production , 1960 .

[5]  Philip S. Yu,et al.  Open-world Learning and Application to Product Classification , 2018, WWW.

[6]  Shihui Ying,et al.  IExpressNet: Facial Expression Recognition with Incremental Classes , 2020, ACM Multimedia.

[7]  C. Villani Optimal Transport: Old and New , 2008 .

[8]  Lars Petersson,et al.  Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning , 2021, Computer Vision and Pattern Recognition.

[9]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[10]  Yandong Guo,et al.  Large Scale Incremental Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Richard Sinkhorn,et al.  Concerning nonnegative matrices and doubly stochastic matrices , 1967 .

[12]  Max Welling,et al.  Herding dynamical weights to learn , 2009, ICML '09.

[13]  Yuan Jiang,et al.  Auxiliary Information Regularized Machine for Multiple Modality Feature Learning , 2015, IJCAI.

[14]  Lukasz Golab,et al.  Issues in data stream management , 2003, SGMD.

[15]  Marcus Rohrbach,et al.  Memory Aware Synapses: Learning what (not) to forget , 2017, ECCV.

[16]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[17]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  De-Chuan Zhan,et al.  Learning Placeholders for Open-Set Recognition , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  De-Chuan Zhan,et al.  Tailoring Embedding Function to Heterogeneous Few-Shot Tasks by Global and Local Feature Adaptors , 2021, AAAI.

[20]  Zhi-Hua Zhou,et al.  Heterogeneous Few-Shot Model Rectification With Semantic Mapping , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Pietro Perona,et al.  The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[22]  Dahua Lin,et al.  Learning a Unified Classifier Incrementally via Rebalancing , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Yonghua Yang,et al.  AliCoCo: Alibaba E-commerce Cognitive Concept Net , 2020, SIGMOD Conference.

[24]  IExpressNet , 2020, Proceedings of the 28th ACM International Conference on Multimedia.

[25]  Nicolas Courty,et al.  Optimal Transport for Domain Adaptation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Shutao Xia,et al.  Maintaining Discrimination and Fairness in Class Incremental Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Shuguang Han,et al.  Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling , 2020, AAAI.

[28]  Adrian Popescu,et al.  IL2M: Class Incremental Learning With Dual Memory , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Chong You,et al.  Incremental Learning via Rate Reduction , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  De-Chuan Zhan,et al.  Detecting Sequentially Novel Classes with Stable Generalization Ability , 2021, PAKDD.

[31]  Xiaopeng Hong,et al.  Few-Shot Class-Incremental Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Minlie Huang,et al.  Continual Learning for Natural Language Generation in Task-oriented Dialog Systems , 2020, FINDINGS.

[33]  Martin Jaggi,et al.  Model Fusion via Optimal Transport , 2019, NeurIPS.

[34]  Zhi-Hua Zhou,et al.  Rectify Heterogeneous Models with Semantic Mapping , 2018, ICML.

[35]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[36]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[37]  Derek Hoiem,et al.  Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Fei-Yue Wang,et al.  Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition , 2020, ACM Multimedia.

[39]  Adaptive Aggregation Networks for Class-Incremental Learning Supplementary Materials , 2021 .

[40]  Rama Chellappa,et al.  Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation , 2020, NeurIPS.

[41]  Joost van de Weijer,et al.  Semantic Drift Compensation for Class-Incremental Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Zhi-Hua Zhou,et al.  Multi-Instance Learning With Emerging Novel Class , 2021, IEEE Transactions on Knowledge and Data Engineering.

[43]  Fei Sha,et al.  When MAML Can Adapt Fast and How to Assist When It Cannot , 2021, AISTATS.

[44]  Arthur Cayley,et al.  The Collected Mathematical Papers: On Monge's “Mémoire sur la théorie des déblais et des remblais” , 2009 .

[45]  Bhavani M. Thuraisingham,et al.  Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints , 2011, IEEE Transactions on Knowledge and Data Engineering.

[46]  Zeming Li,et al.  OTA: Optimal Transport Assignment for Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Mohamed Daoudi,et al.  A Novel Space-Time Representation on the Positive Semidefinite Cone for Facial Expression Recognition , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[48]  Shan Li,et al.  Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition , 2019, IEEE Transactions on Image Processing.

[49]  Marco Cuturi,et al.  Computational Optimal Transport: With Applications to Data Science , 2019 .

[50]  Fei Yin,et al.  Prototype Augmentation and Self-Supervision for Incremental Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Zhanxing Zhu,et al.  Reinforced Continual Learning , 2018, NeurIPS.

[52]  Tinne Tuytelaars,et al.  A Continual Learning Survey: Defying Forgetting in Classification Tasks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Gustavo K. Rohde,et al.  Optimal Mass Transport: Signal processing and machine-learning applications , 2017, IEEE Signal Processing Magazine.

[54]  Chunyan Miao,et al.  Distilling Causal Effect of Data in Class-Incremental Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Lijun Yin,et al.  Facial Expression Recognition by De-expression Residue Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[56]  Yinghui Xu,et al.  Few-Shot Incremental Learning with Continually Evolved Classifiers , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .

[58]  Cordelia Schmid,et al.  Memory-Efficient Incremental Learning Through Feature Adaptation , 2020, ECCV.

[59]  Christoph H. Lampert,et al.  iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Zhi-Hua Zhou,et al.  Learnware: on the future of machine learning , 2016, Frontiers of Computer Science.

[61]  Zheng-Jun Zha,et al.  Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  De-Chuan Zhan,et al.  Adaptive Deep Models for Incremental Learning: Considering Capacity Scalability and Sustainability , 2019, KDD.

[63]  Sung Ju Hwang,et al.  Lifelong Learning with Dynamically Expandable Networks , 2017, ICLR.

[64]  Ying Fu,et al.  Incremental Learning Using Conditional Adversarial Networks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[65]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[66]  Marco Cuturi,et al.  Sinkhorn Distances: Lightspeed Computation of Optimal Transport , 2013, NIPS.

[67]  Byoung-Tak Zhang,et al.  Overcoming Catastrophic Forgetting by Incremental Moment Matching , 2017, NIPS.

[68]  Xiu-Shen Wei,et al.  Contextualizing Multiple Tasks via Learning to Decompose , 2021, ArXiv.

[69]  Tao Mei,et al.  MetaSearch: Incremental Product Search via Deep Meta-Learning , 2020, IEEE Transactions on Image Processing.