论文信息 - Jo-SRC: A Contrastive Approach for Combating Noisy Labels

Jo-SRC: A Contrastive Approach for Combating Noisy Labels

Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance. Existing state-of-the-art methods primarily adopt a sample selection strategy, which selects small-loss samples for subsequent training. However, prior literature tends to perform sample selection within each mini-batch, neglecting the imbalance of noise ratios in different mini-batches. Moreover, valuable knowledge within high-loss samples is wasted. To this end, we propose a noise-robust approach named Jo-SRC (Joint Sample Selection and Model Regularization based on Consistency). Specifically, we train the network in a contrastive learning manner. Predictions from two different views of each sample are used to estimate its "likelihood" of being clean or out-of-distribution. Furthermore, we propose a joint loss to advance the model generalization performance by introducing consistency regularization. Extensive experiments have validated the superiority of our approach over existing state-of-the-art methods. The source code and models have been made available at https://github.com/NUST-Machine-Intelligence-Laboratory/Jo-SRC.

[1] Xiu-Shen Wei,et al. CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning , 2020, ACM Multimedia.

[2] Xingrui Yu,et al. Co-teaching: Robust training of deep neural networks with extremely noisy labels , 2018, NeurIPS.

[3] Mikhail Belkin,et al. A Co-Regularization Approach to Semi-supervised Learning with Multiple Views , 2005 .

[4] Mohan S. Kankanhalli,et al. Learning to Learn From Noisy Labeled Data , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[6] Lei Zhang,et al. CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7] Ling Shao,et al. Extracting Multiple Visual Senses for Web Learning , 2019, IEEE Transactions on Multimedia.

[8] Andrew McCallum,et al. Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples , 2017, NIPS.

[9] Heng Tao Shen,et al. Exploiting Web Images for Multi-Output Classification: From Category to Subcategories , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[10] Mert R. Sabuncu,et al. Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels , 2018, NeurIPS.

[11] Jian Zhang,et al. A Domain Robust Approach For Image Dataset Construction , 2016, ACM Multimedia.

[12] Qi Wu,et al. Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Li Fei-Fei,et al. MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels , 2017, ICML.

[14] Pietro Perona,et al. Learning Object Categories From Internet Image Searches , 2010, Proceedings of the IEEE.

[15] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Joan Bruna,et al. Training Convolutional Networks with Noisy Labels , 2014, ICLR 2014.

[17] Bin Yang,et al. Learning to Reweight Examples for Robust Deep Learning , 2018, ICML.

[18] Zechao Li,et al. Data-driven Meta-set Based Fine-Grained Visual Recognition , 2020, ACM Multimedia.

[19] Jian Zhang,et al. Exploiting Web Images for Dataset Construction: A Domain Robust Approach , 2016, IEEE Transactions on Multimedia.

[20] Bo An,et al. Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Grant Van Horn,et al. The iNaturalist Species Classification and Detection Dataset-Supplementary Material , 2018 .

[22] Zheng Zhang,et al. Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification , 2020, AAAI.

[23] Harri Valpola,et al. Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[24] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Guosheng Lin,et al. SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[26] Kevin Gimpel,et al. Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise , 2018, NeurIPS.

[27] Fumin Shen,et al. Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples , 2021, IEEE Transactions on Multimedia.

[28] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[29] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[30] Yang Song,et al. The iNaturalist Species Classification and Detection Dataset , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31] Richard Nock,et al. Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Yu-Kun Lai,et al. Recognition From Web Data: A Progressive Filtering Approach , 2018, IEEE Transactions on Image Processing.

[33] Kun Yi,et al. Probabilistic End-To-End Noise Correction for Learning With Noisy Labels , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Xingrui Yu,et al. How does Disagreement Help Generalization against Label Corruption? , 2019, ICML.

[35] Guanyu Gao,et al. Bridging the Web Data and Fine-Grained Visual Recognition via Alleviating Label Noise and Domain Mismatch , 2020, ACM Multimedia.

[36] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37] Xiaogang Wang,et al. Learning from massive noisy labeled data for image classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Yoshua Bengio,et al. A Closer Look at Memorization in Deep Networks , 2017, ICML.

[39] Samy Bengio,et al. Understanding deep learning requires rethinking generalization , 2016, ICLR.

[40] Michal Valko,et al. Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.

[41] Ashok Veeraraghavan,et al. Webly Supervised Learning Meets Zero-shot Learning: A Hybrid Approach for Fine-Grained Classification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[43] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Noel E. O'Connor,et al. Unsupervised label noise modeling and loss correction , 2019, ICML.

[45] Yali Wang,et al. MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Dumitru Erhan,et al. Training Deep Neural Networks on Noisy Labels with Bootstrapping , 2014, ICLR.

[47] Shai Shalev-Shwartz,et al. Decoupling "when to update" from "how to update" , 2017, NIPS.

[48] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[49] Jian Zhang,et al. Towards Automatic Construction of Diverse, High-Quality Image Datasets , 2017, IEEE Transactions on Knowledge and Data Engineering.

[50] Jianhua Lin,et al. Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[51] Kiyoharu Aizawa,et al. Joint Optimization Framework for Learning with Noisy Labels , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[52] Antonio Criminisi,et al. Harvesting Image Databases from the Web , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[53] Ling Shao,et al. Region Graph Embedding Network for Zero-Shot Learning , 2020, ECCV.

[54] Jacob Goldberger,et al. Training deep neural-networks using a noise adaptation layer , 2016, ICLR.

[55] Junnan Li,et al. DivideMix: Learning with Noisy Labels as Semi-supervised Learning , 2020, ICLR.