Learning Cautiously in Federated Learning with Noisy and Heterogeneous Clients

Federated learning (FL) is a distributed framework for collaborative training with privacy guarantees. In real-world scenarios, clients may have non-IID data (local class imbalance) with poor annotation quality (label noise). The co-existence of label noise and class imbalance in the small local datasets of FL renders both conventional FL methods and standard noisy-label learning methods ineffective. To address these challenges, we propose FedCNI, which requires no additional clean proxy dataset. It consists of a noise-resilient local solver and a robust global aggregator. For the local solver, we design a robust prototypical noise detector to distinguish noisy samples. To further reduce the negative impact of the detected noisy samples, we devise a curriculum pseudo-labeling method and a denoise Mixup training strategy. For the global aggregator, we propose a switching re-weighted aggregation method tailored to different learning periods. Extensive experiments demonstrate that our method can substantially outperform state-of-the-art solutions in mix-heterogeneous (noisy and non-IID) FL environments.
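To make the pipeline concrete, below is a minimal NumPy sketch of the ideas named above: prototype-based noise detection, a curriculum confidence threshold, Mixup between trusted and suspect samples, and score-based re-weighted aggregation on the server. Every name and rule here (the distance-based detection criterion, the per-client quality score, the threshold schedule) is an illustrative assumption for exposition, not the paper's actual implementation; in particular, FedCNI's switching aggregator changes its weighting rule across learning periods, which a single static rule stands in for below.

```python
# Minimal NumPy sketch of the components named in the abstract. Everything
# here (function names, the distance rule, the quality score) is an
# illustrative assumption, not the paper's actual implementation.
import numpy as np


def class_prototypes(features, labels, num_classes):
    """Per-class mean feature vector; empty classes fall back to the global mean."""
    global_mean = features.mean(axis=0)
    return np.stack([
        features[labels == c].mean(axis=0) if np.any(labels == c) else global_mean
        for c in range(num_classes)
    ])


def flag_noisy(features, labels, protos):
    """Prototype-based detection: a sample is suspect when its feature lies
    closer to another class's prototype than to its own label's prototype."""
    dists = np.linalg.norm(features[:, None, :] - protos[None, :, :], axis=2)
    return dists.argmin(axis=1) != labels  # boolean mask over samples


def curriculum_threshold(base_tau, class_progress):
    """FlexMatch-style curriculum: relax the pseudo-label confidence threshold
    for classes the model has learned less well (progress in [0, 1])."""
    return base_tau * class_progress


def denoise_mixup(x_clean, y_clean, x_suspect, y_pseudo, alpha=0.75):
    """Mixup between a trusted sample and a pseudo-labeled suspect sample,
    biased toward the trusted one; labels are one-hot vectors."""
    lam = np.random.beta(alpha, alpha)
    lam = max(lam, 1.0 - lam)  # keep the trusted sample dominant
    return lam * x_clean + (1 - lam) * x_suspect, lam * y_clean + (1 - lam) * y_pseudo


def reweighted_aggregate(client_params, client_scores):
    """Server side: weighted average of client parameter vectors, with weights
    derived from a per-client quality score (e.g., estimated clean fraction)."""
    w = np.asarray(client_scores, dtype=float)
    w /= w.sum()
    return sum(wi * p for wi, p in zip(w, client_params))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(200, 16))
    labels = rng.integers(0, 10, size=200)
    protos = class_prototypes(feats, labels, num_classes=10)
    suspect = flag_noisy(feats, labels, protos)
    print(f"{suspect.sum()} of {labels.size} samples flagged as suspect")
```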
