GDST: Global Distillation Self-Training for Semi-Supervised Federated Learning

Federated Learning (FL) is a machine learning paradigm that enables decentralized model training over many separate data sources without compromising privacy. However, existing work rarely considers the difficulty of obtaining sufficient data labels caused by uncontrollable user behavior, especially in cross-device FL scenarios. In this paper, we consider semi-supervised federated learning (SSFL) setups and focus mainly on the disjoint scenario, where local clients have access only to unlabeled data. By integrating a self-training scheme for the unlabeled data, we introduce a self-training loss as part of the local training objective within the federated learning framework. To further stabilize and improve learning, we propose a global distillation loss that uses the global model's output logits on each client sample as supervision, softening the distillation with a temperature to extract more discriminative information. Combining the self-training and global distillation losses with server-side training, we propose the Global Distillation Self-Training (GDST) federated learning algorithm, which learns a global model in a distributed manner under the disjoint SSFL scenario. Finally, we conduct extensive ablation studies to examine the role of each component of GDST and experimentally support the interpretability of the method.
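
To make the local objective concrete, below is a minimal PyTorch sketch of what such a combined loss could look like. The abstract only specifies a self-training loss plus a temperature-softened global distillation loss, so the confidence-thresholded pseudo-labeling, the names `conf_threshold` and `lam`, and the standard T^2 scaling of the distillation term are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def gdst_local_loss(
    local_logits: torch.Tensor,   # logits of the local (client) model on a batch
    global_logits: torch.Tensor,  # logits of the downloaded global model on the same batch
    conf_threshold: float = 0.95, # assumed pseudo-label confidence cutoff
    temperature: float = 2.0,     # softening temperature for distillation
    lam: float = 1.0,             # assumed weight balancing the two terms
) -> torch.Tensor:
    """Sketch of a combined self-training + global distillation objective."""
    # Self-training term: confident local predictions serve as pseudo-labels;
    # low-confidence samples are masked out (one common self-training choice).
    probs = local_logits.softmax(dim=1)
    max_probs, pseudo_labels = probs.max(dim=1)
    mask = (max_probs >= conf_threshold).float()
    st_loss = (F.cross_entropy(local_logits, pseudo_labels, reduction="none") * mask).mean()

    # Global distillation term: match the local model's softened distribution to
    # the global model's softened logits; the global model acts as a fixed teacher.
    T = temperature
    kd_loss = F.kl_div(
        F.log_softmax(local_logits / T, dim=1),
        F.softmax(global_logits.detach() / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # T^2 rescaling, standard in temperature-based distillation

    return st_loss + lam * kd_loss
```

In this reading, softening both logit distributions with the same temperature exposes the global model's relative class similarities (the "more discriminative information" the abstract refers to) rather than only its top-1 prediction.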
