SplitAVG: A Heterogeneity-Aware Federated Deep Learning Method for Medical Imaging

Federated learning is an emerging research paradigm for enabling collaboratively training deep learning models without sharing patient data. However, the data from different institutions are usually heterogeneous across institutions, which may reduce the performance of models trained using federated learning. In this study, we propose a novel heterogeneity-aware federated learning method, SplitAVG, to overcome the performance drops from data heterogeneity in federated learning. Unlike previous federated methods that require complex heuristic training or hyper parameter tuning, our SplitAVG leverages the simple network split and feature map concatenation strategies to encourage the federated model training an unbiased estimator of the target data distribution. We compare SplitAVG with seven state-of-the-art federated learning methods, using centrally hosted training data as the baseline on a suite of both synthetic and real-world federated datasets. We find that the performance of models trained using all the comparison federated learning methods degraded significantly with the increasing degrees of data heterogeneity. In contrast, SplitAVG method achieves comparable results to the baseline method under all heterogeneous settings, that it achieves 96.2% of the accuracy and 110.4% of the mean absolute error obtained by the baseline in a diabetic retinopathy binary classification dataset and a bone age prediction dataset, respectively, on highly heterogeneous data partitions. We conclude that SplitAVG method can effectively overcome the performance drops from variability in data distributions across institutions. Experimental results also show that SplitAVG can be adapted to different base convolutional neural networks (CNNs) and generalized to various types of medical imaging tasks. The code is publicly available at https://github.com/zm17943/SplitAVG.

[1]  S. Avestimehr,et al.  Federated Learning of Generative Image Priors for MRI Reconstruction , 2022, IEEE Transactions on Medical Imaging.

[2]  D. Rubin,et al.  Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging , 2021, Medical Image Anal..

[3]  D. Rubin,et al.  Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Jiayu Zhou,et al.  Data-Free Knowledge Distillation for Heterogeneous Federated Learning , 2021, ICML.

[5]  Pheng-Ann Heng,et al.  FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  V. Sudha,et al.  Diabetic Retinopathy Detection , 2020, International Journal of Engineering and Advanced Technology.

[7]  Ken Chang,et al.  Accounting for data variability in multi-institutional distributed deep learning for medical imaging , 2020, J. Am. Medical Informatics Assoc..

[8]  Huiqi Li,et al.  Data-Driven Enhancement of Blurry Retinal Images via Generative Adversarial Networks , 2019, MICCAI.

[9]  Phillip B. Gibbons,et al.  The Non-IID Data Quagmire of Decentralized Machine Learning , 2019, ICML.

[10]  Tzu-Ming Harry Hsu,et al.  Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification , 2019, ArXiv.

[11]  Hao Chen,et al.  Unifying Structure Analysis and Surrogate-driven Function Regression for Glaucoma OCT Image Screening , 2019, MICCAI.

[12]  Isabel Chien,et al.  Needle in a Haystack: Natural Language Processing to Identify Serious Illness. , 2019, Journal of Palliative Medicine.

[13]  H. H. Thodberg,et al.  The RSNA Pediatric Bone Age Machine Learning Challenge. , 2019, Radiology.

[14]  Shih-Hwa Chiou,et al.  Artificial intelligence-based decision-making for age-related macular degeneration , 2019, Theranostics.

[15]  Francisco Herrera,et al.  Learning from Imbalanced Data Sets , 2018, Springer International Publishing.

[16]  Chang Liu,et al.  Automatic Brain Tumor Segmentation and Overall Survival Prediction Using Machine Learning Algorithms , 2018, BrainLes@MICCAI.

[17]  Spyridon Bakas,et al.  Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation , 2018, BrainLes@MICCAI.

[18]  Yue Zhao,et al.  Federated Learning with Non-IID Data , 2018, ArXiv.

[19]  Bruce R. Rosen,et al.  Distributed deep learning networks among institutions for medical imaging , 2018, J. Am. Medical Informatics Assoc..

[20]  Kaiming He,et al.  Group Normalization , 2018, International Journal of Computer Vision.

[21]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Christos Davatzikos,et al.  Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features , 2017, Scientific Data.

[23]  Byoung-Tak Zhang,et al.  Overcoming Catastrophic Forgetting by Incremental Moment Matching , 2017, NIPS.

[24]  P. Lambin,et al.  Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital - A real life proof of concept. , 2016, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[25]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[26]  Ian Goodfellow,et al.  Deep Learning with Differential Privacy , 2016, CCS.

[27]  J. Marc Overhage,et al.  Going Digital: A Survey on Digitalization and Large-Scale Data Analytics in Healthcare , 2016, Proceedings of the IEEE.

[28]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Brian B. Avants,et al.  The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) , 2015, IEEE Transactions on Medical Imaging.

[31]  Vitaly Shmatikov,et al.  Privacy-preserving deep learning , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[32]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[33]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[34]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[35]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[36]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[37]  Brett Koonce MobileNet v2 , 2021, Convolutional Neural Networks with Swift for Tensorflow.

[38]  Marina Blanton,et al.  Secure Multiparty Computation , 2011, Encyclopedia of Cryptography and Security.