Differential privacy preserved federated learning for prognostic modeling in COVID-19 patients using large multi-institutional chest CT dataset.

BACKGROUND Although previous studies have reported encouraging results for deep learning (DL) in COVID-19 prognostication, clinical adoption of the developed methods remains limited. To address this limitation, we set out to predict the prognosis of a large multi-institutional cohort of patients with COVID-19 using a DL-based model.

PURPOSE This study aimed to evaluate the performance of deep privacy-preserving federated learning (DPFL) in predicting COVID-19 outcomes using chest CT images.

METHODS After applying inclusion and exclusion criteria, 3055 patients from 19 centers, including 1599 alive and 1456 deceased, were enrolled in this study. Data from all centers were split randomly, with stratification by center and class, into a training/validation set (70%/10%) and a hold-out test set (20%). For the DL model, feature extraction was performed on 2D slices, and averaging was performed at the final layer to construct a 3D model for each scan. The DenseNet model was used for feature extraction. The model was developed using both centralized and federated learning (FL) approaches. For FL, we employed DPFL approaches. Resistance to membership inference attacks was also evaluated for the FL strategy. For model evaluation, different metrics were reported on the hold-out test set. In addition, the models trained in the two scenarios, centralized and FL, were compared using the DeLong test for statistical differences.

RESULTS The centralized model achieved an accuracy of 0.76, while the DPFL model had an accuracy of 0.75. Both the centralized and DPFL models achieved a specificity of 0.77. The centralized model achieved a sensitivity of 0.74, while the DPFL model had a sensitivity of 0.73. The centralized and DPFL models achieved mean AUCs of 0.82 (95% CI: 0.79-0.85) and 0.81 (95% CI: 0.77-0.84), respectively. The DeLong test did not show a statistically significant difference between the two models (p-value = 0.98). The AUC values for the inference attacks fluctuated between 0.49 and 0.51, that is, close to chance level, with an average of 0.50 ± 0.003 and a 95% CI for the mean AUC of 0.500 to 0.501.

CONCLUSION The performance of the proposed model was comparable to that of centralized models while operating on large and heterogeneous multi-institutional datasets. In addition, the model was resistant to inference attacks, ensuring the privacy of shared data during the training process.
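The data split described in METHODS (random, stratified by center and class, 70%/10%/20%) can be illustrated with a minimal sketch. The record layout `(patient_id, center, label)`, the function name `stratified_split`, the seed, and the synthetic demo cohort are all assumptions made for illustration; they are not taken from the study's code, and the actual implementation may differ.

```python
import random
from collections import defaultdict


def stratified_split(records, fractions=(0.7, 0.1, 0.2), seed=0):
    """Split records into train/val/test, stratified by (center, label).

    records: list of (patient_id, center, label) tuples (hypothetical schema).
    fractions: (train, val, test) proportions; 70/10/20 as in the abstract.
    """
    rng = random.Random(seed)

    # Group patients so each (center, class) stratum is split on its own.
    strata = defaultdict(list)
    for rec in records:
        _, center, label = rec
        strata[(center, label)].append(rec)

    train, val, test = [], [], []
    for key in sorted(strata):  # sorted for reproducible ordering
        group = strata[key]
        rng.shuffle(group)
        n_test = round(len(group) * fractions[2])
        n_val = round(len(group) * fractions[1])
        test.extend(group[:n_test])
        val.extend(group[n_test:n_test + n_val])
        train.extend(group[n_test + n_val:])  # remainder goes to training
    return train, val, test


# Synthetic demo cohort: 3 centers x 2 classes x 50 patients (not study data).
records = [
    (f"{center}-{label}-{i}", center, label)
    for center in ("A", "B", "C")
    for label in (0, 1)
    for i in range(50)
]
train, val, test = stratified_split(records)
print(len(train), len(val), len(test))
```

Because each stratum is shuffled and split separately, every center and outcome class is represented in the hold-out test set in roughly the same proportion as in the full cohort, which matters when centers differ in size and mortality rate.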
