Federated learning framework integrating REFINED CNN and Deep Regression Forests

Abstract Summary Predictive learning from medical data incurs additional challenge due to concerns over privacy and security of personal data. Federated learning, intentionally structured to preserve high level of privacy, is emerging to be an attractive way to generate cross-silo predictions in medical scenarios. However, the impact of severe population-level heterogeneity on federated learners is not well explored. In this article, we propose a methodology to detect presence of population heterogeneity in federated settings and propose a solution to handle such heterogeneity by developing a federated version of Deep Regression Forests. Additionally, we demonstrate that the recently conceptualized REpresentation of Features as Images with NEighborhood Dependencies CNN framework can be combined with the proposed Federated Deep Regression Forests to provide improved performance as compared to existing approaches. Availability and implementation The Python source code for reproducing the main results are available on GitHub: https://github.com/DanielNolte/FederatedDeepRegressionForests. Contact ranadip.pal@ttu.edu Supplementary information Supplementary data are available at Bioinformatics Advances online.

[1]  Bjoern H Menze,et al.  The federated tumor segmentation (FeTS) tool: an open-source solution to further solid tumor research , 2022, Physics in medicine and biology.

[2]  Qiang Yang,et al.  An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application , 2022, ArXiv.

[3]  C. Vachon,et al.  MammoDL: Mammographic Breast Density Estimation using Federated Learning , 2022, ArXiv.

[4]  Bjoern H Menze,et al.  Federated learning enables big data for rare cancer boundary detection , 2022, Nature Communications.

[5]  Michael G. Rabbat,et al.  Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning , 2022, ArXiv.

[6]  Wei-Lun Chao,et al.  On Pre-Training for Federated Learning , 2022, ArXiv.

[7]  Colin B. Compas,et al.  Federated learning for predicting clinical outcomes in patients with COVID-19 , 2021, Nature Medicine.

[8]  Micah J. Sheller,et al.  OpenFL: the open federated learning library , 2021, Physics in medicine and biology.

[9]  Christos Davatzikos,et al.  The Federated Tumor Segmentation (FeTS) Challenge , 2021, ArXiv.

[10]  Efthymios Tzinis,et al.  Separate But Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data , 2021, 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[11]  R. Pal,et al.  Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction , 2020, Bioinform..

[12]  Bert Arnrich,et al.  Federated Learning in a Medical Context: A Systematic Literature Review , 2021, ACM Trans. Internet Techn..

[13]  Colin B. Compas,et al.  Federated Learning for Breast Density Classification: A Real-World Implementation , 2020, DART/DCL@MICCAI.

[14]  Spyridon Bakas,et al.  Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data , 2020, Scientific Reports.

[15]  Micah J. Sheller,et al.  The future of digital health with federated learning , 2020, npj Digital Medicine.

[16]  Yasaman Khazaeni,et al.  Federated Learning with Matched Averaging , 2020, ICLR.

[17]  Saugato Rahman Dhruba,et al.  REFINED (REpresentation of Features as Images with NEighborhood Dependencies): A novel feature representation for Convolutional Neural Networks , 2019, ArXiv.

[18]  Peter B. Walker,et al.  Federated Learning for Healthcare Informatics , 2019, Journal of Healthcare Informatics Research.

[19]  Anit Kumar Sahu,et al.  Federated Optimization in Heterogeneous Networks , 2018, MLSys.

[20]  Vitaly Shmatikov,et al.  How To Backdoor Federated Learning , 2018, AISTATS.

[21]  Junpu Wang,et al.  FedMD: Heterogenous Federated Learning via Model Distillation , 2019, ArXiv.

[22]  Emil Gustavsson,et al.  Federated Learning of Deep Neural Decision Forests , 2019, International Conference on Machine Learning, Optimization, and Data Science.

[23]  Amir Houmansadr,et al.  Comprehensive Privacy Analysis of Deep Learning: Passive and Active White-box Inference Attacks against Centralized and Federated Learning , 2018, 2019 IEEE Symposium on Security and Privacy (SP).

[24]  Vitaly Shmatikov,et al.  Exploiting Unintended Feature Leakage in Collaborative Learning , 2018, 2019 IEEE Symposium on Security and Privacy (SP).

[25]  Jacob D. Jaffe,et al.  Next-generation characterization of the Cancer Cell Line Encyclopedia , 2019, Nature.

[26]  Hubert Eichner,et al.  APPLIED FEDERATED LEARNING: IMPROVING GOOGLE KEYBOARD QUERY SUGGESTIONS , 2018, ArXiv.

[27]  Spyridon Bakas,et al.  Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation , 2018, BrainLes@MICCAI.

[28]  Z. Tezak,et al.  Right data for right patient—a precisionFDA NCI–CPTAC Multi-omics Mislabeling Challenge , 2018, Nature Medicine.

[29]  Yue Zhao,et al.  Federated Learning with Non-IID Data , 2018, ArXiv.

[30]  Jianhua Wu,et al.  Effects of PHA-665752 and vemurafenib combination treatment on in vitro and murine xenograft growth of human colorectal cancer cells with BRAFV600E mutations. , 2018, Oncology letters.

[31]  Bo Wang,et al.  Deep Regression Forests for Age Estimation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[33]  Peter Kontschieder,et al.  Deep Neural Decision Forests , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  A. C. Dubbelman,et al.  Disposition and metabolism of 14C-dovitinib (TKI258), an inhibitor of FGFR and VEGFR, after oral administration in patients with advanced solid tumors , 2012, Cancer Chemotherapy and Pharmacology.

[35]  Rob Knight,et al.  Supervised classification of microbiota mitigates mislabeling errors , 2011, The ISME Journal.

[36]  C. Rubbi,et al.  Nutlin-3, the small-molecule inhibitor of MDM2, promotes senescence and radiosensitises laryngeal carcinoma cells harbouring wild-type p53 , 2010, British Journal of Cancer.

[37]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.

[38]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .