Explainability Matters: Backdoor Attacks on Medical Imaging

Deep neural networks have been shown to be vulnerable to backdoor attacks, which can be introduced easily by tampering with the training set before model training. Recent work has focused on backdoor attacks on natural images or toy datasets. Consequently, the impact of backdoors is not yet fully understood in complex real-world applications such as medical imaging, where misdiagnosis can be very costly. In this paper, we explore the impact of backdoor attacks on a multi-label disease classification task using chest radiography, under the assumption that the attacker can manipulate the training dataset to execute the attack. Extensive evaluation of a state-of-the-art architecture demonstrates that, by introducing images with few-pixel perturbations into the training set, an attacker can execute the backdoor successfully without being involved in the training procedure. A simple 3$\times$3 pixel trigger can achieve an Area Under the Receiver Operating Characteristic curve (AUROC) of up to 1.00 on the set of infected images. On the set of clean images, the backdoored neural network can still achieve up to 0.85 AUROC, highlighting the stealthiness of the attack. As the use of deep-learning-based diagnostic systems proliferates in clinical practice, we also show that explainability is indispensable in this context, as it can identify spatially localized backdoors at inference time.
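To make the threat model concrete, the following is a minimal sketch of training-set poisoning with a 3$\times$3 pixel trigger. It is not the procedure used in the paper: the function names (`stamp_trigger`, `poison_dataset`), the trigger position (bottom-right corner), the trigger intensity, the poisoning rate, and the choice to switch on a single target label are all assumptions made for illustration. Images are assumed to be grayscale arrays of shape (N, H, W) scaled to [0, 1], with multi-label targets of shape (N, C).

```python
import numpy as np


def stamp_trigger(image, size=3, value=1.0):
    """Return a copy of a 2-D image with a size x size patch in the
    bottom-right corner set to `value` (hypothetical trigger design)."""
    poisoned = image.copy()
    poisoned[-size:, -size:] = value
    return poisoned


def poison_dataset(images, labels, target_class, rate=0.1, seed=0):
    """Stamp the trigger on a random fraction of images and switch on
    the target label for those images.

    images: float array of shape (N, H, W)
    labels: binary array of shape (N, C), one column per disease label
    Returns poisoned copies of both arrays and the sorted poisoned indices.
    """
    rng = np.random.default_rng(seed)
    n_samples = len(images)
    n_poison = int(round(rate * n_samples))
    # Sample without replacement so each image is poisoned at most once.
    idx = np.sort(rng.choice(n_samples, size=n_poison, replace=False))

    images_p = images.copy()
    labels_p = labels.copy()
    for i in idx:
        images_p[i] = stamp_trigger(images_p[i])
        labels_p[i, target_class] = 1  # assumed attacker-chosen label
    return images_p, labels_p, idx
```

Under these assumptions, the attacker only needs write access to the stored training data: a model later trained on `images_p` and `labels_p` can learn to associate the patch with the target label, while the unmodified samples keep clean-data performance close to normal.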
