Improving ECG Classification Interpretability using Saliency Maps

Cardiovascular disease is a large worldwide healthcare issue; symptoms often present suddenly with minimal warning. The electrocardiogram (ECG) is a fast, simple and reliable method of evaluating the health of the heart, by measuring electrical activity recorded through electrodes placed on the skin. ECGs often need to be analyzed by a cardiologist, taking time which could be spent on improving patient care and outcomes.Because of this, automatic ECG classification systems using machine learning have been proposed, which can learn complex interactions between ECG features and use this to detect abnormalities. However, algorithms built for this purpose often fail to generalize well to unseen data, reporting initially impressive results which drop dramatically when applied to new environments. Additionally, machine learning algorithms suffer a ‘black-box’ issue, in which it is difficult to determine how a decision has been made. This is vital for applications in healthcare, as clinicians need to be able to verify the process of evaluation in order to trust the algorithm.This paper proposes a method for visualizing model decisions across each class in the MIT-BIH arrhythmia dataset, using adapted saliency maps averaged across complete classes to determine what patterns are being learned. We do this by building two algorithms based on state-of-the-art models. This paper highlights how these maps can be used to find problems in the model which could be affecting generalizability and model performance. Comparing saliency maps across complete classes gives an overall impression of confounding variables or other biases in the model, unlike what would be highlighted when comparing saliency maps on an ECG-by-ECG basis.

[1]  Timothy D. Solberg,et al.  Expert-augmented machine learning , 2019, Proceedings of the National Academy of Sciences.

[2]  G.B. Moody,et al.  The impact of the MIT-BIH Arrhythmia Database , 2001, IEEE Engineering in Medicine and Biology Magazine.

[3]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  N. Boyle,et al.  The Enduring Role of the Electrocardiogram as a Diagnostic Tool in Cardiology. , 2017, Journal of the American College of Cardiology.

[5]  Saroj Kumar Pandey,et al.  Automatic detection of arrhythmia from imbalanced ECG database using CNN model with SMOTE , 2019, Australasian Physical & Engineering Sciences in Medicine.

[6]  Masoumeh Haghpanahi,et al.  Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network , 2019, Nature Medicine.

[7]  Nils Strodthoff,et al.  Detecting and interpreting myocardial infarction using fully convolutional neural networks , 2018, Physiological measurement.

[8]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[9]  Peng Lu,et al.  An Effective LSTM Recurrent Network to Detect Arrhythmia on Imbalanced ECG Dataset , 2019, Journal of healthcare engineering.

[10]  Mustafa Suleyman,et al.  Key challenges for delivering clinical impact with artificial intelligence , 2019, BMC Medicine.

[11]  P. Mele,et al.  The ECG dilemma: guidelines on improving interpretation. , 2008, Journal of healthcare risk management : the journal of the American Society for Healthcare Risk Management.

[12]  Xia Hu,et al.  Techniques for interpretable machine learning , 2018, Commun. ACM.

[13]  H. Tun,et al.  Analysis of Computer Aided Identification System for ECG Characteristic Points , 2015 .

[14]  Naif Alajlan,et al.  Dense Convolutional Networks With Focal Loss and Image Generation for Electrocardiogram Classification , 2019, IEEE Access.

[15]  C. Tracy,et al.  ACC/AHA clinical competence statement on electrocardiography and ambulatory electrocardiography. A report of the ACC/AHA/ACP-ASIM Task Force on Clinical Competence (ACC/AHA Committee to Develop a Clinical Competence Statement on Electrocardiography and Ambulatory Electrocardiography). , 2001, Journal of the American College of Cardiology.

[16]  Thomas B. Schön,et al.  Automatic diagnosis of the 12-lead ECG using a deep neural network , 2020, Nature Communications.

[17]  Mohanasankar Sivaprakasam,et al.  Interpreting Deep Neural Networks for Single-Lead ECG Arrhythmia Classification , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[18]  Ankur Teredesai,et al.  Interpretable Machine Learning in Healthcare , 2018, 2018 IEEE International Conference on Healthcare Informatics (ICHI).

[19]  Cengiz Öztireli,et al.  Towards better understanding of gradient-based attribution methods for Deep Neural Networks , 2017, ICLR.