Interpretable heartbeat classification using local model-agnostic explanations on ECGs

Treatment and prevention of cardiovascular diseases often rely on Electrocardiogram (ECG) interpretation. Dependent on the physician's variability, ECG interpretation is subjective and prone to errors. Machine learning models are often developed and used to support doctors; however, their lack of interpretability stands as one of the main drawbacks of their widespread operation. This paper focuses on an Explainable Artificial Intelligence (XAI) solution to make heartbeat classification more explainable using several state-of-the-art model-agnostic methods. We introduce a high-level conceptual framework for explainable time series and propose an original method that adds temporal dependency between time samples using the time series' derivative. The results were validated in the MIT-BIH arrhythmia dataset: we performed a performance's analysis to evaluate whether the explanations fit the model's behaviour; and employed the 1-D Jaccard's index to compare the subsequences extracted from an interpretable model and the XAI methods used. Our results show that the use of the raw signal and its derivative includes temporal dependency between samples to promote classification explanation. A small but informative user study concludes this study to evaluate the potential of the visual explanations produced by our original method for being adopted in real-world clinical settings, either as diagnostic aids or training resource.

[1]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[2]  Yading Yuan,et al.  Automatic Skin Lesion Segmentation Using Deep Fully Convolutional Networks With Jaccard Distance , 2017, IEEE Transactions on Medical Imaging.

[3]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[4]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[5]  David Paydarfar,et al.  Explaining Deep Classification of Time-Series Data with Learned Prototypes , 2019, KHD@IJCAI.

[6]  Bertram Taetz,et al.  Interpretability of Input Representations for Gait Classification in Patients after Total Hip Arthroplasty , 2020, Sensors.

[7]  Jingnan Liu,et al.  Investigating the impacting factors for the healthcare professionals to adopt artificial intelligence-based medical diagnosis support system (AIMDSS) , 2018, Annals of Operations Research.

[8]  Andreas Spanias,et al.  Attend and Diagnose: Clinical Time Series Analysis using Attention Models , 2017, AAAI.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Zachary C. Lipton,et al.  The mythos of model interpretability , 2018, Commun. ACM.

[11]  Jason Lines,et al.  A shapelet transform for time series classification , 2012, KDD.

[12]  Philip de Chazal,et al.  Automatic classification of heartbeats using ECG morphology and heartbeat interval features , 2004, IEEE Transactions on Biomedical Engineering.

[13]  S. Lapuschkin,et al.  Revealing the unique features of each individual's muscle activation signatures , 2021, Journal of the Royal Society Interface.

[14]  Hugo Gamboa,et al.  SSTS: A syntactic tool for pattern search on time series , 2019, Inf. Process. Manag..

[15]  U. Rajendra Acharya,et al.  Automated heartbeat classification and detection of arrhythmia using optimal orthogonal wavelet filters , 2019, Informatics in Medicine Unlocked.

[16]  Sherif Sakr,et al.  Interpretability in HealthCare A Comparative Study of Local Machine Learning Interpretability Techniques , 2019, 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS).

[17]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[18]  Manuel G. Penedo,et al.  Heartbeat classification fusing temporal and morphological information of ECGs via ensemble of classifiers , 2019, Biomed. Signal Process. Control..

[19]  Daniel A. Keim,et al.  Towards A Rigorous Evaluation Of XAI Methods On Time Series , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[20]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[21]  Xiting Wang,et al.  Towards better analysis of machine learning models: A visual analytics perspective , 2017, Vis. Informatics.

[22]  Francisco Herrera,et al.  Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI , 2020, Inf. Fusion.

[23]  Lars Schmidt-Thieme,et al.  Learning time-series shapelets , 2014, KDD.

[24]  Gustavo Carneiro,et al.  Producing Radiologist-Quality Reports for Interpretable Deep Learning. , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[25]  Seth Flaxman,et al.  European Union Regulations on Algorithmic Decision-Making and a "Right to Explanation" , 2016, AI Mag..

[26]  Eamonn J. Keogh,et al.  Derivative Dynamic Time Warping , 2001, SDM.

[27]  Henry J.L. Marriott,et al.  Ventricular Fusion Beats , 1962 .

[28]  Federico Cabitza,et al.  A giant with feet of clay: on the validity of the data that feed machine learning in medicine , 2017, Organizing for the Digital World.

[29]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[30]  Lalana Kagal,et al.  Explaining Explanations: An Overview of Interpretability of Machine Learning , 2018, 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA).

[31]  Amina Adadi,et al.  Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) , 2018, IEEE Access.

[32]  Federico Cabitza,et al.  As if sand were stone. New concepts and metrics to probe the ground on which to build trustable AI , 2020, BMC Medical Informatics and Decision Making.

[33]  Tomasz Górecki,et al.  Using derivatives in time series classification , 2012, Data Mining and Knowledge Discovery.

[34]  Hugo Gamboa,et al.  Time Alignment Measurement for Time Series , 2018, Pattern Recognit..

[35]  Alexandre Termier,et al.  Agnostic Local Explanation for Time Series Classification , 2019, 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI).

[36]  Tomasz Górecki,et al.  First and Second Derivatives in Time Series Classification Using DTW , 2014, Commun. Stat. Simul. Comput..

[37]  Federico Cabitza,et al.  Biases Affecting Human Decision Making in AI-Supported Second Opinion Settings , 2019, MDAI.

[38]  Matthew B. Blaschko,et al.  Optimizing the Dice Score and Jaccard Index for Medical Image Segmentation: Theory and Practice , 2019, MICCAI.