Explainable machine learning predictions to support personalized cardiology strategies

Abstract Aims A widely practiced intervention to modify cardiac health, the effect of physical activity on older adults is likely heterogeneous. While machine learning (ML) models that combine various systemic signals may aid in predictive modelling, the inability to rationalize predictions at a patient personalized level is a major shortcoming in the current field of ML. Methods and results We applied a novel methodology, SHapley Additive exPlanations (SHAP), on a dataset of older adults n = 86 (mean age 72 ± 4 years) whose physical activity levels were studied alongside changes in their left ventricular (LV) structure. SHAP was tested to provide intelligible visualization on the magnitude of the impact of the features in their physical activity levels on their LV structure. As proof of concept, using repeated K-cross-validation on the train set (n = 68), we found the Random Forest Regressor with the most optimal hyperparameters, which achieved the lowest mean squared error. With the trained model, we evaluated its performance by reporting its mean absolute error and plotting the correlation on the test set (n = 18). Based on collective force plot, individually numbered patients are indicated on the horizontal axis, and each bandwidth implies the magnitude (i.e. effect) of physical parameters (higher in red; lower in blue) towards prediction of their LV structure. Conclusions As a tool that identified specific features in physical activity that predicted cardiac structure on a per patient level, our findings support a role for explainable ML to be incorporated into personalized cardiology strategies.

[1]  Xinghua Lu,et al.  Understanding Heart-Failure Patients EHR Clinical Features via SHAP Interpretation of Tree-Based Machine Learning Model Predictions , 2021, AMIA.

[2]  G. D. Barmparis,et al.  Detection of abnormal left ventricular geometry in patients without cardiovascular disease through machine learning: An ECG‐based approach , 2021, Journal of clinical hypertension.

[3]  G. D. Barmparis,et al.  Prediction of abnormal left ventricular geometry in patients without cardiovascular disease through machine learning: An ECG-based approach. , 2020, medRxiv.

[4]  Kipp W. Johnson,et al.  Machine learning prediction in cardiovascular diseases: a meta-analysis , 2020, Scientific Reports.

[5]  A. Sampath Dakshina Murthy,et al.  An automated detection of heart arrhythmias using machine learning technique: SVM , 2020 .

[6]  D. Panagiotakos,et al.  The impact of physical activity on healthy ageing trajectories: evidence from eight cohort studies , 2020, International Journal of Behavioral Nutrition and Physical Activity.

[7]  W. Koh,et al.  Associations between Skeletal Muscle and Myocardium in Aging: A Syndrome of “Cardio‐Sarcopenia”? , 2019, Journal of the American Geriatrics Society.

[8]  P. Rahko,et al.  Guidelines for Performing a Comprehensive Transthoracic Echocardiographic Examination in Adults: Recommendations from the American Society of Echocardiography. , 2019, Journal of the American Society of Echocardiography : official publication of the American Society of Echocardiography.

[9]  Carlos Guestrin,et al.  Anchors: High-Precision Model-Agnostic Explanations , 2018, AAAI.

[10]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[11]  J. Kai,et al.  Can machine-learning improve cardiovascular risk prediction using routine clinical data? , 2017, PloS one.

[12]  Matt J. Kusner,et al.  Counterfactual Fairness , 2017, NIPS.

[13]  M. Cesari,et al.  Physical activity and exercise as countermeasures to physical frailty and sarcopenia , 2017, Aging Clinical and Experimental Research.

[14]  Marco Tulio Ribeiro,et al.  “Why Should I Trust You?”: Explaining the Predictions of Any Classifier , 2016, NAACL.

[15]  Victor Mor-Avi,et al.  Recommendations for cardiac chamber quantification by echocardiography in adults: an update from the American Society of Echocardiography and the European Association of Cardiovascular Imaging. , 2015, European heart journal cardiovascular Imaging.

[16]  Jalaluddin Khan,et al.  Heart Disease Identification Method Using Machine Learning Classification in E-Healthcare , 2020, IEEE Access.

[17]  K. McKeown,et al.  Justification Narratives for Individual Classifications , 2014 .

[18]  U. Grömping Dependence of Variable Importance in Random Forests on the Shape of the Regressor Space , 2009 .

[19]  L. Breiman Random Forests , 2001, Machine Learning.