Predicting Post-stroke Hospital Discharge Disposition Using Interpretable Machine Learning Approaches

Stroke is the fifth leading cause of death for Americans. Due to critical consequences and cost, an efficient stroke system of care (i.e., acute treatment, post-stroke acute rehabilitation) is necessary. Early determination of hospital discharge disposition is important for stroke management, and can make an immense impact in optimizing acute treatment and planning post-acute rehabilitation with desired outcomes. With the rise of sophisticated machine learning models, many researchers have gained momentum in using such models in their studies. However, due to the lack of explanations that models provide, the integrity of the prediction result is often challenged. Our goal is to predict post-stroke hospital discharge disposition using machine learning models to increase the prediction capability while providing explanations for the results using an interpretation method, such as Local Interpretable Model-agnoistic Explanations (LIME). Our results demonstrate the effectiveness of LIME in providing interpretability to machine learning models and suggests further exploration in performance improvement.

[1]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[2]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[3]  Samin K. Sharma,et al.  A risk score to predict in-hospital mortality for percutaneous coronary interventions. , 2006, Journal of the American College of Cardiology.

[4]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[5]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[6]  Paul R. Yarnold,et al.  Reading and Understanding Multivariate Statistics , 1995 .

[7]  J. Hilbe Logistic Regression Models , 2009 .

[8]  Ralph B D'Agostino,et al.  Presentation of multivariate data for clinical use: The Framingham Study risk score functions. , 2005, Statistics in medicine.

[9]  Chandan Singh,et al.  Hierarchical interpretations for neural network predictions , 2018, ICLR.

[10]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Mina Sartipi,et al.  Post-stroke discharge disposition prediction using deep learning , 2017, SoutheastCon 2017.

[12]  Klaus-Robert Müller,et al.  Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models , 2017, ArXiv.

[13]  O. Blatchford,et al.  A risk score to predict need for treatment for uppergastrointestinal haemorrhage , 2000, The Lancet.

[14]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  Kristin L. Sainani,et al.  Logistic Regression , 2014, PM & R : the journal of injury, function, and rehabilitation.

[17]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[18]  G. Heath,et al.  Hospital Discharge Disposition of Stroke Patients in Tennessee , 2017, Southern medical journal.