Interval Coded Scoring: a toolbox for interpretable scoring systems

Over the last decades, clinical decision support systems have been gaining importance. They help clinicians to make effective use of the overload of available information to obtain correct diagnoses and appropriate treatments. However, their power often comes at the cost of a black box model which cannot be interpreted easily. This interpretability is of paramount importance in a medical setting with regard to trust and (legal) responsibility. In contrast, existing medical scoring systems are easy to understand and use, but they are often a simplified rule-of-thumb summary of previous medical experience rather than a well-founded system based on available data. Interval Coded Scoring (ICS) connects these two approaches, exploiting the power of sparse optimization to derive scoring systems from training data. The presented toolbox interface makes this theory easily applicable to both small and large datasets. It contains two possible problem formulations based on linear programming or elastic net. Both allow to construct a model for a binary classification problem and establish risk profiles that can be used for future diagnosis. All of this requires only a few lines of code. ICS differs from standard machine learning through its model consisting of interpretable main effects and interactions. Furthermore, insertion of expert knowledge is possible because the training can be semi-automatic. This allows end users to make a tradeoff between complexity and performance based on cross-validation results and expert knowledge. Additionally, the toolbox offers an accessible way to assess classification performance via accuracy and the ROC curve, whereas the calibration of the risk profile can be evaluated via a calibration curve. Finally, the colour-coded model visualization has particular appeal if one wants to apply ICS manually on new observations, as well as for validation by experts in the specific application domains. The validity and applicability of the toolbox is demonstrated by comparing it to standard Machine Learning approaches such as Naive Bayes and Support VectorMachines for several reallife datasets. These case studies on medical problems show its applicability as a decision support system. ICS performs similarly in terms of classification and calibration. Its slightly lower performance is countered by its model simplicity which makes it the method of choice if interpretability is a key issue. Subjects Data Mining and Machine Learning, Data Science, Optimization Theory and Computation

[1]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[2]  Kaustubh Supekar,et al.  Estimation of functional connectivity in fMRI data using stability selection-based sparse partial correlation with elastic net penalty , 2012, NeuroImage.

[3]  Hao Xu,et al.  The generalized lasso is reducible to a subspace constrained lasso , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Barry R. Greene,et al.  Assessment and Classification of Early-Stage Multiple Sclerosis With Inertial Sensors: Comparison Against Clinical Measures of Disease State , 2015, IEEE Journal of Biomedical and Health Informatics.

[5]  Francesca Pennecchi,et al.  Comparison of calibration curves using the Lp norm , 2009 .

[6]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[7]  Sabine Van Huffel,et al.  Interval Coded Scoring extensions for larger problems , 2017, 2017 IEEE Symposium on Computers and Communications (ISCC).

[8]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[9]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[10]  Gregory Y H Lip,et al.  Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based approach: the euro heart survey on atrial fibrillation. , 2010, Chest.

[11]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[12]  Yuan Qi,et al.  Identifying Neuroimaging and Proteomic Biomarkers for MCI and AD via the Elastic Net , 2011, MBIA.

[13]  Daoqiang Zhang,et al.  Machine Learning Techniques for AD/MCI Diagnosis and Prognosis , 2014, Machine Learning in Healthcare Informatics.

[14]  D. Ayres-de- Campos,et al.  SisPorto 2.0: a program for automated analysis of cardiotocograms. , 2000, The Journal of maternal-fetal medicine.

[15]  Byeong-Ho Jeong,et al.  Performances of prognostic scoring systems in patients with healthcare-associated pneumonia. , 2013, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[16]  Constantin F. Aliferis,et al.  Medical decision support using machine learning for early detection of late-onset neonatal sepsis , 2014, J. Am. Medical Informatics Assoc..

[17]  Joseph S. Valacich,et al.  The Influence of Task Interruption on Individual Decision Making: An Information Overload Perspective , 1999 .

[18]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[19]  J. Czerniak,et al.  Application of rough sets in the presumptive diagnosis of urinary system diseases , 2003 .

[20]  Alfredo Alvarado,et al.  A Practical Score for the Early Diagnosis of Acute Appendicitis , 1986 .

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  AlpaydinEthem,et al.  Cost-Conscious Comparison of Supervised Learning Algorithms over Multiple Data Sets , 2008 .

[23]  M. Dougados,et al.  Development of an ASAS-endorsed disease activity score (ASDAS) in patients with ankylosing spondylitis , 2008, Annals of the rheumatic diseases.

[24]  J. Habbema,et al.  Prognostic Modeling with Logistic Regression Analysis , 2001, Medical decision making : an international journal of the Society for Medical Decision Making.

[25]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[26]  N. B. Venkateswarlu,et al.  A Critical Comparative Study of Liver Patients from USA and INDIA: An Exploratory Analysis , 2012 .

[27]  Chandan Chakraborty,et al.  Wavelet-based Machine Learning Techniques for ECG Signal Analysis , 2014, Machine Learning in Healthcare Informatics.

[28]  R. Haynes,et al.  Effects of Computer-based Clinical Decision Support Systems on Clinician Performance and Patient Outcome: A Critical Appraisal of Research , 1994, Annals of Internal Medicine.

[29]  Walter F. Stewart,et al.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks , 2015, MLHC.

[30]  Hai-Ning Liang,et al.  Overview of the Health Informatics Research Field: A Bibliometric Approach , 2010, E-HEALTH.

[31]  Andrew P. Bradley,et al.  Rule extraction from support vector machines: A review , 2010, Neurocomputing.

[32]  Eta S. Berner,et al.  Overview of Clinical Decision Support Systems , 2016 .

[33]  R. Duda,et al.  Expert Systems Research. , 1983, Science.

[34]  H. Mcdonald,et al.  Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: a systematic review. , 2005, JAMA.

[35]  Cynthia Rudin,et al.  Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model , 2015, ArXiv.

[36]  Sabine Van Huffel,et al.  Interval Coded Scoring Index with Interaction Effects - A Sensitivity Study , 2016, ICPRAM.

[37]  Stephen P. Boyd,et al.  A Mathematical Model for Interpretable Clinical Decision Support with Applications in Gynecology , 2012, PloS one.

[38]  Sreerupa Das,et al.  Machine learning for improved diagnosis and prognosis in healthcare , 2017, 2017 IEEE Aerospace Conference.

[39]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[40]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[41]  Olivier Chapelle,et al.  Training a Support Vector Machine in the Primal , 2007, Neural Computation.

[42]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[43]  Yixin Chen,et al.  A Reduction of the Elastic Net to Support Vector Machines with an Application to GPU Computing , 2014, AAAI.

[44]  D J Protti,et al.  The Synergism of Health/Medical Informatics Revisited , 1995, Methods of Information in Medicine.

[45]  Dhiraj Yadav,et al.  Comparison of existing clinical scoring systems to predict persistent organ failure in patients with acute pancreatitis. , 2012, Gastroenterology.

[46]  P. Shekelle,et al.  Systematic Review: Impact of Health Information Technology on Quality, Efficiency, and Costs of Medical Care , 2006, Annals of Internal Medicine.

[47]  Cynthia Rudin,et al.  Supersparse linear integer models for optimized medical scoring systems , 2015, Machine Learning.

[48]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.