TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications

In high stakes applications such as healthcare and financial analytics, domain practitioners need interpretable predictive models before they can trust the predictions. Traditional machine learning models, e.g., logistic regression (LR), are inherently easy to interpret. However, many of these models aggregate time-series data without considering temporal correlations and variations. As a result, their performance cannot match that of recurrent neural network (RNN) based models, which are in turn difficult to interpret. In this paper, we propose a general framework, TRACER, to facilitate accurate and interpretable predictions, with a novel model, TITV, devised for healthcare analytics and other high stakes applications such as financial investment and risk management. Unlike LR and existing RNN-based models, TITV is designed to capture both time-invariant and time-variant feature importance: a feature-wise transformation subnetwork models the feature influence shared over the entire time series, and a self-attention subnetwork models the time-related importance. Healthcare analytics is adopted as the driving use case, and we note that TRACER is also applicable to other domains, e.g., fintech. We evaluate the accuracy of TRACER extensively on two real-world hospital datasets, and our doctors/clinicians further validate the interpretability of TRACER at both the patient level and the feature level. In addition, TRACER is validated in a critical financial application. The experimental results confirm that TRACER facilitates both accurate and interpretable analytics for high stakes applications.
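
To make the distinction between time-invariant and time-variant feature importance concrete, the following is a minimal NumPy sketch, not the TITV architecture itself, whose details are not given in the abstract. It assumes a single sample `x` of shape `(T, D)`; the mean-pooled summary, the weight names (`W_beta`, `W_q`, `W_k`, `W_v`, `b`), the additive combination, and the sigmoid output are all illustrative assumptions.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax along the given axis.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def make_params(d, h, seed=0):
    # Hypothetical small random weights, for illustration only.
    rng = np.random.default_rng(seed)
    return {
        "W_beta": 0.1 * rng.standard_normal((d, d)),
        "W_q": 0.1 * rng.standard_normal((d, h)),
        "W_k": 0.1 * rng.standard_normal((d, h)),
        "W_v": 0.1 * rng.standard_normal((d, d)),
        "b": 0.0,
    }

def importance_sketch(x, params):
    """x: (T, D) time series of one sample. Returns (prob, beta, alpha)."""
    # Time-invariant importance: one weight per feature, computed from a
    # summary of the whole series and shared across all time steps.
    summary = x.mean(axis=0)                               # (D,)
    beta = np.tanh(params["W_beta"] @ summary)             # (D,)

    # Time-variant importance: scaled dot-product self-attention over time
    # steps yields a separate weight per feature at each step.
    q = x @ params["W_q"]                                  # (T, H)
    k = x @ params["W_k"]                                  # (T, H)
    v = x @ params["W_v"]                                  # (T, D)
    attn = softmax(q @ k.T / np.sqrt(q.shape[1]), axis=-1) # (T, T)
    alpha = np.tanh(attn @ v)                              # (T, D)

    # Assumed combination: add the two importances, weight the inputs,
    # and squash the sum into a probability.
    importance = beta[None, :] + alpha                     # (T, D)
    logit = float((importance * x).sum() + params["b"])
    prob = 1.0 / (1.0 + np.exp(-logit))
    return prob, beta, alpha

x = np.random.default_rng(1).standard_normal((5, 3))       # T=5 steps, D=3 features
prob, beta, alpha = importance_sketch(x, make_params(d=3, h=4))
```

In this sketch, `beta` can be read as a per-feature weight for the whole series, while `alpha[t]` shows how a feature's weight shifts at step `t`; an interpretable model in this spirit would expose both to the practitioner.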
