Using Transfer Learning for Improved Mortality Prediction in a Data-Scarce Hospital Setting

Algorithm–based clinical decision support (CDS) systems associate patient-derived health data with outcomes of interest, such as in-hospital mortality. However, the quality of such associations often depends on the availability of site-specific training data. Without sufficient quantities of data, the underlying statistical apparatus cannot differentiate useful patterns from noise and, as a result, may underperform. This initial training data burden limits the widespread, out-of-the-box, use of machine learning–based risk scoring systems. In this study, we implement a statistical transfer learning technique, which uses a large “source” data set to drastically reduce the amount of data needed to perform well on a “target” site for which training data are scarce. We test this transfer technique with AutoTriage, a mortality prediction algorithm, on patient charts from the Beth Israel Deaconess Medical Center (the source) and a population of 48 249 adult inpatients from University of California San Francisco Medical Center (the target institution). We find that the amount of training data required to surpass 0.80 area under the receiver operating characteristic (AUROC) on the target set decreases from more than 4000 patients to fewer than 220. This performance is superior to the Modified Early Warning Score (AUROC: 0.76) and corresponds to a decrease in clinical data collection time from approximately 6 months to less than 10 days. Our results highlight the usefulness of transfer learning in the specialization of CDS systems to new hospital sites, without requiring expensive and time-consuming data collection efforts.

[1]  Hamid Mohamadlou,et al.  High-performance detection and early prediction of septic shock for alcohol-use disorder patients , 2016, Annals of medicine and surgery.

[2]  S. Lemeshow,et al.  A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. , 1993, JAMA.

[3]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[4]  E. Draper,et al.  APACHE II: A severity of disease classification system , 1985, Critical care medicine.

[5]  Ilya Narsky,et al.  Statistical Analysis Techniques in Particle Physics , 2013 .

[6]  Peter J. Haug,et al.  Early Detection of Sepsis in the Emergency Department using Dynamic Bayesian Networks , 2012, AMIA.

[7]  Jenna Wiens,et al.  A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions , 2014, J. Am. Medical Informatics Assoc..

[8]  C. Subbe,et al.  Validation of physiological scoring systems in the accident and emergency department , 2006, Emergency Medicine Journal.

[9]  Cheng Soon Ong,et al.  Multivariate spearman's ρ for aggregating ranks using copulas , 2016 .

[10]  S L Hui,et al.  Validation techniques for logistic regression models. , 1991, Statistics in medicine.

[11]  Christopher W. Barton,et al.  A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[12]  Christopher W. Barton,et al.  Discharge recommendation based on a novel technique of homeostatic analysis , 2017, J. Am. Medical Informatics Assoc..

[13]  Leo Anthony Celi,et al.  A Database-driven Decision Support System: Customized Mortality Prediction , 2012, Journal of personalized medicine.

[14]  R. Bone,et al.  Toward an epidemiology and natural history of SIRS (systemic inflammatory response syndrome) , 1992, JAMA.

[15]  Marleen de Bruijne,et al.  Transfer Learning Improves Supervised Image Segmentation Across Imaging Protocols , 2015, IEEE Trans. Medical Imaging.

[16]  G. Fetherston,et al.  The medical emergency team , 2001, The Medical journal of Australia.

[17]  J. Vincent,et al.  The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure , 1996, Intensive Care Medicine.

[18]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[19]  Bernhard Schölkopf,et al.  Domain Adaptation with Conditional Transferable Components , 2016, ICML.

[20]  Vladimir Vovk,et al.  Empirical Inference - Festschrift in Honor of Vladimir N. Vapnik , 2014, Empirical Inference.

[21]  P. Pronovost,et al.  A targeted real-time early warning score (TREWScore) for septic shock , 2015, Science Translational Medicine.

[22]  John V. Guttag,et al.  Instance Weighting for Patient-Specific Risk Stratification Models , 2015, KDD.

[23]  Hans-Peter Kriegel,et al.  Integrating structured biological data by Kernel Maximum Mean Discrepancy , 2006, ISMB.

[24]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[25]  Jenna Wiens,et al.  Patient Risk Stratification with Time-Varying Parameters: A Multitask Learning Approach , 2016, J. Mach. Learn. Res..

[26]  Uli K. Chettipally,et al.  Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach , 2016, JMIR medical informatics.

[27]  Christopher Barton,et al.  A computational approach to mortality prediction of alcohol use disorder inpatients , 2016, Comput. Biol. Medicine.

[28]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[29]  Hamid Mohamadlou,et al.  Using electronic health record collected clinical variables to predict medical intensive care unit mortality , 2016, Annals of medicine and surgery.

[30]  D. Cox Two further applications of a model for binary regression , 1958 .