The Dependence of Machine Learning on Electronic Medical Record Quality

There is growing interest in applying machine learning methods to Electronic Medical Records (EMR). Across different institutions, however, EMR quality can vary widely. This work investigated the impact of this disparity on the performance of three advanced machine learning algorithms: logistic regression, multilayer perceptron, and recurrent neural network. The EMR disparity was emulated using different permutations of the EMR collected at Children's Hospital Los Angeles (CHLA) Pediatric Intensive Care Unit (PICU) and Cardiothoracic Intensive Care Unit (CTICU). The algorithms were trained using patients from the PICU to predict in-ICU mortality for patients on a held out set of PICU and CTICU patients. The disparate patient populations between the PICU and CTICU provide an estimate of generalization errors across different ICUs. We quantified and evaluated the generalization of these algorithms on varying EMR size, input types, and fidelity of data.

[1]  J. L. Gall,et al.  APACHE II--a severity of disease classification system. , 1986, Critical care medicine.

[2]  David J. Stone,et al.  "Big data" in the intensive care unit. Closing the data loop. , 2013, American journal of respiratory and critical care medicine.

[3]  K H Wesseling,et al.  Non-invasive continuous finger blood pressure measurement during orthostatic stress compared to intra-arterial pressure. , 1990, Cardiovascular research.

[4]  Steven Walczak,et al.  An Empirical Analysis of Data Requirements for Financial Forecasting with Neural Networks , 2001, J. Manag. Inf. Syst..

[5]  M. Balaan,et al.  Acute Respiratory Distress Syndrome , 2016, Critical care nursing quarterly.

[6]  Gonçalo R. Abecasis,et al.  Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma , 2007, Nature.

[7]  A. Evans,et al.  Prostate cancer detection with multi‐parametric MRI: Logistic regression analysis of quantitative T2, diffusion‐weighted imaging, and dynamic contrast‐enhanced MRI , 2009, Journal of magnetic resonance imaging : JMRI.

[8]  C. Sprung,et al.  Surviving Sepsis Campaign: International Guidelines for Management of Severe Sepsis and Septic Shock 2012 , 2013, Critical care medicine.

[9]  Yuichi Nakamura,et al.  Approximation of dynamical systems by continuous time recurrent neural networks , 1993, Neural Networks.

[10]  F. Shann,et al.  PIM2: a revised version of the Paediatric Index of Mortality , 2003, Intensive Care Medicine.

[11]  M. Ghassemi,et al.  State of the art review: the data revolution in critical care , 2015, Critical Care.

[12]  Richard Holubkov,et al.  The Pediatric Risk of Mortality Score: Update 2015* , 2016, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[13]  Murray M Pollack Severity of Illness Confusion. , 2016, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[14]  Arthur Flexer,et al.  Statistical evaluation of neural networks experiments: Minimum requirements and current practice , 1994 .

[15]  Melissa Aczon,et al.  2: EARLY PREDICTION OF PATIENT DETERIORATION USING MACHINE LEARNING TECHNIQUES WITH TIME SERIES DATA , 2016, Critical care medicine.

[16]  Andrew James,et al.  Big Data in the Intensive Care Unit , 2017, AMIA.

[17]  Krzysztof J. Cios,et al.  Uniqueness of medical data mining , 2002, Artif. Intell. Medicine.

[18]  J. Henry,et al.  Adoption of Electronic Health Record Systems among U . S . Non-Federal Acute Care Hospitals : 2008-2015 , 2013 .

[19]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[20]  Melissa Aczon,et al.  Dynamic Mortality Risk Predictions in Pediatric Critical Care Using Recurrent Neural Networks , 2017, ArXiv.

[21]  M. Levy,et al.  Surviving Sepsis Campaign: International guidelines for management of severe sepsis and septic shock: 2008 , 2007, Intensive Care Medicine.

[22]  Jimeng Sun,et al.  Using recurrent neural network models for early detection of heart failure onset , 2016, J. Am. Medical Informatics Assoc..

[23]  Robyn Norton,et al.  A comparison of albumin and saline for fluid resuscitation in the Intensive Care unit , 2005 .

[24]  U. Ruttimann,et al.  PRISM III: an updated Pediatric Risk of Mortality score. , 1996, Critical care medicine.