A novel model to label delirium in an intensive care unit from clinician actions

Background In the intensive care unit (ICU), delirium is a common, acute, confusional state associated with high risk for short- and long-term morbidity and mortality. Machine learning (ML) has promise to address research priorities and improve delirium outcomes. However, due to clinical and billing conventions, delirium is often inconsistently or incompletely labeled in electronic health record (EHR) datasets. Here, we identify clinical actions abstracted from clinical guidelines in electronic health records (EHR) data that indicate risk of delirium among intensive care unit (ICU) patients. We develop a novel prediction model to label patients with delirium based on a large data set and assess model performance. Methods EHR data on 48,451 admissions from 2001 to 2012, available through Medical Information Mart for Intensive Care-III database (MIMIC-III), was used to identify features to develop our prediction models. Five binary ML classification models (Logistic Regression; Classification and Regression Trees; Random Forests; Naïve Bayes; and Support Vector Machines) were fit and ranked by Area Under the Curve (AUC) scores. We compared our best model with two models previously proposed in the literature for goodness of fit, precision, and through biological validation. Results Our best performing model with threshold reclassification for predicting delirium was based on a multiple logistic regression using the 31 clinical actions (AUC 0.83). Our model out performed other proposed models by biological validation on clinically meaningful, delirium-associated outcomes. Conclusions Hurdles in identifying accurate labels in large-scale datasets limit clinical applications of ML in delirium. We developed a novel labeling model for delirium in the ICU using a large, public data set. By using guideline-directed clinical actions independent from risk factors, treatments, and outcomes as model predictors, our classifier could be used as a delirium label for future clinically targeted models.

[1]  John P. Corradi,et al.  Prediction of Incident Delirium Using a Random Forest classifier , 2018, Journal of Medical Systems.

[2]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[3]  Jesse Davis,et al.  Learning from positive and unlabeled data: a survey , 2018, Machine Learning.

[4]  S. Abidi,et al.  Exploiting Machine Learning Algorithms and Methods for the Prediction of Agitated Delirium After Cardiac Surgery: Models Development and Validation Study , 2019, JMIR medical informatics.

[5]  F. Hamdy,et al.  Misclassification of outcome in case–control studies: Methods for sensitivity analysis , 2016, Statistical methods in medical research.

[6]  Jonathan H Chen,et al.  Assessing clinical heterogeneity in sepsis through treatment patterns and machine learning , 2019, J. Am. Medical Informatics Assoc..

[7]  Mark Newman,et al.  Designing risk prediction models for ambulatory no-shows across different specialties and clinics , 2018, J. Am. Medical Informatics Assoc..

[8]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[9]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[10]  M. E. Maron,et al.  Automatic Indexing: An Experimental Inquiry , 1961, JACM.

[11]  Theodore Speroff,et al.  Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. , 2004, JAMA.

[12]  E. Marcantonio,et al.  Cognitive trajectories after postoperative delirium. , 2012, The New England journal of medicine.

[13]  R. Jaeschke,et al.  Clinical Practice Guidelines for the Management of Pain, Agitation, and Delirium in Adult Patients in the Intensive Care Unit , 2013, Critical care medicine.

[14]  J. Devlin,et al.  Use of a validated delirium assessment tool improves the ability of physicians to identify delirium in medical intensive care unit patients , 2007, Critical care medicine.

[15]  G. Bernard,et al.  Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM-ICU). , 2001, JAMA.

[16]  J. Cerejeira,et al.  Identification of sub-groups in acutely ill elderly patients with delirium: a cluster analysis , 2016, International Psychogeriatrics.

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  K. Huybrechts,et al.  Evaluation of algorithms to identify delirium in administrative claims and drug utilization database , 2017, Pharmacoepidemiology and drug safety.

[19]  F. Santosa,et al.  Linear inversion of ban limit reflection seismograms , 1986 .

[20]  Parisa Rashidi,et al.  Delirium Prediction using Machine Learning Models on Predictive Electronic Health Records Data , 2017, 2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE).

[21]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[22]  E. Marcantonio Delirium in Hospitalized Older Adults , 2017, The New England journal of medicine.

[23]  G. Bernard,et al.  Long-term cognitive impairment after critical illness. , 2014, The New England journal of medicine.

[24]  Rui Xiao,et al.  Identifying surgical site infections in electronic health data using predictive models , 2018, J. Am. Medical Informatics Assoc..

[25]  Leo A. Celi,et al.  The MIMIC Code Repository: enabling reproducibility in critical care research , 2017, J. Am. Medical Informatics Assoc..

[26]  E. Marcantonio Delirium , 2011, Annals of Internal Medicine.

[27]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[28]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[29]  John F. Hurdle,et al.  Measuring diagnoses: ICD code accuracy. , 2005, Health services research.

[30]  Patrick B. Ryan,et al.  Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data , 2018, J. Am. Medical Informatics Assoc..

[31]  A. Sillner,et al.  Performance of Electronic Prediction Rules for Prevalent Delirium at Hospital Admission , 2018, JAMA network open.

[32]  P. Eikelenboom,et al.  Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta-analysis. , 2010, JAMA.

[33]  S. Bush,et al.  Delirium diagnosis, screening and management , 2014, Current opinion in supportive and palliative care.

[34]  Jinmiao Huang,et al.  An Empirical Evaluation of Deep Learning for ICD-9 Code Assignment using MIMIC-III Clinical Notes , 2018, Comput. Methods Programs Biomed..

[35]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[36]  A. Ciampi,et al.  Latent class analysis of the multivariate Delirium Index in long-term care settings , 2018, International Psychogeriatrics.

[37]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[38]  B. Wells,et al.  Strategies for Handling Missing Data in Electronic Health Record Derived Data , 2013, EGEMS.

[39]  Frank E. Harrell,et al.  Prognostic/Clinical Prediction Models: Development of a Clinical Prediction Model for an Ordinal Outcome: The World Health Organization Multicentre Study of Clinical Signs and Etiological Agents of Pneumonia, Sepsis and Meningitis in Young Infants , 2005 .

[40]  E. Marcantonio,et al.  The Language of Delirium: Keywords for Identifying Delirium from Medical Records. , 2015, Journal of gerontological nursing.

[41]  Boreom Lee,et al.  Prediction and early detection of delirium in the intensive care unit by using heart rate variability and machine learning , 2018, Physiological measurement.

[42]  H. Stelfox,et al.  Incidence and Prevalence of Delirium Subtypes in an Adult ICU: A Systematic Review and Meta-Analysis* , 2018, Critical care medicine.

[43]  J. Saczynski,et al.  Delirium in elderly people , 2014, The Lancet.

[44]  E. Ely,et al.  COVID-19: ICU delirium management during SARS-CoV-2 pandemic , 2020, Critical Care.

[45]  Sharon K. Inouye,et al.  Delirium in elderly adults: diagnosis, prevention and treatment , 2009, Nature Reviews Neurology.

[46]  G. Bernard,et al.  Long-term cognitive impairment after critical illness. , 2013, The New England journal of medicine.

[47]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[48]  Albert T. Young,et al.  Development and Validation of an Electronic Health Record–Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment , 2018, JAMA network open.

[49]  C. Wiener Harrison's principles of internal medicine , 2013 .

[50]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[51]  Aram Galstyan,et al.  Multitask learning and benchmarking with clinical time series data , 2017, Scientific Data.

[52]  Kevin R Coombes,et al.  Unsupervised machine learning and prognostic factors of survival in chronic lymphocytic leukemia , 2020, J. Am. Medical Informatics Assoc..

[53]  F. Harrell,et al.  Development of a clinical prediction model for an ordinal outcome: the World Health Organization Multicentre Study of Clinical Signs and Etiological agents of Pneumonia, Sepsis and Meningitis in Young Infants. WHO/ARI Young Infant Multicentre Study Group. , 1998, Statistics in medicine.

[54]  Jan Horsky,et al.  Accuracy and Completeness of Clinical Coding Using ICD-10 for Ambulatory Visits , 2017, AMIA.

[55]  Günter Schreier,et al.  On the Representation of Machine Learning Results for Delirium Prediction in a Hospital Information System in Routine Care , 2018, ICIMTH.

[56]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[57]  G. Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD Statement , 2015, Annals of Internal Medicine.

[58]  J. Driver,et al.  Validation of a Delirium Risk Assessment Using Electronic Medical Record Information. , 2016, Journal of the American Medical Directors Association.

[59]  Y. Skrobik,et al.  Intensive Care Delirium Screening Checklist: evaluation of a new screening tool , 2001, Intensive Care Medicine.

[60]  E. Marcantonio,et al.  Effect of Delirium and Other Major Complications on Outcomes After Elective Surgery in Older Adults. , 2015, JAMA surgery.

[61]  James R. Johnson,et al.  Delirium in Hospitalized Older Adults. , 2018, The New England journal of medicine.

[62]  G. Arbanas Diagnostic and Statistical Manual of Mental Disorders (DSM-5) , 2015 .

[63]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[64]  Stephanie T. Lanza,et al.  Sensitivity and Specificity of Information Criteria , 2018, bioRxiv.

[65]  G. Shan,et al.  Fisher’s exact approach for post hoc analysis of a chi-squared test , 2017, PloS one.

[66]  T. Therneau,et al.  An Introduction to Recursive Partitioning Using the RPART Routines , 2015 .

[67]  Gilbert Reibnegger,et al.  Optimum binary cut-off threshold of a diagnostic test: comparison of different methods using Monte Carlo technique , 2014, BMC Medical Informatics and Decision Making.