An “All-Data-on-Hand” Deep Learning Model to Predict Hospitalization for Diabetic Ketoacidosis in Youth With Type 1 Diabetes: Development and Validation Study

Background Although prior research has identified multiple risk factors for diabetic ketoacidosis (DKA), clinicians continue to lack clinic-ready models to predict dangerous and costly episodes of DKA. We asked whether we could apply deep learning, specifically the use of a long short-term memory (LSTM) model, to accurately predict the 180-day risk of DKA-related hospitalization for youth with type 1 diabetes (T1D). Objective We aimed to describe the development of an LSTM model to predict the 180-day risk of DKA-related hospitalization for youth with T1D. Methods We used 17 consecutive calendar quarters of clinical data (January 10, 2016, to March 18, 2020) for 1745 youths aged 8 to 18 years with T1D from a pediatric diabetes clinic network in the Midwestern United States. The input data included demographics, discrete clinical observations (laboratory results, vital signs, anthropometric measures, diagnosis, and procedure codes), medications, visit counts by type of encounter, number of historic DKA episodes, number of days since last DKA admission, patient-reported outcomes (answers to clinic intake questions), and data features derived from diabetes- and nondiabetes-related clinical notes via natural language processing. We trained the model using input data from quarters 1 to 7 (n=1377), validated it using input from quarters 3 to 9 in a partial out-of-sample (OOS-P; n=1505) cohort, and further validated it in a full out-of-sample (OOS-F; n=354) cohort with input from quarters 10 to 15. Results DKA admissions occurred at a rate of 5% per 180-days in both out-of-sample cohorts. In the OOS-P and OOS-F cohorts, the median age was 13.7 (IQR 11.3-15.8) years and 13.1 (IQR 10.7-15.5) years; median glycated hemoglobin levels at enrollment were 8.6% (IQR 7.6%-9.8%) and 8.1% (IQR 6.9%-9.5%); recall was 33% (26/80) and 50% (9/18) for the top-ranked 5% of youth with T1D; and 14.15% (213/1505) and 12.7% (45/354) had prior DKA admissions (after the T1D diagnosis), respectively. For lists rank ordered by the probability of hospitalization, precision increased from 33% to 56% to 100% for positions 1 to 80, 1 to 25, and 1 to 10 in the OOS-P cohort and from 50% to 60% to 80% for positions 1 to 18, 1 to 10, and 1 to 5 in the OOS-F cohort, respectively. Conclusions The proposed LSTM model for predicting 180-day DKA-related hospitalization was valid in this sample. Future research should evaluate model validity in multiple populations and settings to account for health inequities that may be present in different segments of the population (eg, racially or socioeconomically diverse cohorts). Rank ordering youth by probability of DKA-related hospitalization will allow clinics to identify the most at-risk youth. The clinical implication of this is that clinics may then create and evaluate novel preventive interventions based on available resources.

[1]  Lindsay Wells,et al.  Explainable AI and Reinforcement Learning—A Systematic Review of Current Approaches and Trends , 2021, Frontiers in Artificial Intelligence.

[2]  C. Molony,et al.  Performance assessment of different machine learning approaches in predicting diabetic ketoacidosis in adults with type 1 diabetes using electronic health records data , 2021, Pharmacoepidemiology and drug safety.

[3]  P. White,et al.  Risk factors for hospitalization in youth with type 1 diabetes: Development and validation of a multivariable prediction model , 2020, Pediatric diabetes.

[4]  E. Di Ruggiero,et al.  Artificial intelligence for good health: a scoping review of the ethics literature , 2020, BMC medical ethics.

[5]  S. Garg,et al.  The Silver Lining to COVID-19: Avoiding Diabetic Ketoacidosis Admissions with Telehealth. , 2020, Diabetes technology & therapeutics.

[6]  Søren Brunak,et al.  Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. , 2020, The Lancet. Digital health.

[7]  L. Pyle,et al.  Diabetic Ketoacidosis at Diagnosis of Type 1 Diabetes in Colorado Children, 2010–2017 , 2019, Diabetes Care.

[8]  J. Fishbein,et al.  Diabetic Ketoacidosis at Onset of Type 1 Diabetes: Rates and Risk Factors Today to 15 Years Ago , 2019, Global pediatric health.

[9]  David M Maahs,et al.  State of Type 1 Diabetes Management and Outcomes from the T1D Exchange in 2016-2018. , 2019, Diabetes technology & therapeutics.

[10]  Jenna Wong,et al.  Using Machine Learning to Identify Health Outcomes from Electronic Health Record Data , 2018, Current Epidemiology Reports.

[11]  G. Umpierrez,et al.  Increasing Hospitalizations for DKA: A Need for Prevention Programs , 2018, Diabetes Care.

[12]  U. Schubart,et al.  Health Care Utilization and Burden of Diabetic Ketoacidosis in the U.S. Over the Past Decade: A Nationwide Analysis , 2018, Diabetes Care.

[13]  Amit Shah,et al.  Partnering with Insurers in Caring for the Most Vulnerable Youth with Diabetes: NICH as an Integrator , 2017, Current Diabetes Reports.

[14]  David V. Wagner,et al.  NICH at Its Best for Diabetes at Its Worst: Texting Teens and Their Caregivers for Better Outcomes , 2017, Journal of diabetes science and technology.

[15]  Steven J. Choi,et al.  Pediatric Type 1 Diabetes: Reducing Admission Rates for Diabetes Ketoacidosis , 2016, Quality management in health care.

[16]  W. Ghali,et al.  Temporal variation of diabetic ketoacidosis and hypoglycemia in adults with type 1 diabetes: A nationwide cohort study , 2016, Journal of diabetes.

[17]  Darrell M. Wilson,et al.  Outpatient Care Preceding Hospitalization for Diabetic Ketoacidosis , 2016, Pediatrics.

[18]  G. Umpierrez,et al.  Diabetic emergencies — ketoacidosis, hyperglycaemic hyperosmolar state and hypoglycaemia , 2016, Nature Reviews Endocrinology.

[19]  S. Shilo,et al.  Diabetic Ketoacidosis in the Pediatric Population with Type 1 Diabetes , 2015 .

[20]  R. Beck,et al.  Rates of Diabetic Ketoacidosis: International Comparison With 49,859 Pediatric Patients With Type 1 Diabetes From England, Wales, the U.S., Austria, and Germany , 2015, Diabetes Care.

[21]  David V. Wagner,et al.  Treating the Most Vulnerable and Costly in Diabetes , 2015, Current Diabetes Reports.

[22]  Christine A. Sinsky,et al.  From Triple to Quadruple Aim: Care of the Patient Requires Care of the Provider , 2014, The Annals of Family Medicine.

[23]  B. Anderson,et al.  A psychosocial risk index for poor glycemic control in children and adolescents with type 1 diabetes , 2014, Pediatric diabetes.

[24]  R. Beck,et al.  Severe hypoglycemia and diabetic ketoacidosis among youth with type 1 diabetes in the T1D Exchange clinic registry , 2013, Pediatric diabetes.

[25]  Rajendu Srivastava,et al.  Variation in Resource Use and Readmission for Diabetic Ketoacidosis in Children’s Hospitals , 2013, Pediatrics.

[26]  Kellee M Miller,et al.  The T1D Exchange clinic registry. , 2012, The Journal of clinical endocrinology and metabolism.

[27]  J. Rosenbauer,et al.  Predictors of diabetic ketoacidosis in children and adolescents with type 1 diabetes. Experience from a large multicentre database , 2011, Pediatric diabetes.

[28]  R. Hanås,et al.  A 2‐yr national population study of pediatric ketoacidosis in Sweden: predisposing conditions and insulin pump use , 2009, Pediatric diabetes.

[29]  Horst Rinne,et al.  The Weibull Distribution: A Handbook , 2008 .

[30]  M. Rewers,et al.  Predictors of acute complications in children with type 1 diabetes. , 2002, JAMA.

[31]  D A Butler,et al.  Predictors of glycemic control and short-term adverse outcomes in youth with type 1 diabetes. , 2001, The Journal of pediatrics.

[32]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[33]  Gregory D. Hager,et al.  Deep learning: RNNs and LSTM , 2020 .

[34]  Egil Martinsson,et al.  WTTE-RNN : Weibull Time To Event Recurrent Neural Network A model for sequential prediction of time-to-event in the case of discrete or continuous censored data, recurrent events or time-varying covariates , 2017 .

[35]  David V. Wagner,et al.  Novel Interventions in Children's Health Care (NICH): Innovative treatment for youth with complex medical conditions , 2013 .

[36]  LeighAnne Olsen,et al.  The Learning Healthcare System , 2007 .