Personalized Multimorbidity Management for Patients with Type 2 Diabetes Using Reinforcement Learning of Electronic Health Records

Comorbid chronic conditions are common among people with type 2 diabetes. We developed an Artificial Intelligence algorithm, based on Reinforcement Learning (RL), for personalized diabetes and multi-morbidity management with strong potential to improve health outcomes relative to current clinical practice. In this paper, we modeled glycemia, blood pressure and cardiovascular disease (CVD) risk as health outcomes using a retrospective cohort of 16,665 patients with type 2 diabetes from New York University Langone Health ambulatory care electronic health records in 2009 to 2017. We trained a RL prescription algorithm that recommends a treatment regimen optimizing patients' cumulative health outcomes using their individual characteristics and medical history at each encounter. The RL recommendations were evaluated on an independent subset of patients. The results demonstrate that the proposed personalized reinforcement learning prescriptive framework for type 2 diabetes yielded high concordance with clinicians' prescriptions and substantial improvements in glycemia, blood pressure, cardiovascular disease risk outcomes.

[1]  Eve A Kerr,et al.  The impact of comorbid chronic conditions on diabetes care. , 2006, Diabetes care.

[2]  A. Khera,et al.  2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease. , 2019, Circulation.

[3]  Martin Fortin,et al.  Managing patients with multimorbidity: systematic review of interventions in primary care and community settings , 2012, BMJ : British Medical Journal.

[4]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[5]  Ying Daisy Zhuo,et al.  Personalized Diabetes Management Using Electronic Medical Records , 2016, Diabetes Care.

[6]  M. Kosorok,et al.  Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer , 2011, Biometrics.

[7]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[8]  S. Wyke,et al.  Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study , 2012, The Lancet.

[9]  Y. Yao,et al.  On Early Stopping in Gradient Descent Learning , 2007 .

[10]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[11]  Adela Castelló,et al.  Principal components analysis in clinical studies. , 2017, Annals of translational medicine.

[12]  Aldo A. Faisal,et al.  The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care , 2018, Nature Medicine.

[13]  Maja J. Mataric,et al.  Reward Functions for Accelerated Learning , 1994, ICML.

[14]  Scott M. Lundberg,et al.  Explainable machine-learning predictions for the prevention of hypoxaemia during surgery , 2018, Nature Biomedical Engineering.

[15]  Dimitris Bertsimas,et al.  Optimal Prescriptive Trees , 2019, INFORMS J. Optim..

[16]  John R. Petrie,et al.  Diabetes, Hypertension, and Cardiovascular Disease: Clinical Insights and Vascular Mechanisms , 2017, The Canadian journal of cardiology.

[17]  A. Khera,et al.  2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. , 2019, Journal of the American College of Cardiology.

[18]  C. Salisbury,et al.  Measures of Multimorbidity and Morbidity Burden for Use in Primary Care and Community Settings: A Systematic Review and Guide , 2012, The Annals of Family Medicine.

[19]  Fred Godtliebsen,et al.  Reinforcement learning application in diabetes blood glucose control: A systematic review , 2020, Artif. Intell. Medicine.

[20]  Louis Wehenkel,et al.  Clinical data based optimal STI strategies for HIV: a reinforcement learning approach , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[21]  S. Mercer,et al.  Challenges of managing people with multimorbidity in today’s healthcare systems , 2015, BMC Family Practice.

[22]  C. Whitty,et al.  Rising to the challenge of multimorbidity , 2020, BMJ.

[23]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[24]  C. Blumberg Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[25]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[26]  L. Bouter,et al.  How to measure comorbidity. a critical review of available methods. , 2003, Journal of clinical epidemiology.

[27]  A. Kiss,et al.  Eligibility criteria of randomized controlled trials published in high-impact general medical journals: a systematic sampling review. , 2007, JAMA.

[28]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[29]  José David Martín-Guerrero,et al.  Optimization of anemia treatment in hemodialysis patients via reinforcement learning , 2014, Artif. Intell. Medicine.

[30]  P Whelton,et al.  Prevalence of hypertension in the US adult population. Results from the Third National Health and Nutrition Examination Survey, 1988-1991. , 1995, Hypertension.

[31]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[32]  M. Pencina,et al.  General Cardiovascular Risk Profile for Use in Primary Care: The Framingham Heart Study , 2008, Circulation.

[33]  H A Pincus,et al.  Comparing the national economic burden of five chronic conditions. , 2001, Health affairs.

[34]  Inger Ekman,et al.  Medication adherence: a call for action. , 2011, American heart journal.