Reinforcement learning application in diabetes blood glucose control: A systematic review

BACKGROUND Reinforcement learning (RL) is a computational approach to understanding and automating goal-directed learning and decision-making. It is designed for problems which include a learning agent interacting with its environment to achieve a goal. For example, blood glucose (BG) control in diabetes mellitus (DM), where the learning agent and its environment are the controller and the body of the patient respectively. RL algorithms could be used to design a fully closed-loop controller, providing a truly personalized insulin dosage regimen based exclusively on the patient's own data. OBJECTIVE In this review we aim to evaluate state-of-the-art RL approaches to designing BG control algorithms in DM patients, reporting successfully implemented RL algorithms in closed-loop, insulin infusion, decision support and personalized feedback in the context of DM. METHODS An exhaustive literature search was performed using different online databases, analyzing the literature from 1990 to 2019. In a first stage, a set of selection criteria were established in order to select the most relevant papers according to the title, keywords and abstract. Research questions were established and answered in a second stage, using the information extracted from the articles selected during the preliminary selection. RESULTS The initial search using title, keywords, and abstracts resulted in a total of 404 articles. After removal of duplicates from the record, 347 articles remained. An independent analysis and screening of the records against our inclusion and exclusion criteria defined in Methods section resulted in removal of 296 articles, leaving 51 relevant articles. A full-text assessment was conducted on the remaining relevant articles, which resulted in 29 relevant articles that were critically analyzed. The inter-rater agreement was measured using Cohen Kappa test, and disagreements were resolved through discussion. CONCLUSIONS The advances in health technologies and mobile devices have facilitated the implementation of RL algorithms for optimal glycemic regulation in diabetes. However, there exists few articles in the literature focused on the application of these algorithms to the BG regulation problem. Moreover, such algorithms are designed for control tasks as BG adjustment and their use have increased recently in the diabetes research area, therefore we foresee RL algorithms will be used more frequently for BG control in the coming years. Furthermore, in the literature there is a lack of focus on aspects that influence BG level such as meal intakes and physical activity (PA), which should be included in the control problem. Finally, there exists a need to perform clinical validation of the algorithms.

[1]  Atul Malhotra,et al.  Transfer Entropy Estimation and Directional Coupling Change Detection in Biomedical Time Series , 2012, Biomedical engineering online.

[2]  Peter Szolovits,et al.  Representation and Reinforcement Learning for Personalized Glycemic Control in Septic Patients , 2017, ArXiv.

[3]  Stavroula G. Mougiakakou,et al.  An Actor-Critic based controller for glucose regulation in type 1 diabetes , 2013, Comput. Methods Programs Biomed..

[4]  J. Vehí,et al.  Artificial Intelligence for Diabetes Management and Decision Support: Literature Review , 2018, Journal of medical Internet research.

[5]  F. Doyle,et al.  Automatic bolus and adaptive basal algorithm for the artificial pancreatic β-cell. , 2010, Diabetes technology & therapeutics.

[6]  Javad Akbari Torkestani,et al.  A learning automata-based blood glucose regulation mechanism in type 2 diabetes , 2014 .

[7]  Guolong Cai,et al.  A framework design for the mHealth system for self-management promotion. , 2015, Bio-medical materials and engineering.

[8]  R. Hovorka,et al.  Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. , 2004, Physiological measurement.

[9]  Mohammad Ali Sadrnia,et al.  Glucose level control using Temporal Difference methods , 2017, 2017 Iranian Conference on Electrical Engineering (ICEE).

[10]  C. Cobelli,et al.  The UVA/PADOVA Type 1 Diabetes Simulator , 2014, Journal of diabetes science and technology.

[11]  L. Magni,et al.  Evaluating the Efficacy of Closed-Loop Glucose Regulation via Control-Variability Grid Analysis , 2008, Journal of diabetes science and technology.

[12]  Fred Godtliebsen,et al.  CONTROLLING BLOOD GLUCOSE LEVELS IN PATIENTS WITH TYPE 1 DIABETES USING FITTED Q-ITERATIONS AND FUNCTIONAL FEATURES , 2018, 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP).

[13]  Boris Kovatchev,et al.  Statistical tools to analyze continuous glucose monitor data. , 2009, Diabetes technology & therapeutics.

[14]  Lutz Heinemann,et al.  Variability of insulin absorption and insulin action. , 2002, Diabetes technology & therapeutics.

[15]  Sasan Adibi,et al.  Mobile Health: A Technology Road Map , 2015 .

[16]  Nicholas A Peppas,et al.  The future of open‐ and closed‐loop insulin delivery systems , 2008, The Journal of pharmacy and pharmacology.

[17]  Luis Omar Ávila,et al.  An active inference approach to on-line agent monitoring in safety-critical systems , 2015, Adv. Eng. Informatics.

[18]  Mariano De Paula,et al.  On-line policy learning and adaptation for real-time personalization of an artificial pancreas , 2015, Expert Syst. Appl..

[19]  Josep Vehí,et al.  A review of personalized blood glucose prediction strategies for T1DM patients , 2017, International journal for numerical methods in biomedical engineering.

[20]  Marko V. Jankovic,et al.  Reinforcement Learning-Based Adaptive Insulin Advisor for Individuals with Type 1 Diabetes Patients under Multiple Daily Injections Therapy* , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21]  Michael R. Kosorok,et al.  Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning , 2016, Journal of the American Statistical Association.

[22]  Peter Diem,et al.  Adaptive Algorithms for Personalized Diabetes Treatment , 2014 .

[23]  S. Mougiakakou,et al.  Impact of Errors in Carbohydrate Estimation on Control of Blood Glucose in Type 1 Diabetes , 2018, 2018 14th Symposium on Neural Networks and Applications (NEUREL).

[24]  Ali Karimpour,et al.  Agent-based Simulation for Blood Glucose Control in Diabetic Patients , 2009 .

[25]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[26]  Tarunraj Singh,et al.  Blood glucose control algorithms for type 1 diabetic patients: A methodological review , 2013, Biomed. Signal Process. Control..

[27]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[28]  Marko V. Jankovic,et al.  A Dual Mode Adaptive Basal-Bolus Advisor Based on Reinforcement Learning , 2019, IEEE Journal of Biomedical and Health Informatics.

[29]  Fred Godtliebsen,et al.  Control of Blood Glucose for Type-1 Diabetes by Using Reinforcement Learning with Feedforward Algorithm , 2018, Comput. Math. Methods Medicine.

[30]  C S Hughes,et al.  Anticipating the next meal using meal behavioral profiles: A hybrid model-based stochastic predictive control algorithm for T1DM , 2011, Comput. Methods Programs Biomed..

[31]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control with an application to blood glucose regulation in type 1 diabetics , 2012, Proceedings of the 31st Chinese Control Conference.

[32]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[33]  E D Lehmann,et al.  A physiological model of glucose-insulin interaction in type 1 diabetes mellitus. , 1992, Journal of biomedical engineering.

[34]  David L Kaplan,et al.  Functional, RF‐Trilayer Sensors for Tooth‐Mounted, Wireless Monitoring of the Oral Cavity and Food Consumption , 2018, Advanced materials.

[35]  Roman Hovorka,et al.  Closed-loop insulin delivery: from bench to clinical practice , 2011, Nature Reviews Endocrinology.

[36]  G. Steil Algorithms for a Closed-Loop Artificial Pancreas: The Case for Proportional-Integral-Derivative Control , 2013, Journal of diabetes science and technology.

[37]  A. Aldo Faisal,et al.  The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas , 2013, Expert review of medical devices.

[38]  Pasquale Palumbo,et al.  Qualitative behaviour of a family of delay-differential models of the glucose-insulin system , 2006 .

[39]  Hava T. Siegelmann,et al.  Insulin Regimen ML-based control for T2DM patients , 2017, ArXiv.

[40]  Meysam Bastani,et al.  Model-Free Intelligent Diabetes Management Using Machine Learning , 2014 .

[41]  Roman Hovorka,et al.  Closed-loop insulin delivery: towards improved diabetes care. , 2012, Discovery medicine.

[42]  Mariano De Paula,et al.  Probabilistic optimal control of blood glucose under uncertainty , 2012 .

[43]  Mariano De Paula,et al.  Controlling blood glucose variability under uncertainty using reinforcement learning and Gaussian processes , 2015, Appl. Soft Comput..

[44]  Nuwan Waidyanatha,et al.  Towards a typology of integrated functional early warning systems , 2010, Int. J. Crit. Infrastructures.

[45]  Marios Anthimopoulos,et al.  Carbohydrate Estimation Supported by the GoCARB System in Individuals With Type 1 Diabetes: A Randomized Prospective Pilot Study , 2016, Diabetes Care.

[46]  Fred Godtliebsen,et al.  Reinforcement-learning optimal control for type-1 diabetes , 2018, 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[47]  R. Hovorka,et al.  Artificial pancreas treatment for outpatients with type 1 diabetes: systematic review and meta-analysis , 2018, British Medical Journal.

[48]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[49]  Dmitry Kangin,et al.  Combination of Supervised and Reinforcement Learning For Vision-Based Autonomous Control , 2018 .

[50]  R. Bergman Minimal Model: Perspective from 2005 , 2006, Hormone Research in Paediatrics.

[51]  I. Vlahavas,et al.  Machine Learning and Data Mining Methods in Diabetes Research , 2017, Computational and structural biotechnology journal.

[52]  Stavroula G. Mougiakakou,et al.  Personalized tuning of a reinforcement learning control algorithm for glucose regulation , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[53]  Stavroula G. Mougiakakou,et al.  Model-Free Machine Learning in Biomedicine: Feasibility Study in Type 1 Diabetes , 2016, PloS one.

[54]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[55]  Peter Diem,et al.  Preliminary results of a novel approach for glucose regulation using an Actor-Critic learning based controller , 2010 .

[56]  Claudio Cobelli,et al.  Meal Simulation Model of the Glucose-Insulin System , 2007, IEEE Transactions on Biomedical Engineering.

[57]  B P Kovatchev,et al.  Assessment of risk for severe hypoglycemia among adults with IDDM: validation of the low blood glucose index. , 1998, Diabetes care.

[58]  Perspectives on Continuous Glucose Monitoring Technology , 2018 .

[59]  Luis Omar Ávila,et al.  Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection , 2014, Expert Syst. Appl..

[60]  J. Wiens,et al.  Reinforcement Learning for Blood Glucose Control: Challenges and Opportunities , 2019 .