The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas

Blood glucose control, for example, in diabetes mellitus or severe illness, requires strict adherence to a protocol of food, insulin administration and exercise personalized to each patient. An artificial pancreas for automated treatment could boost quality of glucose control and patients’ independence. The components required for an artificial pancreas are: i) continuous glucose monitoring (CGM), ii) smart controllers and iii) insulin pumps delivering the optimal amount of insulin. In recent years, medical devices for CGM and insulin administration have undergone rapid progression and are now commercially available. Yet, clinically available devices still require regular patients’ or caregivers’ attention as they operate in open-loop control with frequent user intervention. Dosage-calculating algorithms are currently being studied in intensive care patients 1, for short overnight control to supplement conventional insulin delivery 2, and for short periods where patients rest and follow a prescribed food regime 3. Fully automated algorithms that can respond to the varying activity levels seen in outpatients, with unpredictable and unreported food intake, and which provide the necessary personalized control for individuals is currently beyond the state-of-the-art. Here, we review and discuss reinforcement learning algorithms, controlling insulin in a closed-loop to provide individual insulin dosing regimens that are reactive to the immediate needs of the patient.

[1]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[2]  M. Castelli,et al.  Oral Insulin: A Comparison With Subcutaneous Regular Human Insulin in Patients With Type 2 Diabetes , 2010, Diabetes Care.

[3]  D. Dunger,et al.  Closed-loop insulin delivery for treatment of type 1 diabetes , 2011, BMC medicine.

[4]  Marco Forgione,et al.  Run-to-Run Tuning of Model Predictive Control for Type 1 Diabetes Subjects: In Silico Trial , 2009, Journal of diabetes science and technology.

[5]  Darrell M. Wilson,et al.  A Closed-Loop Artificial Pancreas Using Model Predictive Control and a Sliding Meal Size Estimator , 2009, Journal of diabetes science and technology.

[6]  Moshe Phillip,et al.  Feasibility study of automated overnight closed-loop glucose control under MD-logic artificial pancreas in patients with type 1 diabetes: the DREAM Project. , 2012, Diabetes technology & therapeutics.

[7]  M. Araki,et al.  Novel Control System for Blood Glucose Using a Model Predictive Method , 2000, ASAIO journal.

[8]  G. Steil,et al.  Closed-loop insulin delivery-the path to physiological glucose control. , 2004, Advanced drug delivery reviews.

[9]  Malgorzata E. Wilinska,et al.  Overnight Closed-Loop Insulin Delivery with Model Predictive Control: Assessment of Hypoglycemia and Hyperglycemia Risk Using Simulation Studies , 2009, Journal of diabetes science and technology.

[10]  R. Potts,et al.  Physiological differences between interstitial glucose and blood glucose measured in human subjects. , 2003, Diabetes care.

[11]  Joelle Pineau,et al.  Treating Epilepsy via Adaptive Neurostimulation: a Reinforcement Learning Approach , 2009, Int. J. Neural Syst..

[12]  G. Steil,et al.  Evaluation of the Effect of Gain on the Meal Response of an Automated Closed-Loop Insulin Delivery System , 2006, Diabetes.

[13]  Stavroula G. Mougiakakou,et al.  An Actor-Critic based controller for glucose regulation in type 1 diabetes , 2013, Comput. Methods Programs Biomed..

[14]  Min Gul Kim,et al.  The Correlation and Accuracy of Glucose Levels between Interstitial Fluid and Venous Plasma by Continuous Glucose Monitoring System , 2010, Korean diabetes journal.

[15]  D. Dazzi,et al.  The control of blood glucose in the critical diabetic patient: a neuro-fuzzy method. , 2001, Journal of diabetes and its complications.

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17]  P. Jansson,et al.  A microdialysis method allowing characterization of intercellular water space in humans. , 1987, The American journal of physiology.

[18]  Santhisagar Vaddiraju,et al.  Technologies for Continuous Glucose Monitoring: Current Problems and Future Promises , 2010, Journal of diabetes science and technology.

[19]  A H Clemens,et al.  The development of Biostator, a Glucose Controlled Insulin Infusion System (GCIIS). , 1977, Hormone and metabolic research = Hormon- und Stoffwechselforschung = Hormones et metabolisme.

[20]  G. S. Wilson,et al.  Interstitial glucose concentration and glycemia: implications for continuous subcutaneous glucose monitoring. , 2000, American journal of physiology. Endocrinology and metabolism.

[21]  Adam E. Gaweda,et al.  Model Predictive Control with Reinforcement Learning for Drug Delivery in Renal Anemia Management , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[22]  S. Genuth,et al.  The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. , 1993, The New England journal of medicine.

[23]  Anirban Roy,et al.  Effect of pramlintide on prandial glycemic excursions during closed-loop control in adolescents and young adults with type 1 diabetes , 2014 .

[24]  Ali Karimpour,et al.  Agent-based Simulation for Blood Glucose Control in Diabetic Patients , 2009 .

[25]  Eyal Dassau,et al.  Pilot Studies of Wearable Outpatient Artificial Pancreas in Type 1 Diabetes , 2012, Diabetes Care.

[26]  Gastón Schlotthauer,et al.  Modeling, identification and nonlinear model predictive control of type I diabetic patient. , 2006, Medical Engineering and Physics.

[27]  Malgorzata E. Wilinska,et al.  Intensive insulin therapy: enhanced Model Predictive Control algorithm versus standard care , 2008, Intensive Care Medicine.

[28]  Pierre Geurts,et al.  Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[29]  L. Macconell,et al.  Safety and tolerability of exenatide twice daily in patients with type 2 diabetes: integrated analysis of 5594 patients from 19 placebo-controlled and comparator-controlled clinical trials , 2012, Diabetes, metabolic syndrome and obesity : targets and therapy.

[30]  Saadet Ulas Acikgoz,et al.  Blood glucose regulation with stochastic optimal control for insulin-dependent diabetic patients , 2008 .

[31]  Eyal Dassau,et al.  Safety Constraints in an Artificial Pancreatic β Cell: An Implementation of Model Predictive Control with Insulin on Board , 2009, Journal of diabetes science and technology.

[32]  C. Best,et al.  Observations with Insulin on Department of Soldiers' Civil Re-Establishment Diabetics. , 1923, Canadian Medical Association journal.

[33]  R. Hovorka,et al.  Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. , 2004, Physiological measurement.

[34]  W R Campbell,et al.  Further Clinical Experience with Insulin (Pancreatic Extracts) in the Treatment of Diabetes Mellitus , 1923, British medical journal.

[35]  A. Doufas,et al.  Reinforcement Learning Versus Proportional–Integral–Derivative Control of Hypnosis in a Simulated Intraoperative Patient , 2011, Anesthesia and analgesia.

[36]  R. DeFronzo,et al.  Effect of loss of first-phase insulin secretion on hepatic glucose production and tissue glucose disposal in humans. , 1989, The American journal of physiology.

[37]  Malgorzata E. Wilinska,et al.  Tight glycaemic control by an automated algorithm with time-variant sampling in medical ICU patients , 2008, Intensive Care Medicine.

[38]  Eyal Dassau,et al.  Closed-Loop Control of Artificial Pancreatic $\beta$ -Cell in Type 1 Diabetes Mellitus Using Model Predictive Iterative Learning Control , 2010, IEEE Transactions on Biomedical Engineering.

[39]  G. Steil,et al.  Use of Subcutaneous Interstitial Fluid Glucose to Estimate Blood Glucose: Revisiting Delay and Sensor Offset , 2010, Journal of diabetes science and technology.

[40]  U. Pennsylvania,et al.  Clinical and Laboratory Standards Institute , 2019, Springer Reference Medizin.

[41]  Roman Hovorka,et al.  Closed-loop insulin delivery: towards improved diabetes care. , 2012, Discovery medicine.

[42]  S. Nikfar,et al.  The efficacy and tolerability of exenatide in comparison to placebo; a systematic review and meta-analysis of randomized clinical trials. , 2011, Journal of pharmacy & pharmaceutical sciences : a publication of the Canadian Society for Pharmaceutical Sciences, Societe canadienne des sciences pharmaceutiques.

[43]  Rodrigo E Teixeira,et al.  The Next Generation of Artificial Pancreas Control Algorithms , 2008, Journal of diabetes science and technology.

[44]  R. Mazze,et al.  Evaluating the accuracy, reliability, and clinical applicability of continuous glucose monitoring (CGM): Is CGM ready for real time? , 2009, Diabetes technology & therapeutics.

[45]  David Hamilton,et al.  Blood Glucose Prediction Using Artificial Neural Networks Trained with the AIDA Diabetes Simulator: A Proof-of-Concept Pilot Study , 2011, J. Electr. Comput. Eng..

[46]  E. Otto,et al.  An intelligent diabetes software prototype: predicting blood glucose levels and recommending regimen changes. , 2000, Diabetes technology & therapeutics.

[47]  S. Patek,et al.  Closed-Loop Artificial Pancreas Using Subcutaneous Glucose Sensing and Insulin Delivery and a Model Predictive Control Algorithm: The Virginia Experience , 2009, Journal of diabetes science and technology.

[48]  J. Leahy,et al.  Fully Automated Closed-Loop Insulin Delivery Versus Semiautomated Hybrid Control in Pediatric Patients With Type 1 Diabetes Using an Artificial Pancreas , 2008 .

[49]  Robert G. Sutherlin,et al.  A Bihormonal Closed-Loop Artificial Pancreas for Type 1 Diabetes , 2010, Science Translational Medicine.

[50]  Lutz Heinemann,et al.  Continuous glucose monitoring by means of the microdialysis technique: underlying fundamental aspects. , 2003, Diabetes technology & therapeutics.

[51]  Norman Fleischer,et al.  The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. The Diabetes Control and Complications Trial Research Group. , 1993 .

[52]  C. Cobelli,et al.  Artificial Pancreas: Past, Present, Future , 2011, Diabetes.

[53]  Bruce Buckingham,et al.  Toward closing the loop: an update on insulin pumps and continuous glucose monitoring systems. , 2010, Endocrinology and metabolism clinics of North America.

[54]  Carl E. Rasmussen,et al.  PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.

[55]  Howard Zisser,et al.  Glucose Estimation and Prediction through Meal Responses Using Ambulatory Subject Data for Advisory Mode Model Predictive Control , 2007, Journal of diabetes science and technology.

[56]  Hossein Afarideh,et al.  Journal of Pharmacy & Pharmaceutical Sciences A Publication of the Canadian Society for Pharmaceutical Sciences Société canadienne des sciences pharmaceutiques , 2000 .

[57]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[58]  E. Atlas,et al.  MD-Logic Artificial Pancreas System: A Pilot Study in Adults with Type 1 Diabetes Mellitus Running Title: Closed-Loop System In Type 1 Diabetes , 2010 .

[59]  Louis Wehenkel,et al.  Clinical data based optimal STI strategies for HIV: a reinforcement learning approach , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[60]  Akira Matsumoto,et al.  Toward a Continuous Intravascular Glucose Monitoring System , 2010, Sensors.

[61]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[62]  W. Kenneth Ward,et al.  Novel Use of Glucagon in a Closed-Loop System for Prevention of Hypoglycemia in Type 1 Diabetes , 2010, Diabetes Care.

[63]  P Wach,et al.  Simulation studies on neural predictive control of glucose using the subcutaneous route. , 1998, Computer methods and programs in biomedicine.

[64]  Bruce Buckingham,et al.  Glucose control in pediatric intensive care unit patients using an insulin-glucose algorithm. , 2007, Diabetes technology & therapeutics.

[65]  A. Gloyn,et al.  Glucose in the ICU — Evidence, Guidelines, and Outcomes , 2012 .

[66]  B. Thorens Sensing of glucose in the brain. , 2012, Handbook of experimental pharmacology.

[67]  Roman Hovorka,et al.  Blood glucose control by a model predictive control algorithm with variable sampling rate versus a routine glucose management protocol in cardiac surgery patients: a randomized controlled trial. , 2007, The Journal of clinical endocrinology and metabolism.

[68]  Janet M. Allen,et al.  Overnight closed loop insulin delivery (artificial pancreas) in adults with type 1 diabetes: crossover randomised controlled studies , 2011, BMJ : British Medical Journal.

[69]  L. Schaupp,et al.  On‐line adaptive algorithm with glucose prediction capacity for subcutaneous closed loop control of glucose: evaluation under fasting conditions in patients with Type 1 diabetes , 2006, Diabetic medicine : a journal of the British Diabetic Association.

[70]  Robert S. Parker,et al.  Mixed Meal Modeling and Disturbance Rejection in Type I Diabetic Patients , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[71]  K. Kriegstein,et al.  Inhaled insulin for diabetes mellitus. , 2007 .

[72]  Konstantina S. Nikita,et al.  An Insulin Infusion Advisory System Based on Autotuning Nonlinear Model-Predictive Control , 2011, IEEE Transactions on Biomedical Engineering.

[73]  G. Gogou,et al.  A Neural Network Approach in Diabetes Management by Insulin Administration , 2001, Journal of Medical Systems.

[74]  L. Magni,et al.  Closed-Loop Artificial Pancreas Using Subcutaneous Glucose Sensing and Insulin Delivery and a Model Predictive Control Algorithm: Preliminary Studies in Padova and Montpellier , 2009, Journal of diabetes science and technology.

[75]  D. D’Alessio,et al.  Glucagon-like peptide 1: evolution of an incretin into a treatment for diabetes. , 2004, American journal of physiology. Endocrinology and metabolism.

[76]  E. Atlas,et al.  MD-Logic Artificial Pancreas System , 2010, Diabetes Care.

[77]  W. Tamborlane,et al.  A tale of two compartments: interstitial versus blood glucose monitoring. , 2009, Diabetes technology & therapeutics.

[78]  G. Steil,et al.  Subcutaneous glucose predicts plasma glucose independent of insulin: implications for continuous monitoring. , 1999, The American journal of physiology.

[79]  R. Seeley,et al.  Wired on sugar: the role of the CNS in the regulation of glucose homeostasis , 2012, Nature Reviews Neuroscience.

[80]  R. Bellomo,et al.  Glycemic control in the ICU. , 2011, Chest.

[81]  G. McMahon,et al.  Inhaled insulin for diabetes mellitus. , 2007, The New England journal of medicine.

[82]  V. Basevi Standards of Medical Care in Diabetes—2011 , 2011, Diabetes Care.

[83]  Ahmad Haidar,et al.  Closed-Loop Insulin Delivery During Pregnancy Complicated by Type 1 Diabetes , 2011, Diabetes Care.

[84]  F. Doyle,et al.  Detection of a Meal Using Continuous Glucose Monitoring , 2008, Diabetes Care.

[85]  J. Mastrototaro,et al.  The MiniMed continuous glucose monitoring system. , 2000, Diabetes technology & therapeutics.

[86]  M. S. Kirkman,et al.  Comment on: American Diabetes Association. Standards of Medical Care in Diabetes—2011. Diabetes Care 2011;34(Suppl. 1):S11–S61 , 2011 .

[87]  B. Thorens Brain glucose sensing and neural regulation of insulin and glucagon secretion , 2011, Diabetes, obesity & metabolism.

[88]  Andrey V. Savkin,et al.  Optimal H∞ insulin injection control for blood glucose regulation in diabetic patients , 2005, IEEE Trans. Biomed. Eng..

[89]  R. Bergman,et al.  Physiologic evaluation of factors controlling glucose tolerance in man: measurement of insulin sensitivity and beta-cell glucose sensitivity from the response to intravenous glucose. , 1981, The Journal of clinical investigation.

[90]  K S Nikita,et al.  A neural network approach for insulin regime and dose adjustment in type 1 diabetes. , 2000, Diabetes technology & therapeutics.

[91]  A. El-Jabali Neural network modeling and control of type 1 diabetes mellitus , 2005, Bioprocess and biosystems engineering.

[92]  Anirban Roy,et al.  The effect of insulin feedback on closed loop glucose control. , 2011, The Journal of clinical endocrinology and metabolism.

[93]  Roman Hovorka,et al.  Evaluation of a portable ambulatory prototype for automated overnight closed‐loop insulin delivery in young people with type 1 diabetes , 2012, Pediatric diabetes.

[94]  J. Mastrototaro,et al.  The MiniMed Continuous Glucose Monitoring System (CGMS). , 1999, Journal of pediatric endocrinology & metabolism : JPEM.

[95]  P. Aadahl,et al.  Continuous Measurement of Blood Glucose: Validation of a New Intravascular Sensor , 2011, Anesthesiology.

[96]  Nicholas A Peppas,et al.  The future of open‐ and closed‐loop insulin delivery systems , 2008, The Journal of pharmacy and pharmacology.

[97]  J. Rungby,et al.  Amylin agonists: a novel approach in the treatment of diabetes. , 2004, Diabetes.

[98]  M. Haymond,et al.  Twenty-four-hour simultaneous subcutaneous Basal-bolus administration of insulin and amylin in adolescents with type 1 diabetes decreases postprandial hyperglycemia. , 2009, The Journal of clinical endocrinology and metabolism.

[99]  Israel Hartman Insulin Analogs: Impact on Treatment Success, Satisfaction, Quality of Life, and Adherence , 2008, Clinical Medicine & Research.

[100]  Akbar K Waljee,et al.  Machine Learning in Medicine: A Primer for Physicians , 2010, The American Journal of Gastroenterology.

[101]  Howard C. Zisser,et al.  Fully Integrated Artificial Pancreas in Type 1 Diabetes , 2012, Diabetes.

[102]  R. Hovorka,et al.  Comparison of Three Protocols for Tight Glycemic Control in Cardiac Surgery Patients , 2009, Diabetes Care.

[103]  Eric Renard,et al.  Insulin Delivery Route for the Artificial Pancreas: Subcutaneous, Intraperitoneal, or Intravenous? Pros and Cons , 2008, Journal of diabetes science and technology.

[104]  Roman Hovorka,et al.  Automated overnight closed-loop glucose control in young children with type 1 diabetes. , 2011, Diabetes technology & therapeutics.

[105]  E. Adeghate,et al.  Amylin Analogues in the Treatment of Diabetes Mellitus: Medicinal Chemistry and Structural Basis of its Function , 2011, The open medicinal chemistry journal.

[106]  Larry D. Pyeatt,et al.  Reinforcement Learning: A Novel Method for Optimal Control of Propofol-Induced Hypnosis , 2011, Anesthesia and analgesia.

[107]  D. B. Keenan,et al.  Delays in Minimally Invasive Continuous Glucose Monitoring Devices: A Review of Current Technology , 2009, Journal of diabetes science and technology.

[108]  C. C. Palerm,et al.  Closed-Loop Insulin Delivery Using a Subcutaneous Glucose Sensor and Intraperitoneal Insulin Delivery , 2009, Diabetes Care.

[109]  Eyal Dassau,et al.  Zone Model Predictive Control: A Strategy to Minimize Hyper- and Hypoglycemic Events , 2010, Journal of diabetes science and technology.