Glucose level control using Temporal Difference methods

Control theory has been widely used in various fields; one of these areas is medical issues. Diabetes is one of the new topics of interest in control. Obtaining the rates for the injection of insulin automatically always been a concern of physicians. The purpose of the control and treatment of diabetes, is keeping blood glucose in the normal range as possible. In this paper, we used Sarsa method - which is an on-policy Temporal Difference (TD) technique - for insulin delivery rate. TD methods are the most known methods for solving reinforcement learning problem. Because TD methods don't require a precise model of environment dynamics; they have absorbed interests in medical applications during recent years. Although temporal difference methods don't require a mathematical model of the environment, but for simulating an environment, we used Palumbo mathematical model instead of real patients. Since patients' medical parameters vary from person to person, for controlling the disease we should have different drug schedules, in other word, we should have different controller for each patient. While RL methods, by interacting with their environment, automatically define suitable doses for each person. If we want less trial and error on real patients and therefore reduce the side effects of changes in dose on the patient; according to the parameters of a patient, we design a controller which estimate the appropriate insulin injection rate. Then the drug program can be applied to other real patients. At this stage controller (applies Sarsa algorithm) with less trial and error, determines the appropriate dose for real patient. The results of the simulations, represents the efficiency of the proposed method.

[1]  W. Kenneth Ward,et al.  Automated Control of an Adaptive Bihormonal, Dual-Sensor Artificial Pancreas and Evaluation During Inpatient Studies , 2014, IEEE Transactions on Biomedical Engineering.

[2]  Pierdomenico Pepe,et al.  Robust closed-loop control of plasma glycemia: A discrete-delay model approach , 2009 .

[3]  M. Morari,et al.  Closed-Loop Control of Blood Glucose , 2007 .

[4]  P. Aslani,et al.  A Systematic Review of Interventions Addressing Adherence to Anti-Diabetic Medications in Patients with Type 2 Diabetes—Impact on Adherence , 2015, PloS one.

[5]  A. Kitabchi,et al.  Comparison of the effectiveness of various routes of insulin injection: insulin levels and glucose response in normal subjects. , 1976, The Journal of clinical endocrinology and metabolism.

[6]  Pasquale Palumbo,et al.  Qualitative behaviour of a family of delay-differential models of the glucose-insulin system , 2006 .

[7]  L. Perry,et al.  Prevalence of vascular complications and factors predictive of their development in young adults with type 1 diabetes: systematic literature review , 2014, BMC Research Notes.

[8]  Josep Vehí,et al.  Adaptive Calibration Algorithm for Plasma Glucose Estimation in Continuous Glucose Monitoring , 2013, IEEE Journal of Biomedical and Health Informatics.

[9]  B. Olsen,et al.  Sick day management in children and adolescents with diabetes , 2007, Pediatric diabetes.

[10]  B. Olsen,et al.  Sick day management in children and adolescents with diabetes , 2014 .

[11]  Lilia de Souza Nogueira,et al.  [The influence of quality of life in treatment adherence of diabetic patients: a systematic review]. , 2015, Revista da Escola de Enfermagem da U S P.

[12]  Pierdomenico Pepe,et al.  Robust closed-loop control of plasma glycemia: A discrete-delay model approach , 2008, 2008 47th IEEE Conference on Decision and Control.

[13]  K. Barnard,et al.  Fear of hypoglycaemia in parents of young children with type 1 diabetes: a systematic review , 2010, BMC pediatrics.

[14]  B. Zinman,et al.  Alternative routes of insulin delivery , 2003, Diabetic medicine : a journal of the British Diabetic Association.

[15]  Qian Wang,et al.  Developing personalized empirical models for Type-I diabetes: An extended Kalman filter approach , 2013, 2013 American Control Conference.

[16]  Ali Cinar,et al.  Adaptive multivariable closed-loop control of blood glucose concentration in patients with Type 1 Diabetes , 2013, 2013 American Control Conference.

[17]  D. Muchmore,et al.  Inhaled insulin delivery – where are we now? , 2006, Diabetes, obesity & metabolism.

[18]  D. Sherr,et al.  Diabetes self-management education for adults with type 2 diabetes mellitus: A systematic review of the effect on glycemic control. , 2016, Patient education and counseling.

[19]  T. Kohlmann,et al.  Treatment satisfaction and quality-of-life between type 2 diabetes patients initiating long- vs. intermediate-acting basal insulin therapy in combination with oral hypoglycemic agents – a randomized, prospective, crossover, open clinical trial , 2015, Health and Quality of Life Outcomes.

[20]  Stavroula G. Mougiakakou,et al.  Personalized tuning of a reinforcement learning control algorithm for glucose regulation , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[22]  Ali Karimpour,et al.  Agent-based Simulation for Blood Glucose Control in Diabetic Patients , 2009 .

[23]  Hiok Chai Quek,et al.  Model Reference Adaptive Control on Glucose for the Treatment of Diabetes Mellitus , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[24]  Giovanni Sparacino,et al.  Diabetes: Models, Signals, and Control , 2009 .

[25]  Diederick Grobbee,et al.  Intensive blood glucose control and vascular outcomes in patients with type 2 diabetes. , 2008, The New England journal of medicine.

[26]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[27]  L. Bouter,et al.  Randomized Study of Two Different Target Levels of Glycemic Control Within the Acceptable Range in Type 2 Diabetes: Effects on well-being at 1 year , 1998, Diabetes Care.

[28]  Panagiotis D. Christofides,et al.  Improved postprandial glucose control with a customized Model Predictive Controller , 2015, 2015 American Control Conference (ACC).