Towards efficient, personalized anesthesia using continuous reinforcement learning for propofol infusion control

We demonstrate the use of reinforcement learning algorithms for efficient and personalized control of patients' depth of general anesthesia during surgical procedures - an important aspect for Neurotechnology. We used the continuous actor-critic learning automaton technique, which was trained and tested in silico using published patient data, physiological simulation and the bispectral index (BIS) of patient EEG. Our two-stage technique learns first a generic effective control strategy based on average patient data (factory stage) and can then fine-tune itself to individual patients (personalization stage). The results showed that the reinforcement learner as compared to a bang-bang controller reduced the dose of the anesthetic agent administered by 9.4% and kept the patient closer to the target state, as measured by RMSE (4.90 compared to 8.47). It also kept the BIS error within a narrow, clinically acceptable range 93.9% of the time. Moreover, the policy was trained using only 50 simulated operations. Being able to learn a control strategy this quickly indicates that the reinforcement learner could also adapt regularly to a patient's changing responses throughout a live operation and facilitate the task of anesthesiologists by prompting them with recommended actions.

[1]  A. Hoeft,et al.  The Performance of a Target-Controlled Infusion of Propofol in Combination with Remifentanil: A Clinical Investigation with Two Propofol Formulations , 2006, Anesthesia and analgesia.

[2]  Anthony G. Doufas,et al.  Induction Speed Is Not a Determinant of Propofol Pharmacodynamics , 2004, Anesthesiology.

[3]  S. Shafer,et al.  The Influence of Method of Administration and Covariates on the Pharmacokinetics of Propofol in Adult Volunteers , 1998, Anesthesiology.

[4]  K. Leslie,et al.  For Personal Use. Only Reproduce with Permission from the Lancet , 2022 .

[5]  F. Guarracino,et al.  Target controlled infusion: TCI. , 2005, Minerva anestesiologica.

[6]  A. Absalom,et al.  Closed-loop Control of Anesthesia Using Bispectral Index: Performance Assessment in Patients Undergoing Major Orthopedic Surgery under Combined General and Regional Anesthesia , 2002, Anesthesiology.

[7]  A. Aldo Faisal,et al.  Noise in Neurons and Other Constraints , 2012 .

[8]  R. Mirakhur,et al.  Manual versus target‐controlled infusions of propofol , 2004, Anaesthesia.

[9]  A. Aldo Faisal,et al.  Stochastic Simulation of Neurons, Axons, and Action Potentials , 2009 .

[10]  Michel M R F Struys,et al.  Performance Evaluation of Two Published Closed-loop Control Systems Using Bispectral Index Monitoring: A Simulation Study , 2004, Anesthesiology.

[11]  A. Aldo Faisal,et al.  How reliable is the connectivity in cortical neural networks? , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[12]  A. Aldo Faisal,et al.  The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas , 2013, Expert review of medical devices.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  M.A. Wiering,et al.  Reinforcement Learning in Continuous Action Spaces , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[15]  A. Doufas,et al.  Reinforcement Learning Versus Proportional–Integral–Derivative Control of Hypnosis in a Simulated Intraoperative Patient , 2011, Anesthesia and analgesia.

[16]  M. Struys,et al.  Comparison of Closed-loop Controlled Administration of Propofol Using Bispectral Index as the Controlled Variable versus “Standard Practice” Controlled Administration , 2001, Anesthesiology.

[17]  Luc Barvais,et al.  Titration of Propofol for Anesthetic Induction and Maintenance Guided by the Bispectral Index: Closed-loop versus Manual Control: A Prospective, Randomized, Multicenter Study , 2006, Anesthesiology.

[18]  B. Orser,et al.  Awareness during anesthesia , 2008, Canadian Medical Association Journal.