Comparison of Some Suboptimal Control Policies in Medical Drug Therapy

In drug therapy, efficient dosage policies are needed to maintain drug concentrations at target. The relationship between the concentration of a drug and the dosages is often described by compartment models in which the parameters are unknown, although prior knowledge may be available and can be updated after blood samples are taken during the therapy. In this paper we define some tractable policies for adaptive control of drug concentrations in compartment models and compare their performances using computer simulation in a one-compartment model. We also discuss the effects of assuming normal priors, discrete approximation of a continuous prior, using nonquadratic costs, and information probing. From the simulation we derive intuition as to what types of policies perform well and address the topic of actively versus passively learning.

[1]  W. Lovejoy Myopic policies for some inventory models with uncertain demand distributions , 1990 .

[2]  James C. Scott,et al.  Testing computer-controlled infusion pumps by simulation. , 1988, Anesthesiology.

[3]  R W Jelliffe,et al.  Modeling, adaptive control, and optimal drug therapy. , 1990, Medical progress through technology.

[4]  D. Blackwell Comparison of Experiments , 1951 .

[5]  Averill M. Law,et al.  The art and theory of dynamic programming , 1977 .

[6]  W. Lovejoy A survey of algorithmic methods for partially observed Markov decision processes , 1991 .

[7]  William S. Lovejoy,et al.  Suboptimal Policies, with Bounds, for Parameter Adaptive Decision Processes , 1993, Oper. Res..

[8]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Stochastic Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[9]  D R Stanski,et al.  Bayesian Forecasting Improves the Prediction of Intraoperative Plasma Concentrations of Alfentanil , 1988, Anesthesiology.

[10]  G. Monahan State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .

[11]  K. M. vanHee,et al.  Bayesian control of Markov chains , 1978 .

[12]  L B Sheiner,et al.  Forecasting individual pharmacokinetics , 1979, Clinical pharmacology and therapeutics.

[13]  F. Follath,et al.  Rapid Prediction of Individual Dosage Requirements for Lignocaine , 1984, Clinical pharmacokinetics.

[14]  D R Stanski,et al.  Population pharmacokinetics of alfentanil: the average dose-plasma concentration relationship and interindividual variability in patients. , 1987, Anesthesiology.

[15]  A. Mallet A maximum likelihood estimation method for random coefficient regression models , 1986 .

[16]  S Vozeh,et al.  Computer‐Assisted Drug Assay Interpretation Based on Bayesian Estimation of Individual Pharmacokinetics: Application to Lidocaine , 1985, Therapeutic drug monitoring.