Approximate Dynamic Programming and Its Applications to the Design of Phase I Cancer Trials

Optimal design of a Phase I cancer trial can be formulated as a stochastic optimization problem. By making use of recent advances in approximate dynamic programming to tackle the problem, we de- velop an approximation of the Bayesian optimal design. The resulting design is a convex combination of a "treatment" design, such as Babb et al.'s (1998) escalation with overdose control, and a "learning" design, such as Haines et al.'s (2003) c-optimal design, thus directly address- ing the treatment versus experimentation dilemma inherent in Phase I trials and providing a simple and intuitive design for clinical use. Com- putational details are given and the proposed design is compared to existing designs in a simulation study. The design can also be readily modified to include a first stage that cautiously escalates doses similarly to traditional nonparametric step-up/down schemes, while validating the Bayesian parametric model for the efficient model-based design in the second stage.

[1]  D. D. Hoff,et al.  Response rates, duration of response, and dose response effects in phase I studies of antineoplastics , 1991, Investigational New Drugs.

[2]  J. Sacks Asymptotic Distribution of Stochastic Approximation Procedures , 1958 .

[3]  W. J. Studden,et al.  Theory Of Optimal Experiments , 1972 .

[4]  J. Babb,et al.  Flexible Bayesian methods for cancer phase I clinical trials. Dose escalation with overdose control , 2005, Statistics in medicine.

[5]  Tze Leung Lai,et al.  Approximate Policy Optimization and Adaptive Control in Regression Models , 2005 .

[6]  B E Storer,et al.  Design and analysis of phase I clinical trials. , 1989, Biometrics.

[7]  S Zacks,et al.  Cancer phase I clinical trials: efficient dose escalation with overdose control. , 1998, Statistics in medicine.

[8]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[9]  H. Robbins,et al.  Adaptive Design and Stochastic Approximation , 1979 .

[10]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[11]  D. Bayard A forward method for optimal stochastic nonlinear and adaptive control , 1988, Proceedings of the 27th IEEE Conference on Decision and Control.

[12]  Ying Kuen Cheung,et al.  Coherence principles in dose-finding studies , 2005 .

[13]  Khidir M. Abdelbasit,et al.  Experimental Design for Binary Data , 1983 .

[14]  J O'Quigley,et al.  Continual reassessment method: a practical design for phase 1 clinical trials in cancer. , 1990, Biometrics.

[15]  C. F. Wu,et al.  Efficient Sequential Designs with Binary Data , 1985 .

[16]  L. Haines,et al.  Bayesian Optimal Designs for Phase I Clinical Trials , 2003, Biometrics.

[17]  J O'Quigley,et al.  Continual reassessment method: a likelihood approach. , 1996, Biometrics.

[18]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[19]  M. Christian,et al.  Phase I clinical trial design in cancer drug development. , 2000, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[20]  L V Rubinstein,et al.  A comparison of two phase I trial designs. , 1994, Statistics in medicine.

[21]  Holger Dette,et al.  Optimal designs for a class of nonlinear regression models , 2002 .