Efficient design and inference for multistage randomized trials of individualized treatment policies.

Clinical demand for individualized "adaptive" treatment policies in diverse fields has spawned development of clinical trial methodology for their experimental evaluation via multistage designs, building upon methods intended for the analysis of naturalistically observed strategies. Because often there is no need to parametrically smooth multistage trial data (in contrast to observational data for adaptive strategies), it is possible to establish direct connections among different methodological approaches. We show by algebraic proof that the maximum likelihood (ML) and optimal semiparametric (SP) estimators of the population mean of the outcome of a treatment policy and its standard error are equal under certain experimental conditions. This result is used to develop a unified and efficient approach to design and inference for multistage trials of policies that adapt treatment according to discrete responses. We derive a sample size formula expressed in terms of a parametric version of the optimal SP population variance. Nonparametric (sample-based) ML estimation performed well in simulation studies, in terms of achieved power, for scenarios most likely to occur in real studies, even though sample sizes were based on the parametric formula. ML outperformed the SP estimator; differences in achieved power predominately reflected differences in their estimates of the population mean (rather than estimated standard errors). Neither methodology could mitigate the potential for overestimated sample sizes when strong nonlinearity was purposely simulated for certain discrete outcomes; however, such departures from linearity may not be an issue for many clinical contexts that make evaluation of competitive treatment policies meaningful.

[1]  Ree Dawson,et al.  Sample size calculations for evaluating treatment policies in multi-stage designs , 2010, Clinical trials.

[2]  Philip W. Lavori,et al.  A design for testing clinical strategies: biased adaptive within‐subject randomization , 2000 .

[3]  J. Robins,et al.  Semiparametric regression estimation in the presence of dependent censoring , 1995 .

[4]  A. Tsiatis,et al.  Optimal Estimator for the Survival Distribution and Related Quantities for Treatment Policies in Two‐Stage Randomization Designs in Clinical Trials , 2004, Biometrics.

[5]  P. Lavori,et al.  Sequential causal inference: Application to randomized trials of adaptive treatment strategies , 2008, Statistics in medicine.

[6]  J. Robins The control of confounding by intermediate variables. , 1989, Statistics in medicine.

[7]  Ree Dawson,et al.  Improving the efficiency of estimation in randomized trials of adaptive treatment strategies , 2007, Clinical trials.

[8]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[9]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[10]  J M Robins,et al.  Marginal Mean Models for Dynamic Regimes , 2001, Journal of the American Statistical Association.

[11]  D. Kupfer,et al.  Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design. , 2004, Controlled clinical trials.

[12]  Marie Davidian,et al.  Estimation of Survival Distributions of Treatment Policies in Two‐Stage Randomization Designs in Clinical Trials , 2002, Biometrics.

[13]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.

[14]  Abdus S Wahed,et al.  Sample size for two‐stage studies with maintenance therapy , 2009, Statistics in medicine.

[15]  H. Sung,et al.  Evaluating multiple treatment courses in clinical trials. , 2000, Statistics in medicine.