Using pilot data to size a two‐arm randomized trial to find a nearly optimal personalized treatment strategy

A personalized treatment strategy formalizes evidence-based treatment selection by mapping patient information to a recommended treatment. Personalized treatment strategies can produce better patient outcomes while reducing cost and treatment burden. Thus, among clinical and intervention scientists, there is a growing interest in conducting randomized clinical trials when one of the primary aims is estimation of a personalized treatment strategy. However, at present, there are no appropriate sample size formulae to assist in the design of such a trial. Furthermore, because the sampling distribution of the estimated outcome under an estimated optimal treatment strategy can be highly sensitive to small perturbations in the underlying generative model, sample size calculations based on standard (uncorrected) asymptotic approximations or computer simulations may not be reliable. We offer a simple and robust method for powering a single stage, two-armed randomized clinical trial when the primary aim is estimating the optimal single stage personalized treatment strategy. The proposed method is based on inverting a plugin projection confidence interval and is thereby regular and robust to small perturbations of the underlying generative model. The proposed method requires elicitation of two clinically meaningful parameters from clinical scientists and uses data from a small pilot study to estimate nuisance parameters, which are not easily elicited. The method performs well in simulated experiments and is illustrated using data from a pilot study of time to conception and fertility awareness.

[1]  M. Kosorok,et al.  Q-LEARNING WITH CENSORED DATA. , 2012, Annals of statistics.

[2]  Eric B. Laber,et al.  Interactive model building for Q-learning. , 2014, Biometrika.

[3]  D. Gervini Warped functional regression , 2012, 1203.1975.

[4]  A. V. D. Vaart,et al.  On Differentiable Functionals , 1991 .

[5]  Joelle Pineau,et al.  A multiple imputation strategy for sequential multiple assignment randomized trials , 2014, Statistics in medicine.

[6]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[7]  B. Chakraborty,et al.  Statistical methods for dynamic treatment regimes , 2013 .

[8]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[9]  Susan A. Murphy,et al.  A-Learning for approximate planning , 2004 .

[10]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.

[11]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[12]  Joseph R. Rausch,et al.  Sample size planning for statistical power and accuracy in parameter estimation. , 2008, Annual review of psychology.

[13]  Anastasios A. Tsiatis,et al.  Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[14]  Eric B. Laber,et al.  Dynamic treatment regimes: technical challenges and applications. , 2014, Electronic journal of statistics.

[15]  Susan A Murphy,et al.  Adaptive Confidence Intervals for the Test Error in Classification , 2011, Journal of the American Statistical Association.

[16]  E. Abrahams,et al.  The Case for Personalized Medicine , 2009, Journal of diabetes science and technology.

[17]  Inbal Nahum-Shani,et al.  Q-learning: a data analysis method for constructing adaptive interventions. , 2012, Psychological methods.

[18]  Ken Kelley,et al.  Obtaining Power or Obtaining Precision , 2003, Evaluation & the health professions.

[19]  Wenbin Lu,et al.  On optimal treatment regimes selection for mean survival time , 2015, Statistics in medicine.

[20]  Ken R. Smith,et al.  Impact of instruction in the Creighton model fertilitycare system on time to pregnancy in couples of proven fecundity: results of a randomised trial. , 2014, Paediatric and perinatal epidemiology.

[21]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[22]  Wenbin Lu,et al.  Variable selection for optimal treatment decision , 2013, Statistical methods in medical research.

[23]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[24]  Hannes Leeb,et al.  The Finite-Sample Distribution of Post-Model-Selection Estimators, and Uniform Versus Non-Uniform Approximations , 2000 .

[25]  Daniel J. Lizotte,et al.  Set‐valued dynamic treatment regimes for competing outcomes , 2012, Biometrics.

[26]  F. Collins,et al.  The path to personalized medicine. , 2010, The New England journal of medicine.

[27]  Susan A Murphy,et al.  Sample size formulae for two-stage randomized trials with survival outcomes. , 2011, Biometrika.

[28]  Susan Murphy,et al.  Inference for non-regular parameters in optimal dynamic treatment regimes , 2010, Statistical methods in medical research.

[29]  Donglin Zeng,et al.  New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2015, Journal of the American Statistical Association.

[30]  R. Berger,et al.  P Values Maximized Over a Confidence Set for the Nuisance Parameter , 1994 .

[31]  Susan A. Murphy,et al.  A Generalization Error for Q-Learning , 2005, J. Mach. Learn. Res..

[32]  S. Vollset,et al.  Periconceptional Folic Acid Supplementation and Infant Risk of Congenital Heart Defects in Norway 1999-2009. , 2015, Paediatric and perinatal epidemiology.

[33]  J. Wittes,et al.  The role of internal pilot studies in increasing the efficiency of clinical trials. , 1990, Statistics in medicine.

[34]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[35]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[36]  J. Robins,et al.  Estimation and extrapolation of optimal treatment and testing strategies , 2008, Statistics in medicine.

[37]  K. Hirano,et al.  Impossibility Results for Nondifferentiable Functionals , 2012 .

[38]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[39]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[40]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .