Response‐adaptive randomization for multi‐arm clinical trials using the forward looking Gittins index rule

The Gittins index provides a well established, computationally attractive, optimal solution to a class of resource allocation problems known collectively as the multi-arm bandit problem. Its development was originally motivated by the problem of optimal patient allocation in multi-arm clinical trials. However, it has never been used in practice, possibly for the following reasons: (1) it is fully sequential, i.e., the endpoint must be observable soon after treating a patient, reducing the medical settings to which it is applicable; (2) it is completely deterministic and thus removes randomization from the trial, which would naturally protect against various sources of bias. We propose a novel implementation of the Gittins index rule that overcomes these difficulties, trading off a small deviation from optimality for a fully randomized, adaptive group allocation procedure which offers substantial improvements in terms of patient benefit, especially relevant for small populations. We report the operating characteristics of our approach compared to existing methods of adaptive randomization using a recently published trial as motivation.

[1]  W. Rosenberger,et al.  The theory of response-adaptive randomization in clinical trials , 2006 .

[2]  Urtzi Ayesta,et al.  On the Gittins index in the M/G/1 queue , 2009, Queueing Syst. Theory Appl..

[3]  Yuhong Yang,et al.  RANDOMIZED ALLOCATION WITH NONPARAMETRIC ESTIMATION FOR A MULTI-ARMED BANDIT PROBLEM WITH COVARIATES , 2002 .

[4]  K. Glazebrook On Randomized Dynamic Allocation Indices for the Sequential Design of Experiments , 1980 .

[5]  D. Zucker The Belmont Report , 2014 .

[6]  William F. Rosenberger,et al.  Optimality, Variability, Power , 2003 .

[7]  Linda Wang,et al.  Press Release Cancer Specialists in Disagreement About Purpose of Clinical Trials , 2002 .

[8]  Giulia Bianchi,et al.  Efficacy and safety of neoadjuvant pertuzumab and trastuzumab in women with locally advanced, inflammatory, or early HER2-positive breast cancer (NeoSphere): a randomised multicentre, open-label, phase 2 trial. , 2012, The Lancet. Oncology.

[9]  Kevin D. Glazebrook,et al.  Multi-Armed Bandit Allocation Indices: Gittins/Multi-Armed Bandit Allocation Indices , 2011 .

[10]  Shein-Chung Chow,et al.  Adaptive design methods in clinical trials – a review , 2008, Orphanet journal of rare diseases.

[11]  W. R. Thompson ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[12]  Jack Bowden,et al.  Multi-armed Bandit Models for the Optimal Design of Clinical Trials: Benefits and Challenges. , 2015, Statistical science : a review journal of the Institute of Mathematical Statistics.

[13]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[14]  D. Berry,et al.  Adaptive assignment versus balanced randomization in clinical trials: a decision analysis. , 1995, Statistics in medicine.

[15]  Peter Jacko,et al.  Scheduling of users with markovian time-varying transmission rates , 2013, SIGMETRICS '13.

[16]  Donald A. Berry,et al.  Bandit Problems: Sequential Allocation of Experiments. , 1986 .

[17]  N Stallard,et al.  Optimal Adaptive Designs for Binary Response Trials , 2001, Biometrics.

[18]  John R. Hauser,et al.  Website Morphing , 2009, Mark. Sci..

[19]  Donald A. Berry,et al.  Optimal adaptive randomized designs for clinical trials , 2007 .

[20]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[21]  J M Lachin,et al.  The use of response-adaptive designs in clinical trials. , 1993, Controlled clinical trials.

[22]  William F. Rosenberger,et al.  Implementing Optimal Allocation in Sequential Binary Response Experiments , 2007 .

[23]  William H Press,et al.  Bandit solutions provide unified ethical models for randomized clinical trials and comparative effectiveness research , 2009, Proceedings of the National Academy of Sciences.

[24]  J. Gittins,et al.  A dynamic allocation index for the discounted multiarmed bandit problem , 1979 .

[25]  G. Parmigiani,et al.  Bayesian adaptive randomized trial design for patients with recurrent glioblastoma. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.