Operating characteristics of sample size re‐estimation with futility stopping based on conditional power

Various methods have been described for re‐estimating the final sample size in a clinical trial based on an interim assessment of the treatment effect. Many re‐weight the observations after re‐sizing so as to control the pursuant inflation in the type I error probability α. Lan and Trost (Estimation of parameters and sample size re‐estimation. Proceedings of the American Statistical Association Biopharmaceutical Section 1997; 48–51) proposed a simple procedure based on conditional power calculated under the current trend in the data (CPT). The study is terminated for futility if CPT ⩽ CL, continued unchanged if CPT ⩾ CU, or re‐sized by a factor m to yield CPT = CU if CL < CPT < CU, where CL and CU are pre‐specified probability levels. The overall level α can be preserved since the reduction due to stopping for futility can balance the inflation due to sample size re‐estimation, thus permitting any form of final analysis with no re‐weighting. Herein the statistical properties of this approach are described including an evaluation of the probabilities of stopping for futility or re‐sizing, the distribution of the re‐sizing factor m, and the unconditional type I and II error probabilities α and β. Since futility stopping does not allow a type I error but commits a type II error, then as the probability of stopping for futility increases, α decreases and β increases. An iterative procedure is described for choice of the critical test value and the futility stopping boundary so as to ensure that specified α and β are obtained. However, inflation in β is controlled by reducing the probability of futility stopping, that in turn dramatically increases the possible re‐sizing factor m. The procedure is also generalized to limit the maximum sample size inflation factor, such as at mmax = 4. However, doing so then allows for a non‐trivial fraction of studies to be re‐sized at this level that still have low conditional power. These properties also apply to other methods for sample size re‐estimation with a provision for stopping for futility. Sample size re‐estimation procedures should be used with caution and the impact on the overall type II error probability should be assessed. Copyright © 2005 John Wiley & Sons, Ltd.

[1]  A. Tsiatis,et al.  On the inefficiency of the adaptive design for monitoring clinical trials , 2003 .

[2]  Martin Posch,et al.  Issues in designing flexible trials , 2003, Statistics in medicine.

[3]  Michael A Proschan,et al.  Practical midcourse sample size modification in clinical trials. , 2003, Controlled clinical trials.

[4]  Christopher Jennison,et al.  Mid‐course sample size modification in clinical trials based on the observed treatment effect , 2002, Statistics in medicine.

[5]  W. Shih,et al.  A sample size adjustment procedure for clinical trials based on conditional power. , 2002, Biostatistics.

[6]  W. Brannath,et al.  Recursive Combination Tests , 2002 .

[7]  J. Denne,et al.  Sample size recalculation using conditional power , 2001, Statistics in medicine.

[8]  A L Gould,et al.  Sample size re‐estimation: recent developments and practical considerations , 2001, Statistics in medicine.

[9]  H. Schäfer,et al.  Adaptive Group Sequential Designs for Clinical Trials: Combining the Advantages of Adaptive and of Classical Group Sequential Approaches , 2001, Biometrics.

[10]  G. Chi,et al.  On Sample Size and Inference for Two‐Stage Adaptive Designs , 2001, Biometrics.

[11]  Z. Shun,et al.  Type I error in sample size re‐estimations based on observed treatment difference , 2001, Statistics in medicine.

[12]  K. K. Lan,et al.  FLEXIBLE INTERIM ANALYSIS METHOD FOR SAMPLE SIZE RE-ESTIMATION AND EARLY STOPPING: A CONDITIONAL POWER APPROACH , 2001 .

[13]  D L DeMets,et al.  Computations for group sequential boundaries using the Lan-DeMets spending function method. , 2000, Controlled clinical trials.

[14]  W. Lehmacher,et al.  Adaptive Sample Size Calculations in Group Sequential Trials , 1999, Biometrics.

[15]  P. Bauer,et al.  Adaptive Two Stage Designs and the Conditional Error Function , 1999 .

[16]  B. Turnbull,et al.  Group Sequential Methods , 1999 .

[17]  Sue-Jane Wang,et al.  Modification of Sample Size in Group Sequential Clinical Trials , 1999, Biometrics.

[18]  L Fisher,et al.  Statistical Inference for Self‐Designing Clinical Trials with a One‐Sided Hypothesis , 1999, Biometrics.

[19]  L D Fisher,et al.  Self-designing clinical trials. , 1998, Statistics in medicine.

[20]  G Wassmer,et al.  A comparison of two methods for adaptive interim analyses in clinical trials. , 1998, Biometrics.

[21]  M. Proschan,et al.  Designed extension of studies based on conditional power. , 1995, Biometrics.

[22]  P. Bauer,et al.  Evaluation of experiments with adaptive interim analyses. , 1994, Biometrics.

[23]  David M. Reboussin,et al.  Information and information fractions for design and sequential monitoring of clinical trials , 1994 .

[24]  D. Zucker,et al.  Sequential monitoring of clinical trials: the role of information and Brownian motion. , 1993, Statistics in medicine.

[25]  Michael A. Proschan,et al.  Effects of assumption violations on type I error rate in group sequential monitoring , 1992 .

[26]  T. Fleming,et al.  Parameter estimation following group sequential hypothesis testing , 1990 .

[27]  K K Lan,et al.  The B-value: a tool for monitoring data. , 1988, Biometrics.

[28]  K Kim,et al.  Confidence intervals following group sequential tests in clinical trials. , 1987, Biometrics.

[29]  P K Andersen,et al.  Conditional power calculations as an aid in the decision whether to continue a clinical trial. , 1987, Controlled clinical trials.

[30]  J M Lachin,et al.  Evaluation of sample size and power for analyses of survival with allowance for nonuniform patient entry, losses to follow-up, noncompliance, and stratification. , 1986, Biometrics.

[31]  A A Tsiatis,et al.  Exact confidence intervals following a group sequential test. , 1984, Biometrics.

[32]  D L DeMets,et al.  An aid to data monitoring in long-term clinical trials. , 1982, Controlled clinical trials.

[33]  K. K. Lan,et al.  Stochastically curtailed tests in long–term clinical trials , 1982 .