Beyond p-values: A phase II dual-criterion design with statistical significance and clinical relevance

Background Well-designed phase II trials must have acceptable error rates relative to a pre-specified success criterion, usually a statistically significant p-value. Such standard designs may not always suffice from a clinical perspective because clinical relevance may call for more. For example, proof-of-concept in phase II often requires not only statistical significance but also a sufficiently large effect estimate. Purpose We propose dual-criterion designs to complement statistical significance with clinical relevance, discuss their methodology, and illustrate their implementation in phase II. Methods Clinical relevance requires the effect estimate to pass a clinically motivated threshold (the decision value (DV)). In contrast to standard designs, the required effect estimate is an explicit design input, whereas study power is implicit. The sample size for a dual-criterion design needs careful considerations of the study’s operating characteristics (type I error, power). Results Dual-criterion designs are discussed for a randomized controlled and a single-arm phase II trial, including decision criteria, sample size calculations, decisions under various data scenarios, and operating characteristics. The designs facilitate GO/NO-GO decisions due to their complementary statistical–clinical criterion. Limitations While conceptually simple, implementing a dual-criterion design needs care. The clinical DV must be elicited carefully in collaboration with clinicians, and understanding similarities and differences to a standard design is crucial. Conclusion To improve evidence-based decision-making, a formal yet transparent quantitative framework is important. Dual-criterion designs offer an appealing statistical–clinical compromise, which may be preferable to standard designs if evidence against the null hypothesis alone does not suffice for an efficacy claim.

[1]  G. Casella,et al.  Reconciling Bayesian and Frequentist Evidence in the One-Sided Testing Problem , 1987 .

[2]  L. Wernisch,et al.  An optimal stratified Simon two‐stage design , 2016, Pharmaceutical statistics.

[3]  Heinz Schmidli,et al.  A practical guide to Bayesian group sequential designs , 2014, Pharmaceutical statistics.

[4]  J. Berger,et al.  Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence , 1987 .

[5]  S. Steinberg,et al.  Clinical trial designs for the early clinical development of therapeutic cancer vaccines. , 2001, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[6]  Terry M. Therneau,et al.  Optimal two-stage screening designs for survival comparisons , 1990 .

[7]  P Y Liu,et al.  False positive rates of randomized phase II designs. , 1999, Controlled clinical trials.

[8]  Christy Chuang-Stein,et al.  The role of the minimum clinically important difference and its impact on designing a trial , 2011, Pharmaceutical statistics.

[9]  Roland Fisch,et al.  Bayesian Design of Proof-of-Concept Trials , 2015, Therapeutic innovation & regulatory science.

[10]  J. French,et al.  A Quantitative Approach for Making Go/No-Go Decisions in Drug Development , 2011 .

[11]  W. Alan Nicewander,et al.  A Consonance Criterion for Choosing Sample Size , 1997 .

[12]  T R Fleming,et al.  One-sample multiple testing procedure for phase II clinical trials. , 1982, Biometrics.

[13]  Boris Freidlin,et al.  Design issues of randomized phase II trials and a proposal for phase II screening trials. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[14]  James Matcham,et al.  Decision-making in early clinical drug development. , 2016, Pharmaceutical statistics.

[15]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[16]  Michael Branson,et al.  A proof of concept phase II non‐inferiority criterion , 2011, Statistics in medicine.

[17]  N. Lazar,et al.  The ASA Statement on p-Values: Context, Process, and Purpose , 2016 .

[18]  P Y Liu,et al.  Selection designs for pilot studies based on survival. , 1993, Biometrics.

[19]  Eric R. Ziegel,et al.  Statistical Issues in Drug Development , 1997 .

[20]  S K Carter,et al.  Calibrated phase II clinical trials in oncology. , 1986, Statistics in medicine.

[21]  R. Simon,et al.  Optimal two-stage designs for phase II clinical trials. , 1989, Controlled clinical trials.

[22]  D. Sargent,et al.  A three-outcome design for phase II clinical trials. , 2001, Controlled Clinical Trials.

[23]  S Madani,et al.  Proof of Concept: A PhRMA Position Paper With Recommendations for Best Practice , 2010, Clinical pharmacology and therapeutics.

[24]  R Simon,et al.  Clinical trial designs for cytostatic agents: are new approaches needed? , 2001, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[25]  B. Storer A class of phase II designs with three possible outcomes. , 1992, Biometrics.