Handling an uncertain control group event risk in non-inferiority trials: non-inferiority frontiers and the power-stabilising transformation

Background Non-inferiority trials are increasingly used to evaluate new treatments that are expected to have secondary advantages over standard of care, but similar efficacy on the primary outcome. When designing a non-inferiority trial with a binary primary outcome, the choice of effect measure for the non-inferiority margin (e.g. risk ratio or risk difference) has an important effect on sample size calculations; furthermore, if the control event risk observed is markedly different from that assumed, the trial can quickly lose power or the results become difficult to interpret. Methods We propose a new way of designing non-inferiority trials to overcome the issues raised by unexpected control event risks. Our proposal involves using clinical judgement to specify a ‘non-inferiority frontier’, i.e. a curve defining the most appropriate non-inferiority margin for each possible value of control event risk. Existing trials implicitly use frontiers defined by a fixed risk ratio or a fixed risk difference. We discuss their limitations and propose a fixed arcsine difference frontier, using the power-stabilising transformation for binary outcomes, which may better represent clinical judgement. We propose and compare three ways of designing a trial using this frontier: testing and reporting on the arcsine scale; testing on the arcsine scale but reporting on the risk difference or risk ratio scale; and modifying the margin on the risk difference or risk ratio scale after observing the control event risk according to the power-stabilising frontier. Results Testing and reporting on the arcsine scale leads to results which are challenging to interpret clinically. For small values of control event risk, testing on the arcsine scale and reporting results on the risk difference scale produces confidence intervals at a higher level than the nominal one or non-inferiority margins that are slightly smaller than those back-calculated from the power-stabilising frontier alone. However, working on the arcsine scale generally requires a larger sample size compared to the risk difference scale. Therefore, working on the risk difference scale, modifying the margin after observing the control event risk, might be preferable, as it requires a smaller sample size. However, this approach tends to slightly inflate type I error rate; a solution is to use a slightly lower significance level for testing, although this modestly reduces power. When working on the risk ratio scale instead, the same approach based on the modification of the margin leads to power levels above the nominal one, maintaining type I error under control. Conclusions Our proposed methods of designing non-inferiority trials using power-stabilising non-inferiority frontiers make trial design more resilient to unexpected values of the control event risk, at the only cost of requiring somewhat larger sample sizes when the goal is to report results on the risk difference scale.

[1]  Sheena McCormack,et al.  The averted infections ratio: a novel measure of effectiveness of experimental HIV pre-exposure prophylaxis agents. , 2018, The lancet. HIV.

[2]  S. Snapinn Noninferiority trials , 2000, Current controlled trials in cardiovascular medicine.

[3]  A Lotery,et al.  Clinical efficacy of intravitreal aflibercept versus panretinal photocoagulation for best corrected visual acuity in patients with proliferative diabetic retinopathy at 52 weeks (CLARITY): a multicentre, single-blinded, randomised, controlled, phase 2b, non-inferiority trial , 2017, The Lancet.

[4]  Lei Nie,et al.  An adaptive noninferiority margin and sample size adjustment in covariate-adjustment regression model approach to noninferiority clinical trials , 2010, Model. Assist. Stat. Appl..

[5]  James R Carpenter,et al.  Non-inferiority trials: are they inferior? A systematic review of reporting in major medical journals , 2016, BMJ Open.

[6]  Rolf H. H. Groenwold,et al.  Defining the noninferiority margin and analysing noninferiority: An overview , 2017, British journal of clinical pharmacology.

[7]  Or Non-Inferiority Clinical Trials to Establish Effectiveness Guidance for Industry , 2016 .

[8]  James P Hughes,et al.  Adaptive non-inferiority margins under observable non-constancy , 2018, Statistical methods in medical research.

[9]  Ingram Olkin,et al.  Why add anything to nothing? The arcsine difference as a measure of treatment effect in meta‐analysis with zero cells , 2009, Statistics in medicine.

[10]  Baptiste Leurent,et al.  Switching HIV Treatment in Adults Based on CD4 Count Versus Viral Load Monitoring: A Randomized, Non-Inferiority Trial in Thailand , 2013, PLoS medicine.

[11]  Lei Nie,et al.  A covariate‐adjustment regression model approach to noninferiority margin definition , 2010, Statistics in medicine.

[12]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[13]  Mark Rothmann,et al.  Design and analysis of non‐inferiority mortality trials in oncology , 2002, Statistics in medicine.

[14]  Jason Madan,et al.  Evaluation of a standardized treatment regimen of anti-tuberculosis drugs for patients with multi-drug-resistant tuberculosis (STREAM): study protocol for a randomized controlled trial , 2014, Trials.

[15]  J. Schumi,et al.  TRIALS REVIEW Open Access , 2022 .

[16]  Enno Schmidt,et al.  Doxycycline versus prednisolone as an initial treatment strategy for bullous pemphigoid: a pragmatic, non-inferiority, randomised controlled trial , 2017, The Lancet.

[17]  Sanjay Kaul,et al.  Non-inferiority study design: lessons to be learned from cardiovascular trials. , 2012, European heart journal.

[18]  Nicola McMeekin,et al.  ORAL VERSUS INTRAVENOUS ANTIBIOTICS FOR THE TREATMENT OF BONE AND JOINT INFECTION (OVIVA): A MULTICENTRE RANDOMISED CONTROLLED TRIAL , 2018 .

[19]  Sanjay Kaul,et al.  Good Enough: A Primer on the Analysis and Interpretation of Noninferiority Trials , 2006, Annals of Internal Medicine.

[20]  Ralph B. D'Agostino,et al.  Challenges in the Design and Interpretation of Noninferiority Trials , 2017, The New England journal of medicine.

[21]  Walter Desmet,et al.  ISAR-SAFE: a randomized, double-blind, placebo-controlled trial of 6 vs. 12 months of clopidogrel therapy after drug-eluting stenting. , 2015, European heart journal.

[22]  F. J. Anscombe,et al.  THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA , 1948 .

[23]  A. Copas,et al.  Superiority and non-inferiority: two sides of the same coin? , 2018, Trials.

[24]  Brian P Hobbs,et al.  Detecting and accounting for violations of the constancy assumption in non-inferiority clinical trials , 2018, Statistical methods in medical research.

[25]  Pablo Salinas,et al.  Challenges in the Design and Interpretation of Noninferiority Trials: Insights From Recent Stent Trials. , 2017, Journal of the American College of Cardiology.

[26]  R. Newcombe,et al.  Interval estimation for the difference between independent proportions: comparison of eleven methods. , 1998, Statistics in medicine.