Bayes factors for superiority, non-inferiority, and equivalence designs

BackgroundIn clinical trials, study designs may focus on assessment of superiority, equivalence, or non-inferiority, of a new medicine or treatment as compared to a control. Typically, evidence in each of these paradigms is quantified with a variant of the null hypothesis significance test. A null hypothesis is assumed (null effect, inferior by a specific amount, inferior by a specific amount and superior by a specific amount, for superiority, non-inferiority, and equivalence respectively), after which the probabilities of obtaining data more extreme than those observed under these null hypotheses are quantified by p-values. Although ubiquitous in clinical testing, the null hypothesis significance test can lead to a number of difficulties in interpretation of the results of the statistical evidence.MethodsWe advocate quantifying evidence instead by means of Bayes factors and highlight how these can be calculated for different types of research design.ResultsWe illustrate Bayes factors in practice with reanalyses of data from existing published studies.ConclusionsBayes factors for superiority, non-inferiority, and equivalence designs allow for explicit quantification of evidence in favor of the null hypothesis. They also allow for interim testing without the need to employ explicit corrections for multiple testing.

[1]  Daniel B. Chamberlain,et al.  Making Sense of a Negative Clinical Trial Result: A Bayesian Analysis of a Clinical Trial of Lorazepam and Diazepam for Pediatric Status Epilepticus , 2017, Annals of emergency medicine.

[2]  Eric-Jan Wagenmakers,et al.  Informed Bayesian t-Tests , 2017, The American Statistician.

[3]  E. Wagenmakers,et al.  The comparative evidence basis for the efficacy of second-generation antidepressants in the treatment of depression in the US: A Bayesian meta-analysis of Food and Drug Administration reviews. , 2018, Journal of affective disorders.

[4]  J. Rouder Optional stopping: No problem for Bayesians , 2014, Psychonomic bulletin & review.

[5]  E. H. Witte,et al.  From Discovery to Justification: Outline of an Ideal Research Program in Empirical Psychology , 2017, Front. Psychol..

[6]  Herbert Hoijtink,et al.  Approximated adjusted fractional Bayes factors: A general method for testing informative hypotheses , 2018, The British journal of mathematical and statistical psychology.

[7]  S. Luthra The scientific foundation, rationale and argument for a nonfrequentist Bayesian analysis in clinical trials in coronary artery disease. , 2015, Heart, lung & circulation.

[8]  Don van Ravenzwaaij,et al.  Bayesian Reanalysis of Null Results Reported in the New England Journal of Medicine: Strong yet Variable Evidence for the Absence of Treatment Effects , 2018 .

[9]  S. Goodman,et al.  Toward Evidence-Based Medical Statistics. 2: The Bayes Factor , 1999, Annals of Internal Medicine.

[10]  R. Weiss Bayesian sample size calculations for hypothesis testing , 1997 .

[11]  Mithat Gönen,et al.  Assessing noninferiority in a three-arm trial using the Bayesian approach. , 2011, Statistics in medicine.

[12]  A R Feinstein,et al.  Claims of Equivalence in Medical Research: Are They Supported by the Evidence? , 2000, Annals of Internal Medicine.

[13]  W. Boersma,et al.  Antibiotic treatment strategies for community-acquired pneumonia in adults. , 2015, The New England journal of medicine.

[14]  D. Berry Bayesian clinical trials , 2006, Nature Reviews Drug Discovery.

[15]  U. Dirnagl,et al.  Biomedical research: increasing value, reducing waste , 2014, The Lancet.

[16]  L. Hothorn Statistical issues in drug development (2nd edn). Stephen Senn, John Wiley & Sons Ltd, Chichester, 2007. No. of pages: xix + 498. Price: $130.00. ISBN 978‐0‐470‐01877‐4 , 2009 .

[17]  Jeffrey N. Rouder,et al.  Bayesian t tests for accepting and rejecting the null hypothesis , 2009, Psychonomic bulletin & review.

[18]  Muhtarjan Osman,et al.  Semiparametric Bayesian Testing Procedure for Noninferiority Trials with Binary Endpoints , 2011, Journal of biopharmaceutical statistics.

[19]  Ward Edwards,et al.  Bayesian statistical inference for psychological research. , 1963 .

[20]  Anthony O'Hagan,et al.  Kendall's Advanced Theory of Statistics: Vol. 2B, Bayesian Inference. , 1996 .

[21]  T. Daimon Bayesian sample size calculations for a non-inferiority test of two proportions in clinical trials. , 2008, Contemporary clinical trials.

[22]  G. Guyatt,et al.  Ethical Issues in Stopping Randomized Trials Early Because of Apparent Benefit , 2007, Annals of Internal Medicine.

[23]  Guochen Song,et al.  Bayesian methods for the design and analysis of noninferiority trials , 2016, Journal of biopharmaceutical statistics.

[24]  S. Pocock,et al.  Reporting of noninferiority and equivalence randomized trials: extension of the CONSORT 2010 statement. , 2006, JAMA.

[25]  Tony O'Hagan,et al.  Dicing with the unknown , 2004 .

[26]  G. Andersson,et al.  Randomised controlled non-inferiority trial with 3-year follow-up of internet-delivered versus face-to-face group cognitive behavioural therapy for depression. , 2013, Journal of affective disorders.

[27]  D. Bakan,et al.  The test of significance in psychological research. , 1966, Psychological bulletin.

[28]  S. Assmann,et al.  Effects of red-cell storage duration on patients undergoing cardiac surgery. , 2015, The New England journal of medicine.

[29]  Christopher Jennison,et al.  Statistical Approaches to Interim Monitoring of Medical Trials: A Review and Commentary , 1990 .

[30]  Haolun Shi,et al.  Bayesian randomized clinical trials: From fixed to adaptive design. , 2017, Contemporary clinical trials.

[31]  A. Kalil,et al.  Low-dose steroids for septic shock and severe sepsis: the use of Bayesian statistics to resolve clinical trial controversies , 2011, Intensive Care Medicine.

[32]  J Jack Lee,et al.  Bayesian clinical trials in action , 2012, Statistics in medicine.

[33]  Stephen Senn,et al.  Statistical Issues in Drug Development , 1997 .

[34]  Sarah Zohar,et al.  Software to compute and conduct sequential Bayesian phase I or II dose-ranging clinical trials with stopping rules , 2003, Comput. Methods Programs Biomed..

[35]  M. Clyde,et al.  Mixtures of g Priors for Bayesian Variable Selection , 2008 .

[36]  K. Carlsen,et al.  Racemic adrenaline and inhalation strategies in acute bronchiolitis. , 2013, The New England journal of medicine.

[37]  W. Johnson,et al.  The Bayesian Two-Sample t Test , 2005 .

[38]  L. Inoue,et al.  Use of Bayesian Techniques in Randomized Clinical Trials: A CMS Case Study , 2009 .

[39]  Hamid Pezeshk,et al.  Bayesian techniques for sample size determination in clinical trials: a short review , 2003, Statistical methods in medical research.

[40]  B. Zaslavsky Bayesian Hypothesis Testing in Two‐Arm Trials with Dichotomous Outcomes , 2013, Biometrics.

[41]  J. Ioannidis,et al.  A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results , 2017, PloS one.

[42]  Ram C Tiwari,et al.  Bayesian approach to non-inferiority trials for normal means , 2016, Statistical methods in medical research.

[43]  Whole Grain Label Statements Guidance for Industry and FDA Staff , 2006 .

[44]  A. O'Hagan,et al.  Kendall's Advanced Theory of Statistics, Vol. 2b: Bayesian Inference. , 1996 .

[45]  H. Jeffreys,et al.  Theory of probability , 1896 .

[46]  E. Wagenmakers,et al.  UvA-DARE ( Digital Academic Repository ) Bayesian reanalysis of null results reported in medicine Strong yet variable evidence for the absence of treatment effects , 2018 .

[47]  M. J. Bayarri,et al.  Criteria for Bayesian model choice with application to variable selection , 2012, 1209.5240.