Making 'null effects' informative: statistical techniques and inferential frameworks

Being able to interpret ‘null effects?is important for cumulative knowledge generation in science. To draw informative conclusions from null-effects, researchers need to move beyond the incorrect interpretation of a non-significant result in a null-hypothesis significance test as evidence of the absence of an effect. We explain how to statistically evaluate null-results using equivalence tests, Bayesian estimation, and Bayes factors. A worked example demonstrates how to apply these statistical tools and interpret the results. Finally, we explain how no statistical approach can actually prove that the null-hypothesis is true, and briefly discuss the philosophical differences between statistical approaches to examine null-effects. The increasing availability of easy-to-use software and online tools to perform equivalence tests, Bayesian estimation, and calculate Bayes factors make it timely and feasible to complement or move beyond traditional null-hypothesis tests, and allow researchers to draw more informative conclusions about null-effects. Relevance for patients: Conclusions based on clinical trial data often focus on demonstrating differences due to treatments, despite demonstrating the absence of differences is an equally important statistical question. Researchers commonly conclude the absence of an effect based on the incorrect use of traditional methods. By providing an accessible overview of different approaches to exploring null-results, we hope researchers improve their statistical inferences. This should lead to a more accurate interpretation of studies, and facilitate knowledge generation about proposed treatments.

[1]  John K. Kruschke,et al.  Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan , 2014 .

[2]  A. Gelman Induction and Deduction in Baysian Data Analysis , 2011 .

[3]  Andrew F. Hayes,et al.  A Primer on Multilevel Modeling , 2006 .

[4]  R. Kerns,et al.  Meta-analysis of psychological interventions for chronic low back pain. , 2007, Health psychology : official journal of the Division of Health Psychology, American Psychological Association.

[5]  Anton Kühberger,et al.  Publication Bias in Psychology: A Diagnosis Based on the Correlation between Effect Size and Sample Size , 2014, PloS one.

[6]  Eric-Jan Wagenmakers,et al.  Informed Bayesian t-Tests , 2017, The American Statistician.

[7]  J. L. Rogers,et al.  Using significance tests to evaluate equivalence between two experimental groups. , 1993, Psychological bulletin.

[8]  Felix D. Schönbrodt,et al.  Sequential Hypothesis Testing With Bayes Factors: Efficiently Testing Mean Differences , 2017, Psychological methods.

[9]  R Z Omar,et al.  Bayesian methods of analysis for cluster randomized trials with binary outcome data. , 2001, Statistics in medicine.

[10]  J. Kruschke Bayesian estimation supersedes the t test. , 2013, Journal of experimental psychology. General.

[11]  J. Kruschke Rejecting or Accepting Parameter Values in Bayesian Estimation , 2018 .

[12]  Richard McElreath,et al.  Statistical Rethinking: A Bayesian Course with Examples in R and Stan , 2015 .

[13]  A. Greenwald Consequences of Prejudice Against the Null Hypothesis , 1975 .

[14]  Brian D. Earp,et al.  The need for reporting negative results - a 90 year update , 2017, Journal of clinical and translational research.

[15]  D. Lakens Equivalence Tests: A Practical Primer for t Tests, Correlations, and Meta-Analyses , 2015 .

[16]  Zoltan Dienes,et al.  Using Bayes to get the most out of non-significant results , 2014, Front. Psychol..

[17]  Anne M. Scheel,et al.  Equivalence Testing for Psychological Research: A Tutorial , 2018, Advances in Methods and Practices in Psychological Science.

[18]  Eric-Jan Wagenmakers,et al.  Bayesian tests to quantify the result of a replication attempt. , 2014, Journal of experimental psychology. General.

[19]  Torrin M. Liddell,et al.  Bayesian data analysis for newcomers , 2018, Psychonomic bulletin & review.

[20]  M. Samore,et al.  Do non-inferiority trials of reduced intensity therapies show reduced effects? A descriptive analysis , 2018, BMJ Open.

[21]  J. Locascio Results Blind Science Publishing , 2017, Basic and applied social psychology.

[22]  Jeffrey N. Rouder,et al.  The philosophy of Bayes’ factors and the quantification of statistical evidence , 2016 .

[23]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[24]  N. Lazar,et al.  The ASA Statement on p-Values: Context, Process, and Purpose , 2016 .

[25]  P. Meehl Why Summaries of Research on Psychological Theories are Often Uninterpretable , 1990 .

[26]  Kylie A. Williams,et al.  Efficacy , Tolerability , and Dose-Dependent Effects of Opioid Analgesics for LowBack Pain , 2016 .

[27]  Torrin M. Liddell,et al.  The Bayesian New Statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective , 2016, Psychonomic bulletin & review.

[28]  Donald J. Schuirmann A comparison of the Two One-Sided Tests Procedure and the Power Approach for assessing the equivalence of average bioavailability , 1987, Journal of Pharmacokinetics and Biopharmaceutics.

[29]  Jeffrey N. Rouder,et al.  The fallacy of placing confidence in confidence intervals , 2015, Psychonomic bulletin & review.

[30]  M. Ferreira,et al.  Efficacy and safety of paracetamol for spinal pain and osteoarthritis: systematic review and meta-analysis of randomised placebo controlled trials , 2015, BMJ : British Medical Journal.

[31]  Nicholas P. Holmes,et al.  Justify your alpha , 2018, Nature Human Behaviour.

[32]  D. Ravenzwaaij,et al.  Credible Confidence: A Pragmatic View on the Frequentist vs Bayesian Debate , 2018 .

[33]  Harvey Goldstein,et al.  Multilevel modelling of medical data , 2002, Statistics in medicine.

[34]  F. Fidler,et al.  The Epistemic Importance of Establishing the Absence of an Effect , 2018, Advances in Methods and Practices in Psychological Science.

[35]  Jeffrey N. Rouder,et al.  Bayesian t tests for accepting and rejecting the null hypothesis , 2009, Psychonomic bulletin & review.

[36]  Scott S Emerson,et al.  Bio‐creep in non‐inferiority clinical trials , 2010, Statistics in medicine.

[37]  Jose D. Perezgonzalez,et al.  Fisher, Neyman-Pearson or NHST? A tutorial for teaching data testing , 2015, Front. Psychol..

[38]  E. Wagenmakers,et al.  Bayesian hypothesis testing for psychologists: A tutorial on the Savage–Dickey method , 2010, Cognitive Psychology.

[39]  Michael Meyners,et al.  Equivalence tests – A review , 2012 .

[40]  Christopher Harms,et al.  A Bayes Factor for Replications of ANOVA Results , 2016, The American Statistician.

[41]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[42]  Nicky J Welton,et al.  Effects of glucosamine, chondroitin, or placebo in patients with osteoarthritis of hip or knee: network meta-analysis , 2010, BMJ : British Medical Journal.