The Replication Crisis in Epidemiology: Snowball, Snow Job, or Winter Solstice?

Purpose of ReviewLike a snowball rolling down a steep hill, the most recent crisis over the perceived lack of reproducibility of scientific results has outpaced the evidence of crisis. It has led to new actions and new guidelines that have been rushed to market without plans for evaluation, metrics for success, or due consideration of the potential for unintended consequences.Recent FindingsThe perception of the crisis is at least partly a snow job, heavily influenced by a small number of centers lavishly funded by a single foundation, with undue and unsupported attention to preregistration as a solution to the perceived crisis. At the same time, the perception of crisis provides an opportunity for introspection. Two studies’ estimates of association may differ because of undue attention on null hypothesis statistical testing, because of differences in the distribution of effect modifiers, because of differential susceptibility to threats to validity, or for other reasons. Perhaps the expectation of what reproducible epidemiology ought to look like is more misguided than the practice of epidemiology. We advocate for the idea of “replication and advancement.” Studies should not only replicate earlier work, but also improve on it in by enhancing the design or analysis.SummaryAbandoning blind reliance on null hypothesis significance testing for statistical inference, finding consensus on when preregistration of non-randomized study protocols has merit, and focusing on replication and advance are the most certain ways to emerge from this solstice for the better.

[1]  T. Sterling Publication Decisions and their Possible Effects on Inferences Drawn from Tests of Significance—or Vice Versa , 1959 .

[2]  A. B. Hill The Environment and Disease: Association or Causation? , 1965, Proceedings of the Royal Society of Medicine.

[3]  C B Begg,et al.  A measure to aid in the interpretation of published clinical trials. , 1985, Statistics in medicine.

[4]  K J Rothman,et al.  Significance questing. , 1986, Annals of internal medicine.

[5]  K J Rothman,et al.  A strengthening programme for weak associations. , 1988, International journal of epidemiology.

[6]  A R Feinstein,et al.  Scientific standards in epidemiologic studies of the menace of daily life. , 1988, Science.

[7]  A R Feinstein,et al.  A collection of 56 topics with contradictory results in case-control research. , 1988, International journal of epidemiology.

[8]  J. Utts Replication and Meta-Analysis in Parapsychology , 1991 .

[9]  G. Colditz,et al.  Estrogen replacement therapy and coronary heart disease: A quantitative assessment of the epidemiologic evidence☆☆☆ , 1991 .

[10]  S. Cummings,et al.  Hormone therapy to prevent disease and prolong life in postmenopausal women , 1992, Annals of internal medicine.

[11]  Casual inference. , 1993, Epidemiology.

[12]  G. Taubes Epidemiology faces its limits. , 1995, Science.

[13]  N. Kerr HARKing: Hypothesizing After the Results are Known , 1998, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[14]  D. Weed Epidemiologic evidence and causal inference. , 2000, Hematology/oncology clinics of North America.

[15]  J. Palmgren,et al.  Sinistrality—a side-effect of prenatal sonography: A comparative study of young men , 2001, Epidemiology.

[16]  C D Holman,et al.  A Psychometric Experiment in Causal Inference to Estimate Evidential Weights Used by Epidemiologists , 2001, Epidemiology.

[17]  Charles Kooperberg,et al.  Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results From the Women's Health Initiative randomized controlled trial. , 2002, JAMA.

[18]  D. Petitti Hormone replacement therapy and coronary heart disease: results of randomized trials. , 2003, Progress in cardiovascular diseases.

[19]  R. Lemen Chrysotile Asbestos as a Cause of Mesothelioma: Application of the Hill Causation Model , 2004, International journal of occupational and environmental health.

[20]  Shah Ebrahim,et al.  Commentary: the hormone replacement-coronary heart disease conundrum: is this the death of observational epidemiology? , 2004, International journal of epidemiology.

[21]  John Hoey,et al.  Clinical trial registration: a statement from the International Committee of Medical Journal Editors. , 2004, JAMA.

[22]  Ida Sim,et al.  Principles for international registration of protocol information and results from human trials of health related interventions: Ottawa statement (part 1) , 2005, BMJ : British Medical Journal.

[23]  A. Gelman,et al.  The Difference Between “Significant” and “Not Significant” is not Itself Statistically Significant , 2006 .

[24]  Sander Greenland,et al.  Why Most Published Research Findings Are False: Problems in the Analysis , 2007, PLoS medicine.

[25]  S. Pocock,et al.  Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies , 2007, BMJ : British Medical Journal.

[26]  J. Ioannidis Why Most Discovered True Associations Are Inflated , 2008, Epidemiology.

[27]  James M. Robins,et al.  Observational Studies Analyzed Like Randomized Experiments: An Application to Postmenopausal Hormone Therapy and Coronary Heart Disease , 2008, Epidemiology.

[28]  G Maldonado,et al.  Adjusting a relative-risk estimate for study imperfections , 2008, Journal of Epidemiology and Community Health.

[29]  J. Brooks Why most published research findings are false: Ioannidis JP, Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece , 2008 .

[30]  Paolo Vineis,et al.  Epidemiology, Public Health, and the Rhetoric of False Positives , 2009, Environmental health perspectives.

[31]  Tony Tse,et al.  Registration of observational studies: Is it time? , 2010, Canadian Medical Association Journal.

[32]  S. Cole,et al.  Generalizing evidence from randomized clinical trials to target populations: The ACTG 320 trial. , 2010, American journal of epidemiology.

[33]  Trish Groves,et al.  Registration of observational studies , 2010, BMJ : British Medical Journal.

[34]  G. Bedogni Applying Quantitative Bias Analysis to Epidemiologic Data , 2011 .

[35]  Sander Greenland,et al.  Causation and Causal Inference , 2021, International Encyclopedia of Statistical Science.

[36]  M. Bracken Preregistration of epidemiology protocols: a commentary in support. , 2011, Epidemiology.

[37]  K. Salvesen Ultrasound in pregnancy and non‐right handedness: meta‐analysis of randomized trials , 2011, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[38]  J. Ioannidis,et al.  The False-positive to False-negative Ratio in Epidemiologic Studies , 2011, Epidemiology.

[39]  Timothy L Lash,et al.  Commentary: Should Preregistration of Epidemiologic Study Protocols Become Compulsory? Reflections and a Counterproposal , 2012, Epidemiology.

[40]  J. Mclaughlin,et al.  False Positives in Cancer Epidemiology , 2012, Cancer Epidemiology, Biomarkers & Prevention.

[41]  Sandro Galea,et al.  An argument for a consequentialist epidemiology. , 2013, American journal of epidemiology.

[42]  Marcia McNutt,et al.  Journals unite for reproducibility , 2014, Science.

[43]  F. Collins,et al.  Policy: NIH plans to enhance reproducibility , 2014, Nature.

[44]  Richard F MacLehose,et al.  Good practices for quantitative bias analysis. , 2014, International journal of epidemiology.

[45]  John P. A. Ioannidis,et al.  How to Make More Published Research True , 2014, PLoS medicine.

[46]  Harvey J Motulsky,et al.  Common Misconceptions about Data Analysis and Statistics , 2014, The Journal of Pharmacology and Experimental Therapeutics.

[47]  J. Ioannidis,et al.  Reproducibility in Science: Improving the Standard for Basic and Preclinical Research , 2015, Circulation research.

[48]  T. Lash Declining the Transparency and Openness Promotion Guidelines. , 2015, Epidemiology.

[49]  T. Lash Advancing Research through Replication. , 2015, Paediatric and perinatal epidemiology.

[50]  S. Buck,et al.  Solving reproducibility , 2015, Science.

[51]  Brian A. Nosek,et al.  Promoting an open research culture , 2015, Science.

[52]  Muin J Khoury,et al.  Planning for the Future of Epidemiology in the Era of Big Data and Precision Medicine. , 2015, American journal of epidemiology.

[53]  Brian A. Nosek,et al.  Using prediction markets to estimate the reproducibility of scientific research , 2015, Proceedings of the National Academy of Sciences.

[54]  Michael C. Frank,et al.  Estimating the reproducibility of psychological science , 2015, Science.

[55]  Stavroula A Chrysanthopoulou,et al.  Probabilistic bias analysis in pharmacoepidemiology and comparative effectiveness research: a systematic review , 2016, Pharmacoepidemiology and drug safety.

[56]  S. Goodman,et al.  Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations , 2016, European Journal of Epidemiology.

[57]  Yolanda Gil,et al.  Enhancing reproducibility for computational methods , 2016, Science.

[58]  John P. A. Ioannidis,et al.  Reproducible Research Practices and Transparency across the Biomedical Literature , 2016, PLoS biology.

[59]  Derek Abbott,et al.  Too good to be true: when overwhelming evidence fails to convince , 2016, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[60]  J. Ioannidis,et al.  Registration practices for observational studies on ClinicalTrials.gov indicated low adherence. , 2016, Journal of clinical epidemiology.

[61]  Robert Platt,et al.  Specifying a target trial prevents immortal time bias and other self-inflicted injuries in observational analyses. , 2016, Journal of clinical epidemiology.

[62]  U. Bogdahn,et al.  Statin use and risk of glioma: population-based case–control analysis , 2016, European Journal of Epidemiology.

[63]  Michelle L Degelman,et al.  Smoking and multiple sclerosis: A systematic review and meta-analysis using the Bradford Hill criteria for causation. , 2017, Multiple sclerosis and related disorders.

[64]  G. Banks,et al.  The Chrysalis Effect , 2017 .

[65]  Tara Gomes,et al.  Association Between Serotonergic Antidepressant Use During Pregnancy and Autism Spectrum Disorder in Children , 2017, JAMA.

[66]  T. Lash,et al.  On the Need for Quantitative Bias Analysis in the Peer-Review Process , 2017, American journal of epidemiology.

[67]  Stephen R Cole,et al.  Transportability of Trial Results Using Inverse Odds of Sampling Weights. , 2017, American journal of epidemiology.

[68]  Sander Greenland,et al.  Invited Commentary: The Need for Cognitive Science in Methodology , 2017, American journal of epidemiology.

[69]  John P. A. Ioannidis,et al.  A manifesto for reproducible science , 2017, Nature Human Behaviour.

[70]  Harry Crane Why 'Redefining Statistical Significance' Will Not Improve Reproducibility and Could Make the Replication Crisis Worse , 2017, 1711.07801.

[71]  Announcement: Transparency upgrade for Nature journals , 2017, Nature.

[72]  Yoshinori Hatori,et al.  The Difference between Significant and Non-significant , 2017 .

[73]  Manipulating the alpha level cannot cure significance testing – comments on "Redefine statistical significance" , 2017 .

[74]  David J. Spiegelhalter,et al.  The ASA's p‐value statement, one year on , 2017 .

[75]  Lash Responds to "Is Reproducibility Thwarted by Hypothesis Testing?" and "The Need for Cognitive Science in Methodology". , 2017, American journal of epidemiology.

[76]  Michael G Hudgens,et al.  Generalizing Study Results: A Potential Outcomes Perspective. , 2017, Epidemiology.

[77]  Timothy L Lash,et al.  The Harm Done to Reproducibility by the Culture of Null Hypothesis Significance Testing. , 2017, American journal of epidemiology.

[78]  M. Munafo,et al.  Robust research needs many lines of evidence , 2018, Nature.

[80]  David Gal,et al.  Abandon Statistical Significance , 2017, The American Statistician.