The logical structure of experiments lays the foundation for a theory of reproducibility

The scientific reform movement has proposed openness as a potential remedy to the putative reproducibility or replication crisis. However, the conceptual relationship between openness, replication experiments, and results reproducibility has been obscure. We analyze the logical structure of experiments, define the mathematical notion of idealized experiment, and use this notion to advance a theory of reproducibility. Idealized experiments clearly delineate the concepts of replication and results reproducibility, and capture key differences with precision, allowing us to study the relationship among them. We show how results reproducibility varies as a function of: the elements of an idealized experiment, the true data generating mechanism, and the closeness of the replication experiment to an original experiment. We clarify how openness of experiments is related to designing informative replication experiments and to obtaining reproducible results. With formal backing and evidence, we argue that the current “crisis” reflects inadequate attention to a theoretical understanding of results reproducibility.

[1]  Berna Devezer,et al.  Bridging the gap between formal theory and scientific reform practices , 2022, bioRxiv.

[2]  J. Murre Addendum to: Murre (2021). The Godden and Baddeley (1975) experiment on context-dependent memory on land and underwater: a replication , 2022, Royal Society Open Science.

[3]  J. Murre The Godden and Baddeley (1975) experiment on context-dependent memory on land and underwater: a replication , 2021, Royal Society Open Science.

[4]  Michèle B. Nuijten,et al.  Replicability, Robustness, and Reproducibility in Psychological Science. , 2021, Annual review of psychology.

[5]  Samuel C. Fletcher How (not) to measure replication , 2021, European Journal for Philosophy of Science.

[6]  S. Gruijters Making inferential leaps: Manipulation checks and the road towards strong inference , 2020, Journal of Experimental Social Psychology.

[7]  Michèle B. Nuijten,et al.  Replicability, Robustness, and Reproducibility in Psychological Science. , 2020, Annual review of psychology.

[8]  L. Hedges,et al.  Reconsidering statistical methods for assessing replication. , 2020, Psychological methods.

[9]  D. Navarro,et al.  The case for formal methodology in scientific reform , 2020, bioRxiv.

[10]  Andrea E. Martin,et al.  How Computational Modeling Can Force Theory Building in Psychological Science , 2020, Perspectives on psychological science : a journal of the Association for Psychological Science.

[11]  Uljana Feest Why Replication Is Overrated , 2019, Philosophy of Science.

[12]  Bradley C. Love,et al.  Variability in the analysis of a single neuroimaging dataset by many teams , 2019, Nature.

[13]  Brian Dennis,et al.  Errors in Statistical Inference Under Model Misspecification: Evidence, Hypothesis Testing, and AIC , 2019, Front. Ecol. Evol..

[14]  Sarah de Rijcke,et al.  Rinse and Repeat: Understanding the Value of Replication across Different Ways of Knowing , 2019, Publ..

[15]  Fiona Fidler,et al.  Reproducibility of Scientific Results , 2018 .

[16]  Bert O. Baumgaertner,et al.  Openness and Reproducibility: Insights from a Model-Centric Approach. , 2018, 1811.04525.

[17]  Sabina Leonelli,et al.  Rethinking Reproducibility as a Criterion for Research Quality , 2018, Including a Symposium on Mary Morgan: Curiosity, Imagination, and Surprise.

[18]  Mathew H. Evans,et al.  Many Analysts, One Data Set: Making Transparent How Variations in Analytic Choices Affect Results , 2018, Advances in Methods and Practices in Psychological Science.

[19]  Michael C. Frank,et al.  Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition , 2018, Royal Society Open Science.

[20]  Erkan Ozge Buzbas,et al.  Scientific discovery in a model-centric framework: Reproducibility, innovation, and epistemic diversity , 2018, PloS one.

[21]  Joachim Vandekerckhove,et al.  Metastudies for robust tests of theory , 2018, Proceedings of the National Academy of Sciences.

[22]  Brian A. Nosek,et al.  The preregistration revolution , 2018, Proceedings of the National Academy of Sciences.

[23]  Andrew Gelman,et al.  Measurement error and the replication crisis , 2017, Science.

[24]  John P. A. Ioannidis,et al.  A manifesto for reproducible science , 2017, Nature Human Behaviour.

[25]  R. Giner-Sorolla,et al.  Pre-registration in social psychology—A discussion and suggested template , 2016 .

[26]  Francis Tuerlinckx,et al.  Increasing Transparency Through a Multiverse Analysis , 2016, Perspectives on psychological science : a journal of the Association for Psychological Science.

[27]  Gideon Nave,et al.  Evaluating replicability of laboratory experiments in economics , 2016, Science.

[28]  John P. A. Ioannidis,et al.  p-Curve and p-Hacking in Observational Research , 2016, PloS one.

[29]  John P. A. Ioannidis,et al.  Reproducible Research Practices and Transparency across the Biomedical Literature , 2016, PLoS biology.

[30]  Michael C. Frank,et al.  Estimating the reproducibility of psychological science , 2015, Science.

[31]  Brian A. Nosek,et al.  Promoting an open research culture , 2015, Science.

[32]  Jeffrey R. Spence,et al.  Expectations for Replications , 2014, Perspectives on psychological science : a journal of the Association for Psychological Science.

[33]  Olivia Guest,et al.  Implementations are not specifications: Specification, replication and experimentation in computational cognitive modeling , 2014, Cognitive Systems Research.

[34]  F. Collins,et al.  Policy: NIH plans to enhance reproducibility , 2014, Nature.

[35]  Thomas S. Shively,et al.  On the Equivalence between Bayesian and Classical Hypothesis Testing , 2013, 1312.0302.

[36]  Han L. J. van der Maas,et al.  Science Perspectives on Psychological an Agenda for Purely Confirmatory Research on Behalf Of: Association for Psychological Science , 2022 .

[37]  Yannis Charalabidis,et al.  Benefits, Adoption Barriers and Myths of Open Data and Open Government , 2012, Inf. Syst. Manag..

[38]  C. Borgman The conundrum of sharing research data , 2012, J. Assoc. Inf. Sci. Technol..

[39]  Jennifer C Molloy,et al.  The Open Knowledge Foundation: Open Data Means Better Science , 2011, PLoS biology.

[40]  D. Lindley The Philosophy of Statistics , 2000 .

[41]  N. Kerr HARKing: Hypothesizing After the Results are Known , 1998, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[42]  Friedrich Steinle,et al.  Entering New Fields: Exploratory Uses of Experimentation , 1997, Philosophy of Science.

[43]  Hans Radder,et al.  In and about the World: Philosophical Studies of Science and Technology , 1996 .

[44]  Hans Radder,et al.  Experimental Reproducibility and the Experimenters' Regress , 1992, PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.

[45]  E. Kenney … and Statistics , 1980, The Classical Review.

[46]  G. Box Science and Statistics , 1976 .

[47]  A. Baddeley,et al.  Context-dependent memory in two natural environments: on land and underwater. , 1975 .

[48]  N. L. Johnson Linear Statistical Inference and Its Applications , 1966 .

[49]  A. Gelman,et al.  The garden of forking paths : Why multiple comparisons can be a problem , even when there is no “ fishing expedition ” or “ p-hacking ” and the research hypothesis was posited ahead of time ∗ , 2019 .

[50]  Inventing Temperature , 2019, Thermodynamic Weirdness.

[51]  ProTon Europe,et al.  Fostering Integrity in Research , 2017 .

[52]  V. Stodden Trust your science? Open your data and code , 2011 .

[53]  Hasok Chang,et al.  Inventing Temperature: Measurement and Scientific Progress , 2004 .

[54]  T. Kuhn The structure of scientific revolutions, 3rd ed. , 1996 .

[55]  J. Sayers Against Method , 2016 .

[56]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[57]  C. Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications, Second Editon , 1973, Wiley Series in Probability and Statistics.