Irreproducibility; Nothing is More Predictable.

The increasing ease of data capture and storage has led to a corresponding increase in the choice of data, the type of analysis performed on that data, and the complexity of the analysis performed. The main contribution of this paper is to show that the subjective choice of data and analysis methodology substantially impacts the identification of factors and outcomes of observational studies. This subjective variability of inference is at the heart of recent discussions around irreproducibility in scientific research. To demonstrate this subjective variability, data is taken from an existing study, where interest centres on understanding the factors associated with a young adult's propensity to fall into the category of `not in employment, education or training' (NEET). A fully probabilistic analysis is performed, set in a Bayesian framework and implemented using Reversible Jump Markov chain Monte Carlo (RJMCMC). The results show that different techniques lead to different inference but that models consisting of different factors often have the same predictive performance, whether the analysis is frequentist or Bayesian, making inference problematic. We demonstrate how the use of prior distributions in Bayesian techniques can be used to as a tool for assessing a factor's importance.

[1]  S. Goodman,et al.  Using Design Thinking to Differentiate Useful From Misleading Evidence in Observational Research. , 2017, JAMA.

[2]  I. Hickie,et al.  A prospective cohort study of depression course, functional disability, and NEET status in help-seeking young adults , 2016, Social Psychiatry and Psychiatric Epidemiology.

[3]  D. Curran‐Everett,et al.  The fickle P value generates irreproducible results , 2015, Nature Methods.

[4]  A. Mackinnon,et al.  Transitions Study of predictors of illness progression in young people with mental ill health: study methodology , 2015, Early intervention in psychiatry.

[5]  Bridianne O'Dea,et al.  A cross-sectional exploration of the clinical characteristics of disengaged (NEET) young people in primary mental healthcare , 2014, BMJ Open.

[6]  Leif D. Nelson,et al.  Data from Paper “False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant” , 2014 .

[7]  Regina Nuzzo,et al.  Scientific method: Statistical errors , 2014, Nature.

[8]  Andrew Gelman,et al.  Data-dependent analysis—a "garden of forking paths"— explains why many statistically significant comparisons don't hold up. , 2014 .

[9]  I. Hickie,et al.  Adolescents and young adults who are not in employment, education, or training , 2013, BMJ : British Medical Journal.

[10]  H. Beek F1000Prime recommendation of False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. , 2012 .

[11]  Massimiliano Mascherini,et al.  NEETs: young people not in employment, education or training: characteristics, costs and policy responses in Europe , 2012 .

[12]  Daniele Fanelli,et al.  Negative results are disappearing from most disciplines and countries , 2011, Scientometrics.

[13]  Anna Genell,et al.  Model selection in Medical Research: A simulation study comparing Bayesian Model Averaging and Stepwise Regression , 2010, BMC medical research methodology.

[14]  James G. Scott,et al.  Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem , 2010, 1011.2333.

[15]  D. Fanelli “Positive” Results Increase Down the Hierarchy of the Sciences , 2010, PloS one.

[16]  Jim E. Griffin,et al.  Transdimensional Sampling Algorithms for Bayesian Variable Selection in Classification Problems With Many More Variables Than Observations , 2009 .

[17]  Victoria Stodden,et al.  Breakdown Point of Model Selection When the Number of Variables Exceeds the Number of Observations , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[18]  F. Dominici,et al.  Reproducible epidemiologic research. , 2006, American journal of epidemiology.

[19]  C. Holmes,et al.  Bayesian auxiliary variable models for binary and multinomial regression , 2006 .

[20]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[21]  Ameet Bakhai,et al.  Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression , 2004, Statistics in medicine.

[22]  J. Bynner,et al.  Social exclusion and the transition from school to work: The case of young people not in education, employment, or training (NEET) , 2002 .

[23]  S. Richardson,et al.  Variable selection and Bayesian model averaging in case‐control studies , 2001, Statistics in medicine.

[24]  Elias Carayannis,et al.  Study of Methodology , 2000 .

[25]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[26]  M. Artés Statistical errors. , 1977, Medicina clinica.

[27]  R. Kohn,et al.  Nonparametric regression using Bayesian variable selection , 1996 .

[28]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .