An Ethical Approach to Peeking at Data

When data analyses produce encouraging but nonsignificant results, researchers often respond by collecting more data. This may transform a disappointing dataset into a publishable study, but it does so at the cost of increasing the Type I error rate. How big of a problem is this, and what can we do about it? To answer the first question, we estimate the Type I error inflation based on the initial sample size, the number of participants used to augment the dataset, the critical value for determining significance (typically .05), and the maximum p value within the initial sample such that the dataset would be augmented. With one round of augmentation, Type I error inflation maximizes at .0975 with typical values from .0564 to .0883. To answer the second question, we review methods of adjusting the critical value to allow augmentation while maintaining p < .05, but we note that such methods must be applied a priori. For the common occurrence of post-hoc dataset augmentation, we develop a new statistic, paugmented, that represents the magnitude of the resulting Type I error inflation. We argue that the disclosure of post-hoc dataset augmentation via paugmented elevates such augmentation from a questionable research practice to an ethical research decision.

[1]  E. Wagenmakers A practical solution to the pervasive problems ofp values , 2007, Psychonomic bulletin & review.

[2]  Thom Baguley,et al.  Serious stats: a guide to advanced statistics for the behavioral sciences , 2012 .

[3]  R. Frick,et al.  A better stopping rule for conventional statistical tests , 1998 .

[4]  D. A. Fitts Improved stopping rules for the design of efficient small-sample experiments in biomedical and biobehavioral research , 2010, Behavior research methods.

[5]  R. Rosenthal,et al.  Statistical Procedures and the Justification of Knowledge in Psychological Science , 1989 .

[6]  Robert A. Heinlein,et al.  The Moon Is a Harsh Mistress , 1966 .

[7]  E. Eich Business Not as Usual , 2014, Psychological science.

[8]  D. Lakens,et al.  Sailing From the Seas of Chaos Into the Corridor of Stability , 2014, Perspectives on psychological science : a journal of the Association for Psychological Science.

[9]  W. Lehmacher,et al.  Adaptive Sample Size Calculations in Group Sequential Trials , 1999, Biometrics.

[10]  P. O'Brien,et al.  A multiple testing procedure for clinical trials. , 1979, Biometrics.

[11]  G. Loewenstein,et al.  Measuring the Prevalence of Questionable Research Practices With Incentives for Truth Telling , 2012, Psychological science.

[12]  J. Cornfield Sequential Trials, Sequential Analysis and the Likelihood Principle , 1966 .

[13]  Sue-Jane Wang,et al.  Modification of Sample Size in Group Sequential Clinical Trials , 1999, Biometrics.

[14]  Kate E Decleene,et al.  Publication Manual of the American Psychological Association , 2011 .

[15]  Jeffrey N. Rouder,et al.  Bayesian t tests for accepting and rejecting the null hypothesis , 2009, Psychonomic bulletin & review.

[16]  V. Vieland,et al.  Statistical Evidence: A Likelihood Paradigm , 1998 .

[17]  J. P. Morgan,et al.  Design and Analysis: A Researcher's Handbook , 2005, Technometrics.

[18]  J. Revuelta,et al.  Optimization of sample size in controlled experiments: The CLAST rule , 2006, Behavior research methods.

[19]  James M. Culberson A Billion Here, a Billion There , 1996 .

[20]  P. Armitage,et al.  Repeated Significance Tests on Accumulating Data , 1969 .

[21]  M. Whitlock Combining probability from independent tests: the weighted Z‐method is superior to Fisher's approach , 2005, Journal of evolutionary biology.

[22]  G. Cumming,et al.  The New Statistics , 2014, Psychological science.

[23]  S J Pocock,et al.  Interim analyses for randomized clinical trials: the group sequential approach. , 1982, Biometrics.

[24]  Z. Dienes Bayesian Versus Orthodox Statistics: Which Side Are You On? , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[25]  Leif D. Nelson,et al.  False-Positive Psychology , 2011, Psychological science.

[26]  S. Pocock Group sequential methods in the design and analysis of clinical trials , 1977 .