A taxonomy of model assumptions on which P is based and implications for added benefit in the sciences

ABSTRACT Although the null hypothesis significance testing procedure is problematic, many still favor the use of p-values as indicating the state of evidence against the model used to generate the p-value. From this perspective, p-values benefit science; or would benefit science if used correctly. In contrast, the novel argument to be presented introduces a taxonomy of assumptions included in the model; such as theoretical assumptions, auxiliary assumptions, statistical assumptions, and inferential assumptions. Careful attention is paid to the different categories of assumptions that models necessarily include to render p-value calculations sensible. Such careful attention suggests unappreciated limitations of p-values. Considering these limitations in the context of the descriptive statistics researchers routinely have available to them clarifies that p-values provide no added benefit to the scientist, above and beyond such descriptive statistics. The lack of added value, combined with the obvious harms documented in recent reviews, suggests that researchers in the sciences should rarely, or never, use p-values. Not even for indicating the state of evidence against models.

[1]  William Briggs,et al.  Uncertainty: The Soul of Modeling, Probability & Statistics , 2016 .

[2]  R. Kline Comment on Locascio, Results Blind Science Publishing , 2017 .

[3]  Michael J. Marks Commentary on Locascio , 2017 .

[4]  David Gal,et al.  Abandon Statistical Significance , 2017, The American Statistician.

[5]  I. Ajzen,et al.  Belief, Attitude, Intention, and Behavior: An Introduction to Theory and Research , 1977 .

[6]  Ariel M. Aloe,et al.  Life After NHST: How to Describe Your Data Without “p-ing” Everywhere , 2015 .

[7]  D. Trafimow Confidence intervals, precision and confounding , 2018, New Ideas in Psychology.

[8]  I. Ajzen,et al.  Predicting and Changing Behavior: The Reasoned Action Approach , 2009 .

[9]  Sander Greenland,et al.  Invited Commentary: The Need for Cognitive Science in Methodology , 2017, American journal of epidemiology.

[10]  D. Trafimow,et al.  From a Sampling Precision Perspective, Skewness Is a Friend and Not an Enemy! , 2019, Educational and psychological measurement.

[11]  Raymond Hubbard,et al.  Corrupt Research: The Case for Reconceptualizing Empirical Management and SocialScience , 2016 .

[12]  N. Lazar,et al.  The ASA Statement on p-Values: Context, Process, and Purpose , 2016 .

[13]  George E. P. Box,et al.  Empirical Model‐Building and Response Surfaces , 1988 .

[14]  Michael R. Hyman,et al.  Can “Results Blind Manuscript Evaluation” Assuage “Publication Bias”? , 2017 .

[15]  D. Trafimow Hypothesis testing and theory evaluation at the boundaries: surprising insights from Bayes's theorem. , 2003, Psychological review.

[16]  R. Nickerson,et al.  Null hypothesis significance testing: a review of an old and continuing controversy. , 2000, Psychological methods.

[17]  J. Locascio Results Blind Science Publishing , 2017, Basic and applied social psychology.

[18]  Imre Lakatos,et al.  The Methodology of Scientific Research Programmes , 1978 .

[19]  D. Mccloskey,et al.  The Cult of Statistical Significance: How the Standard Error Costs Us Jobs, Justice, and Lives , 2008 .

[20]  Elizabeth Gilbert,et al.  Reproducibility Project: Results (Part of symposium called "The Reproducibility Project: Estimating the Reproducibility of Psychological Science") , 2014 .

[21]  J. Locascio Rejoinder to Responses to “Results-Blind Publishing” , 2017, Basic and applied social psychology.

[22]  William M. Briggs,et al.  Everything Wrong with P-Values Under One Roof , 2018, ECONVN.

[23]  M. Fishbein A theory of reasoned action: some applications and implications. , 1980, Nebraska Symposium on Motivation. Nebraska Symposium on Motivation.

[24]  Jacob Cohen The earth is round (p < .05) , 1994 .

[25]  Means and standard deviations, or locations and scales? That is the question! , 2018, New Ideas in Psychology.

[26]  L. Harlow,et al.  What if there were no significance tests , 1997 .

[27]  J. Grice Comment on Locascio’s Results Blind Manuscript Evaluation Proposal , 2017 .

[28]  Richard A. Berk,et al.  Statistical Assumptions as Empirical Commitments , 2001 .

[29]  J. Gallacher,et al.  Why representativeness should be avoided. , 2013, International journal of epidemiology.

[30]  D. Trafimow Implications of an initial empirical victory for the truth of the theory and additional empirical victories , 2017 .

[31]  E. Caldin,et al.  The Aim and Structure of Physical Theory , 1955 .

[32]  M. T. Bradley,et al.  Significance Testing Needs a Taxonomy , 2016, Psychological reports.

[33]  D. Trafimow Why It Is Problematic to Calculate Probabilities of Findings Given Range Null Hypotheses , 2017 .