Two cheers for P-values?

P-values are a practical success but a critical failure. Scientists the world over use them, but scarcely a statistician can be found to defend them. Bayesians in particular find them ridiculous, but even the modern frequentist has little time for them. In this essay, I consider what, if anything, might be said in their favour.
