论文信息 - Two cheers for P-values?

Two cheers for P-values?

P-values are a practical success but a critical failure. Scientists the world over use them, but scarcely a statistician can be found to defend them. Bayesians in particular find them ridiculous, but even the modern frequentist has little time for them. In this essay, I consider what, if anything, might be said in their favour.

S. Senn

[1] Student,et al. THE PROBABLE ERROR OF A MEAN , 1908 .

[2] R. A. Fisher,et al. Design of Experiments , 1936 .

[3] H. O. Lancaster. Statistical control of counting experiments , 1952 .

[4] D. Lindley. A STATISTICAL PARADOX , 1957 .

[5] M. Bartlett. A comment on D. V. Lindley's statistical paradox , 1957 .

[6] I. Good. A Bayesian Significance Test for Multinomial Distributions , 1967 .

[7] Rory A. Fisher,et al. Statistical methods and scientific inference. , 1957 .

[8] Irving John Good,et al. Some Logic and History of Hypothesis Testing , 1981 .

[9] R. Royall. The Effect of Sample Size on the Meaning of Significance Tests , 1986 .

[10] D. Johnstone,et al. Tests of Significance in Theory and Practice , 1986 .

[11] J. Berger,et al. Testing Precise Hypotheses , 1987 .

[12] Peter Urbach,et al. Scientific Reasoning: The Bayesian Approach , 1989 .

[13] F. J. Anscombe. The summarizing of clinical experiments by significance levels. , 1990, Statistics in medicine.

[14] D. Hand. A History of Probability and Statistics and Their Applications before 1750 , 1990 .

[15] G. Barnard. Must clinical trials be large? The interpretation of P-values and the combination of test results. , 1990, Statistics in medicine.

[16] K J Rothman,et al. No Adjustments Are Needed for Multiple Comparisons , 1990, Epidemiology.

[17] G A Colditz,et al. Relation of meat, fat, and fiber intake to the risk of colon cancer in a prospective study among women. , 1990, The New England journal of medicine.

[18] Ronald Aylmer Sir Fisher,et al. Statistical Methods, Experimental Design, and Scientific Inference , 1990 .

[19] Marcel Dekker. Weldon's Dice Data Revisited , 1991 .

[20] I Russell,et al. Statistics--with confidence? , 1991, The British journal of general practice : the journal of the Royal College of General Practitioners.

[21] S. Goodman,et al. A comment on replication, p-values and evidence. , 1992, Statistics in medicine.

[22] A. Edwards,et al. A History of Probability and Statistics and Their Applications before 1750 , 1992 .

[23] P. Freeman,et al. The role of p-values in analysing trial results. , 1993, Statistics in medicine.

[24] D. Lindley,et al. The Analysis of Experimental Data: The Appreciation of Tea and Wine , 1993 .

[25] S. Senn. Suspended judgment n-of-1 trials. , 1993, Controlled clinical trials.

[26] J. Potter,et al. Vegetables, fruit, and colon cancer in the Iowa Women's Health Study. , 1994, American journal of epidemiology.

[27] R T O'Neill,et al. The behavior of the P-value when the alternative hypothesis is true. , 1997, Biometrics.

[28] Samuel Kotz,et al. Leading Personalities in Statistical Sciences. , 1997 .

[29] Samuel Kotz,et al. Leading Personalities in Statistical Sciences. , 1997 .

[30] S. Goodman. Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy , 1999, Annals of Internal Medicine.

[31] Steven Goodman. Toward Evidence-Based Medical Statistics. 2: The Bayes Factor , 1999, Annals of Internal Medicine.

[32] M. Smithson. Statistics with confidence , 2000 .