Statistical methods helping and hindering environmental science and management

Environmental scientists face the reality that many of their journals’ editors and referees routinely insist that results be accompanied by statements of statistical significance, obtained from two-sided tests of point-null hypotheses. Many in these three groups of people appear only vaguely a ware of the arbitrarinessoften invoked by this procedure and of the information sterility in a single p-value. The interpretation to be made of the failure of a test to attain such significance is not clear. For such reasons, some colleagues (and senior statisticans) have called current usage of the procedures into serious question. Some reasons for this dislocation and some of the more dramatic consequences for environmental science and management are presented. Interval and Bayesian approaches can offer remedies.

[1]  J. Nelder,et al.  Statistics for the millennium. From statistics to statistical science. Commentary. Author's of reply , 1999 .

[2]  C. Poole Feelings and frequencies: two kinds of probability in public health research. , 1988, American journal of public health.

[3]  R. P. Carver The Case Against Statistical Significance Testing , 1978 .

[4]  Peter G. Fairweather,et al.  Statistical Power and Design Requirements for Environmental Monitoring , 1991 .

[5]  Kenneth H. Reckhow,et al.  Bayesian inference in non-replicated ecological studies , 1990 .

[6]  R. Calderon,et al.  Health effects of swimmers and nonpoint sources of contaminated water. , 1991, International journal of environmental health research.

[7]  Glenn W. Suter,et al.  Abuse of hypothesis testing statistics in ecological risk assessment , 1996 .

[8]  Kenneth H. Reckhow,et al.  Engineering Approaches for Lake Management, Volume 1: Data Analysis and Empirical Modeling , 1982 .

[9]  G B McBride,et al.  Confidence of compliance: a Bayesian approach for percentile standards. , 2001, Water research.

[10]  R. Hilborn,et al.  The Ecological Detective: Confronting Models with Data , 1997 .

[11]  Jean D. Gibbons,et al.  P-values: Interpretation and Methodology , 1975 .

[12]  Randall M. Peterman,et al.  Statistical power analysis and the precautionary principle , 1992 .

[13]  Graham B. McBride,et al.  Applications: Equivalence Tests Can Enhance Environmental Science and Management , 1999 .

[14]  V. Vieland,et al.  Statistical Evidence: A Likelihood Paradigm , 1998 .

[15]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[16]  Denton E. Morrison,et al.  The Significance Test Controversy , 1972 .

[17]  John S. Gray,et al.  Statistics and the precautionary principle , 1990 .

[18]  S. Goodman,et al.  p values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. , 1993, American journal of epidemiology.

[19]  L. Harlow,et al.  What if there were no significance tests , 1997 .

[20]  Joseph Berkson,et al.  Tests of significance considered as evidence , 1942 .

[21]  Joseph Berkson,et al.  Some Difficulties of Interpretation Encountered in the Application of the Chi-Square Test , 1938 .

[22]  R. O. Gilbert Statistical Methods for Environmental Pollution Monitoring , 1987 .

[23]  Shein-Chung Chow,et al.  Design and Analysis of Bioavailability and Bioequivalence Studies , 1994 .

[24]  C Poole,et al.  Beyond the confidence interval. , 1987, American journal of public health.

[25]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[26]  R. P. Carver The Case Against Statistical Significance Testing, Revisited , 1993 .

[27]  P. Dayton,et al.  Reversal of the Burden of Proof in Fisheries Management , 1998, Science.

[28]  R. Royall Statistical Evidence: A Likelihood Paradigm , 1997 .

[29]  B. Mapstone Scalable Decision Rules for Environmental Impact Studies: Effect Size, Type I, and Type II Errors , 1995 .

[30]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[31]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[32]  M. Kendall,et al.  Kendall's advanced theory of statistics , 1995 .

[33]  S. Goodman Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy , 1999, Annals of Internal Medicine.

[34]  D. Helsel,et al.  Statistical methods in water resources , 2020, Techniques and Methods.

[35]  B. Bower Null science: Psychology's statistical status quo draws fire , 1997 .

[36]  Kenneth H. Reckhow,et al.  Engineering approaches for lake management , 1983 .

[37]  Joseph D. Germano,et al.  Ecology, statistics, and the art of misdiagnosis: The need for a paradigm shift , 1999 .

[38]  W. W. Rozeboom The fallacy of the null-hypothesis significance test. , 1960, Psychological bulletin.

[39]  J. Tukey The Philosophy of Multiple Comparisons , 1991 .

[40]  G. McBride,et al.  What do significance tests really tell us about the environment? , 1993 .

[41]  Donald A. Berry,et al.  Statistics: A Bayesian Perspective , 1995 .

[42]  Lene Buhl-Mortensen,et al.  Type-II statistical errors in environmental science and the precautionary principle , 1996 .

[43]  Shein-Chung Chow,et al.  Design and Analysis of Bioavailability and Bioequivalence Studies , 1994 .

[44]  Douglas H. Johnson The Insignificance of Statistical Significance Testing , 1999 .