Increasing scientific power with statistical power.

A survey of basic ideas in statistical power analysis demonstrates the advantages and ease of using power analysis throughout the design, analysis, and interpretation of research. The power of a statistical test is the probability of rejecting the null hypothesis of the test. The traditional approach to power involves computation of only a single power value. The more general power curve allows examining the range of power determinants, which are sample size, population difference, and error variance, in traditional ANOVA. Power analysis can be useful not only in study planning, but also in the evaluation of existing research. An important application is in concluding that no scientifically important treatment difference exists. Choosing an appropriate power depends on: a) opportunity costs, b) ethical trade-offs, c) the size of effect considered important, d) the uncertainty of parameter estimates, and e) the analyst's preferences. Although precise rules seem inappropriate, several guidelines are defensible. First, the sensitivity of the power curve to particular characteristics of the study, such as the error variance, should be examined in any power analysis. Second, just as a small type I error rate should be demonstrated in order to declare a difference nonzero, a small type II error should be demonstrated in order to declare a difference zero. Third, when ethical and opportunity costs do not preclude it, power should be at least .84, and preferably greater than .90.

[1]  M K Kaiser,et al.  MANOVA method for analyzing repeated measures designs: an extensive primer. , 1985, Psychological bulletin.

[2]  C Gatsonis,et al.  Multiple correlation: exact power and sample size calculations. , 1989, Psychological bulletin.

[3]  V. Benignus,et al.  Dose-effects functions for carboxyhemoglobin and behavior. , 1990, Neurotoxicology and teratology.

[4]  P. Lachenbruch,et al.  Design Sensitivity: Statistical Power for Experimental Research. , 1989 .

[5]  Mark W. Lipsey,et al.  Design Sensitivity: Statistical Power for Experimental Research. , 1989 .

[6]  D. Kleinbaum,et al.  Applied Regression Analysis and Other Multivariate Methods , 1978 .

[7]  G. Glass,et al.  Meta-analysis in social research , 1981 .

[8]  Harris Cooper,et al.  Integrating Research: A Guide for Literature Reviews , 1989 .

[9]  Scott E. Maxwell,et al.  Designing Experiments and Analyzing Data , 1991 .

[10]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[11]  V A Benignus,et al.  Recommendations for appropriate statistical practice in toxicologic experiments. , 1984, Neurotoxicology.

[12]  M. Dagenais Comment on Zeger and Brookmeyer , 1991 .

[13]  V. Benignus,et al.  Absence of symptoms with carboxyhemoglobin levels of 16-23%. , 1987, Neurotoxicology and teratology.

[14]  L. Hedges,et al.  Statistical Methods for Meta-Analysis , 1987 .

[15]  W. Holtzman Fundamental statistics in psychology and education. , 1951 .

[16]  Stephen Dubin How many subjects? Statistical power analysis in research , 1990 .

[17]  V. Benignus,et al.  The effects of low-level carbon monoxide exposure upon evoked cortical potentials in young and elderly men. , 1988, Neurotoxicology and teratology.

[18]  Richard Goldstein Vice President Power and Sample Size via MS/PC-DOS Computers , 1989 .

[19]  Scott E. Maxwell,et al.  Designing Experiments and Analyzing Data , 1992 .

[20]  Keith E. Muller,et al.  Approximate Power for Repeated-Measures ANOVA Lacking Sphericity , 1989 .

[21]  Keith E. Muller,et al.  Practical methods for computing power in testing the multivariate general linear hypothesis , 1984 .

[22]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[23]  Linda S. Franck,et al.  Integrating Research: A Guide for Literature Reviews (2nd ed.). By H. M. Cooper. 157 pp. Newbury Park, CA Sage Publications, 1989, $19.95 , 1990 .

[24]  Lawrence L. Kupper,et al.  How Appropriate are Popular Sample Size Formulas , 1989 .

[25]  R. Rosenthal Meta-analytic procedures for social research , 1984 .

[26]  V. Benignus,et al.  Compensatory tracking in humans with elevated carboxyhemoglobin. , 1990, Neurotoxicology and teratology.

[27]  David B. Pillemer,et al.  Summing Up: The Science of Reviewing Research , 1984 .

[28]  Helena Chmura Kraemer,et al.  How many subjects , 1989 .

[29]  W. J. Langford Statistical Methods , 1959, Nature.

[30]  R. Kirk Experimental Design: Procedures for the Behavioral Sciences , 1970 .