Methodological Commentary

The Precision of Reliability and Validity Estimates Re-Visited: Distinguishing Between Clinical and Statistical Significance of Sample Size Requirements

In a previous JCEN Methodological Commentary (Cicchetti, 1999), I proposed very specific and cogent arguments to question the clinical meaningfulness of Charter's (1999) recommendation, to the wider community of clinical and experimental neuropsychologists, of a minimum of 400 subjects for determining precise split-half, coefficient alpha, test-retest, alternate forms, and inter-examiner reliability assessment procedures, and validity coefficients. To refresh the reader's memory, and using Charter's own example, I concluded unabashedly that increasing the sample size N from 50 to 300 (a six-fold increase) was simply not worth the considerable added cost and time merely to 'increase' a lower-bound precision reliability estimate from .82 to .87, while simultaneously 'decreasing' the upper-bound reliability estimate from .94 to .92. I stand firmly behind that statement, and would add that Charter's further recommendation of an N of '400 or more', in both his earlier and current Methodological Commentary, strains credulity even more.

Unfortunately, Charter's reply, or rebuttal, to my comments uses precisely the same arguments as before, and the interested reader is referred, once again, to my earlier critique of his work (Cicchetti, 1999). The present critique will stress the inappropriateness of considering precision solely in the context of increasing N, or of using sample sizes of 400 and more, as appears to be Charter's main objective or desideratum. This will be discussed in the broader context of both the necessity of considering the practical or clinical meaningfulness of precision estimates and the underlying rationale for calculating confidence intervals (CIs) around these estimates in the first place. Other, less critical issues will also be raised, as required.
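To make the precision argument concrete, the following is a minimal sketch, in Python, of how the width of a confidence interval around a reliability coefficient changes with N. It assumes a hypothetical point estimate of .90 (not stated above) and uses the familiar Fisher r-to-z approximation; Charter's own computations may rest on a different interval method, so the resulting bounds (roughly .83-.94 at N = 50 and .88-.92 at N = 300) are close to, but not identical with, the figures quoted above.

import math
from statistics import NormalDist

def fisher_ci(r, n, level=0.95):
    # Approximate two-sided CI for a correlation-type reliability coefficient,
    # using the Fisher r-to-z transformation (an assumption; not necessarily
    # the method Charter used).
    z = math.atanh(r)                    # transform to an approximately normal scale
    se = 1.0 / math.sqrt(n - 3)          # standard error of z
    crit = NormalDist().inv_cdf(0.5 + level / 2)
    return math.tanh(z - crit * se), math.tanh(z + crit * se)

# Hypothetical point estimate of .90; compare interval width at N = 50 vs. N = 300.
for n in (50, 300):
    lo, hi = fisher_ci(0.90, n)
    print(f"N = {n:3d}: 95% CI = ({lo:.2f}, {hi:.2f}), width = {hi - lo:.2f}")

Even under this rough approximation, the six-fold increase in N narrows the interval by less than a tenth of a point on the reliability scale, which is precisely the clinical-versus-statistical significance issue at stake in the argument above.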