论文信息 - General observer-agreement measures on individual subjects and groups of subjects

General observer-agreement measures on individual subjects and groups of subjects

General chance-corrected measures of agreement on individual subjects, for several observers using nominal or ordinal categories, are developed. The subject-specific measures can be used to identify subjects whom the observers find difficult to rate. The relationship of the subject-specific measures to a general chance-corrected measure of agreement for a group of subjects is demonstrated. By suitable choices of disagreement functions, the measure of agreement for a group of subjects is shown to include, as special cases, many of the kappa-like statistics. Also, it is asymptotically equivalent to various intraclass correlation coefficients. The measures do not require that the observers all use the classification scale in the same way. The asymptotic null and non-null variances obtained by Taylorseries approximations for the statistics are presented. The application of the measures is illustrated by data obtained when seven pathologists classified slides on a five-point ordinal scale for the diagnosis of carcinoma in situ of the uterine cervix.

Annette J. Dobson | A. Dobson | D. O’Connell | Dianne L. O'Connell

[1] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .

[2] Jacob Cohen,et al. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[3] N D Holmquist,et al. Variability in classification of carcinoma in situ of the uterine cervix. , 1967, Archives of pathology.

[4] J. Fleiss,et al. Measuring Agreement for Multinomial Data , 1982 .

[5] B. Everitt,et al. Large sample standard errors of kappa and weighted kappa. , 1969 .

[6] J. R. Landis,et al. An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. , 1977, Biometrics.

[7] H. Kraemer,et al. Extension of the kappa coefficient. , 1980, Biometrics.

[8] J. Fleiss. Measuring nominal scale agreement among many raters. , 1971 .

[9] R. Light. Measures of response agreement for qualitative data: Some generalizations and alternatives. , 1971 .

[10] H. Schouten,et al. Measuring pairwise interobserver agreement when all subjects are judged by the same observers , 1982 .

[11] J. R. Landis,et al. A one-way components of variance model for categorical data , 1977 .