Confidence intervals for the interrater agreement measure kappa

The asymptotic normal approximation to the distribution of the estimated measure $\hat{\kappa}$ for evaluating agreement between two raters has been shown to perform poorly for small sample sizes when the true kappa is nonzero. This paper examines the effect of skewness corrections and transformations of $\hat{\kappa}$ on the attained confidence levels. Small-sample simulations demonstrate the improved agreement between the nominal and actual levels of confidence intervals and hypothesis tests that incorporate these corrections.
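For context, the sketch below illustrates the baseline procedure the abstract refers to: Cohen's $\hat{\kappa}$ for a two-rater contingency table together with the standard large-sample Wald interval, followed by a small Monte Carlo check of its coverage at a modest sample size. This is a minimal illustration assuming Cohen's (1960) simple approximate standard error; it reproduces the *uncorrected* interval whose small-sample behavior the paper criticizes, not the skewness-corrected or transformed intervals the paper proposes. All function names, the 2x2 parameterization, and the simulation settings are illustrative choices, not taken from the paper.

```python
import numpy as np
from scipy import stats

def kappa_wald_ci(table, alpha=0.05):
    """Cohen's kappa-hat and a simple large-sample Wald interval.

    Uses Cohen's (1960) approximate standard error; this is the
    uncorrected asymptotic-normal interval, not a skewness-corrected one.
    """
    p = np.asarray(table, dtype=float)
    n = p.sum()
    p /= n
    po = np.trace(p)                    # observed agreement
    pe = p.sum(axis=1) @ p.sum(axis=0)  # chance-expected agreement
    kappa = (po - pe) / (1.0 - pe)
    se = np.sqrt(po * (1.0 - po) / (n * (1.0 - pe) ** 2))
    z = stats.norm.ppf(1.0 - alpha / 2.0)
    return kappa, (kappa - z * se, kappa + z * se)

def two_by_two_probs(kappa, pi=0.5):
    """2x2 cell probabilities with equal margins pi and true kappa."""
    agree = kappa * pi * (1.0 - pi)
    off = (1.0 - kappa) * pi * (1.0 - pi)
    return np.array([[pi ** 2 + agree, off],
                     [off, (1.0 - pi) ** 2 + agree]])

# Monte Carlo check of the Wald interval's coverage at n = 25
rng = np.random.default_rng(12345)
true_kappa, n, reps = 0.6, 25, 5000
probs = two_by_two_probs(true_kappa).ravel()
covered = skipped = 0
for _ in range(reps):
    counts = rng.multinomial(n, probs).reshape(2, 2)
    # skip degenerate tables where a rater uses only one category
    if 0 in counts.sum(axis=0) or 0 in counts.sum(axis=1):
        skipped += 1
        continue
    _, (lo, hi) = kappa_wald_ci(counts)
    covered += lo <= true_kappa <= hi
print(f"empirical coverage: {covered / (reps - skipped):.3f} (nominal 0.95)")
```

Under this setup, the empirical coverage printed at the end can be compared against the nominal 95% level; the gap between the two at small n and nonzero true kappa is exactly the discrepancy the paper's skewness corrections and transformations are designed to shrink.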