Kappa statistic for clustered matched‐pair data

Kappa statistic is widely used to assess the agreement between two procedures in the independent matched-pair data. For matched-pair data collected in clusters, on the basis of the delta method and sampling techniques, we propose a nonparametric variance estimator for the kappa statistic without within-cluster correlation structure or distributional assumptions. The results of an extensive Monte Carlo simulation study demonstrate that the proposed kappa statistic provides consistent estimation and the proposed variance estimator behaves reasonably well for at least a moderately large number of clusters (e.g., K ≥50). Compared with the variance estimator ignoring dependence within a cluster, the proposed variance estimator performs better in maintaining the nominal coverage probability when the intra-cluster correlation is fair (ρ ≥0.3), with more pronounced improvement when ρ is further increased. To illustrate the practical application of the proposed estimator, we analyze two real data examples of clustered matched-pair data.

[1]  Matthijs J. Warrens,et al.  Inequalities between multi-rater kappas , 2010, Adv. Data Anal. Classif..

[2]  A. Feinstein,et al.  High agreement but low kappa: I. The problems of two paradoxes. , 1990, Journal of clinical epidemiology.

[3]  H J Schouten Estimating kappa from binocular data and comparing marginal probabilities. , 1993, Statistics in medicine.

[4]  J. Hardin,et al.  A note on the tests for clustered matched‐pair binary data , 2010, Biometrical journal. Biometrische Zeitschrift.

[5]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[6]  A. Agresti Categorical data analysis , 1993 .

[7]  D M Clarke,et al.  Comparing correlated kappas by resampling: is one level of agreement significantly different from another? , 1996, Journal of psychiatric research.

[8]  Albert Westergren,et al.  Statistical methods for assessing agreement for ordinal data. , 2005, Scandinavian journal of caring sciences.

[9]  M. Shoukri Measures of Interobserver Agreement and Reliability, Second Edition , 2010 .

[10]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[11]  Adelin Albert,et al.  Agreement between Two Independent Groups of Raters , 2009 .

[12]  A. Scott,et al.  A simple method for the analysis of clustered binary data. , 1992, Biometrics.

[13]  N A Obuchowski,et al.  On the comparison of correlated proportions for clustered data. , 1998, Statistics in medicine.

[14]  A Donner,et al.  Testing the equality of two dependent kappa statistics. , 2000, Statistics in medicine.

[15]  Neil Klar,et al.  An Estimating Equations Approach for Modelling Kappa , 2000 .

[16]  B. Everitt,et al.  Large sample standard errors of kappa and weighted kappa. , 1969 .

[17]  Wan Tang,et al.  Inference for kappas for longitudinal study data: applications to sexual health research. , 2008, Biometrics.

[18]  Adelin Albert,et al.  Agreement between an isolated rater and a group of raters , 2009 .

[19]  Ming Zhou,et al.  A note on the kappa statistic for clustered dichotomous data , 2014, Statistics in medicine.

[20]  H. Kundel,et al.  Measurement of observer agreement. , 2003, Radiology.

[21]  S D Walter,et al.  A reappraisal of the kappa coefficient. , 1988, Journal of clinical epidemiology.

[22]  H. Kraemer Ramifications of a population model forκ as a coefficient of reliability , 1979 .

[23]  The statistical analysis of matched data in psychiatric research , 1989, Psychiatry Research.

[24]  J. Hardin,et al.  Testing marginal homogeneity in clustered matched-pair data , 2011 .

[25]  Geert Molenberghs,et al.  Regression modelling of weighted κ by using generalized estimating equations , 2000 .

[26]  J. Hardin,et al.  Confidence intervals for the difference of marginal probabilities in clustered matched‐pair binary data , 2012, Pharmaceutical statistics.

[27]  Jianwen Cai,et al.  Kappa statistic for clustered dichotomous responses from physicians and patients , 2013, Statistics in medicine.

[28]  A. Donner,et al.  Adjusted inference procedures for the interobserver agreement in twin studies , 2016, Statistical methods in medical research.

[29]  N L Oden Estimating kappa from binocular data. , 1991, Statistics in medicine.

[30]  Huiman X Barnhart,et al.  Weighted Least‐Squares Approach for Comparing Correlated Kappa , 2002, Biometrics.

[31]  Jun-mo Nam,et al.  Homogeneity Score Test for the Intraclass Version of the Kappa Statistics and Sample‐Size Determination in Multiple or Stratified Studies , 2003, Biometrics.

[32]  M. Eliasziw,et al.  Testing the homogeneity of kappa statistics. , 1996, Biometrics.

[33]  A K Manatunga,et al.  Modeling kappa for measuring dependent categorical agreement data. , 2000, Biostatistics.

[34]  V M Chinchilli,et al.  A generalized concordance correlation coefficient for continuous and categorical data , 2001, Statistics in medicine.

[35]  W. Barlow,et al.  A comparison of methods for calculating a stratified kappa. , 1990, Statistics in medicine.

[36]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[37]  H. Kraemer,et al.  Extension of the kappa coefficient. , 1980, Biometrics.

[38]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[39]  A. J. Conger Integration and generalization of kappas for multiple raters. , 1980 .

[40]  Emmanuel Lesaffre,et al.  Hierarchical modeling of agreement , 2012, Statistics in medicine.

[41]  A. Albert,et al.  A bootstrap method for comparing correlated kappa coefficients , 2008 .