论文信息 - Observer Variability: A New Approach in Evaluating Interobserver Agreement

Observer Variability: A New Approach in Evaluating Interobserver Agreement

Existing indices of observer agreement for continuous data, such as the intraclass correlation coefficient or the concordance correlation coefficient, measure the total observer-related variability, which includes the variabilities between and within observers. This work introduces a new index that measures the interobserver variability, which is defined in terms of the distances among the 'true values' assigned by different observers on the same subject. The new coefficient of interobserver variability (CIV) is defined as the ratio of the interobserver and the total observer variability. We show how to estimate the CIV and how to use bootstrap and ANOVAbased methods for inference. We also develop a coefficient of excess observer variability, which compares the total observer variability to the expected total observer variability when there are no differences among the observers. This coefficient is a simple function of the CIV. In addition, we show how the value of the CIV, estimated from an agreement study, can be used in the design of measurements studies. We illustrate the new concepts and methods by two examples, where (1) two radiologists used calcium scores to evaluate the severity of coronary artery arteriosclerosis, and (2) two methods were used to measure knee joint angle.

Huiman X. Barnhart | Jingli Song | Michael Haber | James Gruden

[1] Huiman X Barnhart,et al. Overall Concordance Correlation Coefficient for Evaluating Agreement Among Multiple Observers , 2002, Biometrics.

[2] Huiman X Barnhart,et al. Assessing intra, inter and total agreement with replicated readings , 2005, Statistics in medicine.

[3] M. Kendall. Statistical Methods for Research Workers , 1937, Nature.

[4] L. Lin,et al. A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.

[5] K. McGraw,et al. Forming inferences about some intraclass correlation coefficients. , 1996 .

[6] M. Eliasziw,et al. Statistical methodology for the concurrent assessment of interrater and intrarater reliability: using goniometric measurements as an example. , 1994, Physical therapy.

[7] J. Bartko. The Intraclass Correlation Coefficient as a Measure of Reliability , 1966, Psychological reports.