Observer Variability in BI-RADS Ultrasound Features and Its Influence on Computer-Aided Diagnosis of Breast Masses

Objective: Computer classification of sonographic BI-RADS features can aid differentiation of the malignant and benign masses. However, the variability in the diagnosis due to the differences in the observed features between the observations is not known. The goal of this study is to measure the variation in sonographic features between multiple observations and determine the effect of features variation on computer-aided diagnosis of the breast masses. Materials and Methods: Ultrasound images of biopsy proven solid breast masses were analyzed in three independent observations for BI-RADS sonographic features. The BI-RADS features from each observation were used with Bayes classifier to determine probability of malignancy. The observer agreement in the sonographic features was measured by kappa coefficient and the difference in the diagnostic performances between observations was determined by the area under the ROC curve, Az, and interclass correlation coefficient. Results: While some features were repeatedly observed, κ = 0.95, other showed a significant variation, κ = 0.16. For all features, combined intra-observer agreement was substantial, κ = 0.77. The agreement, however, decreased steadily to 0.66 and 0.56 as time between the observations increased from 1 to 2 and 3 months, respectively. Despite the variation in features between observations the probabilities of malignancy estimates from Bayes classifier were robust and consistently yielded same level of diagnostic performance, Az was 0.772 – 0.817 for sonographic features alone and 0.828 – 0.849 for sonographic features and age combined. The difference in the performance, ΔAz, between the observations for the two groups was small (0.003 – 0.044) and was not statistically significant (p < 0.05). Interclass correlation coefficient for the observations was 0.822 (CI: 0.787 – 0.853) for BI-RADS sonographic features alone and for those combined with age was 0.833 (CI: 0.800 – 0.862). Conclusion: Despite the differences in the BI- RADS sonographic features between different observations, the diagnostic performance of computer-aided analysis for differentiating breast masses did not change. Through continual retraining, the computer-aided analysis provides consistent diagnostic performance independent of the variations in the observed sonographic features.

[1]  Jeon-Hor Chen,et al.  Quantitative Ultrasound Analysis for Classification of BI-RADS Category 3 Breast Masses , 2013, Journal of Digital Imaging.

[2]  Mergen G. Doraev The "Memory Effect , 2015 .

[3]  D. Vanel The American College of Radiology (ACR) Breast Imaging and Reporting Data System (BI-RADS): a step towards a universal radiological language? , 2007, European journal of radiology.

[4]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[5]  M. Giger,et al.  Improving breast cancer diagnosis with computer-aided diagnosis. , 1999, Academic radiology.

[6]  D. Kopans The positive predictive value of mammography. , 1992, AJR. American journal of roentgenology.

[7]  Ellen Kao,et al.  Breast imaging reporting and data system lexicon for US: interobserver agreement for assessment of breast masses. , 2009, Radiology.

[8]  W C A Pereira,et al.  Intraobserver interpretation of breast ultrasonography following the BI-RADS classification. , 2010, European journal of radiology.

[9]  Alyssa Cwanger,et al.  Bayesian Probability of Malignancy With BI‐RADS Sonographic Features , 2014, Journal of ultrasound in medicine : official journal of the American Institute of Ultrasound in Medicine.

[10]  Tamara Miner Haygood,et al.  The "memory effect" for repeated radiologic observations. , 2011, AJR. American journal of roentgenology.

[11]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[12]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[13]  Ruey-Feng Chang,et al.  Breast ultrasound computer-aided diagnosis using BI-RADS features. , 2007, Academic radiology.

[14]  R. Chang,et al.  Computer aided classification system for breast ultrasound based on Breast Imaging Reporting and Data System (BI-RADS). , 2007, Ultrasound in medicine & biology.

[15]  Jeon-Hor Chen,et al.  Computer-aided classification of breast masses using speckle features of automated breast ultrasound images. , 2012, Medical physics.

[16]  A. Stavros,et al.  Solid breast nodules: use of sonography to distinguish between benign and malignant lesions. , 1995, Radiology.

[17]  Ki Keun Oh,et al.  Observer variability of Breast Imaging Reporting and Data System (BI-RADS) for breast ultrasound. , 2008, European journal of radiology.

[18]  Sung Hun Kim,et al.  Observer Agreement Using the ACR Breast Imaging Reporting and Data System (BI-RADS)-Ultrasound, First Edition (2003) , 2007, Korean journal of radiology.

[19]  L. Liberman,et al.  Breast imaging reporting and data system (BI-RADS). , 2002, Radiologic clinics of North America.

[20]  Theodore W. Cary,et al.  Comparison of naïve Bayes and logistic regression for computer-aided diagnosis of breast masses using ultrasound imaging , 2012, Medical Imaging.