Visual assessment of breast density using Visual Analogue Scales: observer variability, reader attributes and reading time

Breast density is a strong risk factor for breast cancer and has potential use in breast cancer risk prediction, with subjective methods of density assessment showing a strong relationship with the development of breast cancer. This study aims to assess intra- and inter-observer variability in visual density assessment recorded on Visual Analogue Scales (VAS) among trained readers, and to examine whether reader age, gender and experience are associated with assessed density. Eleven readers estimated the breast density of 120 mammograms on two occasions 3 years apart using VAS. Intra- and inter-observer agreement was assessed with the Intraclass Correlation Coefficient (ICC), and variation between readers was visualised on Bland-Altman plots. The mean score across all mammograms for each reader was used to analyse the effect of reader attributes on assessed density. Excellent intra-observer agreement (ICC>0.80) was found for the majority of readers. All but one reader had a mean difference of <10 percentage points between the first and second readings. Inter-observer agreement was excellent for consistency (ICC 0.82) and substantial for absolute agreement (ICC 0.69). However, the 95% limits of agreement for pairwise differences between readers ranged from -6.8 to 15.7 at the narrowest to 0.8 to 62.3 at the widest. No significant association was found between assessed density and reader age, experience, gender or reading time. Overall, the readers were consistent in their scores, although some large variations between readers were observed. Reader evaluation and targeted training may help to reduce this variation.
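The gap between the consistency ICC and the absolute-agreement ICC reported above can be illustrated with a minimal sketch. It assumes single-measure, two-way ICCs and the conventional mean difference ± 1.96 SD for the 95% limits of agreement; the abstract does not state which ICC form or software the study used, and the function names and scores below are hypothetical.

```python
from statistics import mean, stdev


def icc_two_way(scores):
    """Single-measure two-way ICCs (assumed form, not confirmed by the study).

    scores: list of rows, one row per mammogram, one VAS score per reader.
    Returns (consistency ICC, absolute-agreement ICC).
    """
    n = len(scores)       # number of mammograms
    k = len(scores[0])    # number of readers
    grand = mean(v for row in scores for v in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]

    # Mean squares from a two-way ANOVA without replication.
    msr = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    msc = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    sse = sum(
        (scores[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n)
        for j in range(k)
    )
    mse = sse / ((n - 1) * (k - 1))

    # Consistency ignores systematic offsets between readers;
    # absolute agreement penalises them through the reader mean square.
    consistency = (msr - mse) / (msr + (k - 1) * mse)
    absolute = (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
    return consistency, absolute


def limits_of_agreement(a, b):
    """Bland-Altman 95% limits of agreement for paired scores a and b."""
    diffs = [x - y for x, y in zip(a, b)]
    d, s = mean(diffs), stdev(diffs)
    return d - 1.96 * s, d + 1.96 * s


# Hypothetical VAS scores: reader B ranks the mammograms exactly like
# reader A but scores every one 15 percentage points higher.
reader_a = [10, 20, 30, 40, 50]
reader_b = [score + 15 for score in reader_a]

icc_c, icc_a = icc_two_way([list(pair) for pair in zip(reader_a, reader_b)])
lower, upper = limits_of_agreement(reader_a, reader_b)
print(f"consistency ICC = {icc_c:.2f}, absolute ICC = {icc_a:.2f}")
print(f"95% limits of agreement: {lower:.1f} to {upper:.1f}")
```

In this toy case the two readers order the mammograms identically, so the consistency ICC is perfect, while the constant offset lowers the absolute-agreement ICC and shifts the limits of agreement away from zero. This is the same pattern, in exaggerated form, as a reader who is internally consistent but systematically scores higher or lower than colleagues.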
