Mammographic density: Comparison of visual assessment with fully automatic calculation on a multivendor dataset

AbstractObjectivesTo compare breast density (BD) assessment provided by an automated BD evaluator (ABDE) with that provided by a panel of experienced breast radiologists, on a multivendor dataset.MethodsTwenty-one radiologists assessed 613 screening/diagnostic digital mammograms from nine centers and six different vendors, using the BI-RADS a, b, c, and d density classification. The same mammograms were also evaluated by an ABDE providing the ratio between fibroglandular and total breast area on a continuous scale and, automatically, the BI-RADS score. A panel majority report (PMR) was used as reference standard. Agreement (κ) and accuracy (proportion of cases correctly classified) were calculated for binary (BI-RADS a-b versus c-d) and 4-class classification.ResultsWhile the agreement of individual radiologists with the PMR ranged from κ = 0.483 to κ = 0.885, the ABDE correctly classified 563/613 mammograms (92 %). A substantial agreement for binary classification was found for individual reader pairs (κ = 0.620, standard deviation [SD] = 0.140), individual versus PMR (κ = 0.736, SD = 0.117), and individual versus ABDE (κ = 0.674, SD = 0.095). Agreement between ABDE and PMR was almost perfect (κ = 0.831).ConclusionsThe ABDE showed an almost perfect agreement with a 21-radiologist panel in binary BD classification on a multivendor dataset, earning a chance as a reproducible alternative to visual evaluation.Key Points• Individual BD assessment differs from PMR with κ as low as 0.483. • An ABDE correctly classified 92 % of mammograms with almost perfect agreement (κ = 0.831). • An ABDE can be a valid alternative to subjective BD assessment.

[1]  H. Kundel,et al.  Measurement of observer agreement. , 2003, Radiology.

[2]  Markus Hahn,et al.  Early detection of breast cancer: benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue. A systematic review , 2009, BMC Cancer.

[3]  N Houssami,et al.  Estimation of percentage breast tissue density: comparison between digital mammography (2D full field digital mammography) and digital breast tomosynthesis according to different BI-RADS categories. , 2013, The British journal of radiology.

[4]  S. Ciatto,et al.  Interobserver agreement in breast radiological density attribution according to BI-RADS quantitative classification , 2012, La radiologia medica.

[5]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[6]  S. Ciatto,et al.  Integration of 3D digital mammography with tomosynthesis for population breast-cancer screening (STORM): a prospective comparison study. , 2013, The Lancet. Oncology.

[7]  R. J. Brenner Using Mammographic Density to Improve Breast Cancer Screening Outcomes , 2009 .

[8]  A. Beckett,et al.  AKUFO AND IBARAPA. , 1965, Lancet.

[9]  D. Vanel The American College of Radiology (ACR) Breast Imaging and Reporting Data System (BI-RADS): a step towards a universal radiological language? , 2007, European journal of radiology.

[10]  X. Castells,et al.  Inter- and intraradiologist variability in the BI-RADS assessment and breast density categories for screening mammograms. , 2012, The British journal of radiology.

[11]  J. Youk,et al.  Radiologist assessment of breast density by BI-RADS categories versus fully automated volumetric assessment. , 2013, AJR. American journal of roentgenology.

[12]  J M Bland,et al.  Statistical methods for assessing agreement between two methods of clinical measurement , 1986 .

[13]  Emily White,et al.  Factors contributing to mammography failure in women aged 40-49 years. , 2004, Journal of the National Cancer Institute.

[14]  Carolyn Nickson,et al.  A microsimulation model of the BreastScreen Australia program , 2009 .

[15]  Manuela Durando,et al.  A first evaluation of breast radiological density assessment by QUANTRA software as compared to visual classification. , 2012, Breast.

[16]  Anne-Marie Schott,et al.  Mammographic density is not a worthwhile examination to distinguish high cancer risk women in screening , 2014, European Radiology.

[17]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[18]  T. M. Kolb,et al.  Comparison of the performance of screening mammography, physical examination, and breast US and evaluation of factors that influence them: an analysis of 27,825 patient evaluations. , 2002, Radiology.

[19]  Anne M Kavanagh,et al.  Using Mammographic Density to Improve Breast Cancer Screening Outcomes , 2008, Cancer Epidemiology Biomarkers & Prevention.

[20]  Andriy I. Bandos,et al.  Comparison of digital mammography alone and digital mammography plus tomosynthesis in a population-based screening program. , 2013, Radiology.

[21]  Markus Hahn,et al.  ' s response to reviews Title : Early detection of breast cancer : Benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue , 2009 .

[22]  Giske Ursin,et al.  Mammographic density - a useful biomarker for breast cancer risk in epidemiologic studies , 2009 .