Prospective trial comparing full-field digital mammography (FFDM) versus combined FFDM and tomosynthesis in a population-based screening programme using independent double reading with arbitration

AbstractObjectivesTo compare double readings when interpreting full field digital mammography (2D) and tomosynthesis (3D) during mammographic screening.MethodsA prospective, Ethical Committee approved screening study is underway. During the first year 12,621 consenting women underwent both 2D and 3D imaging. Each examination was independently interpreted by four radiologists under four reading modes: Arm A—2D; Arm B—2D + CAD; Arm C—2D + 3D; Arm D—synthesised 2D + 3D. Examinations with a positive score by at least one reader were discussed at an arbitration meeting before a final management decision. Paired double reading of 2D (Arm A + B) and 2D + 3D (Arm C + D) were analysed. Performance measures were compared using generalised linear mixed models, accounting for inter-reader performance heterogeneity (P < 0.05).ResultsPre-arbitration false-positive scores were 10.3 % (1,286/12,501) and 8.5 % (1,057/12,501) for 2D and 2D + 3D, respectively (P < 0.001). Recall rates were 2.9 % (365/12,621) and 3.7 % (463/12,621), respectively (P = 0.005). Cancer detection was 7.1 (90/12,621) and 9.4 (119/12,621) per 1,000 examinations, respectively (30 % increase, P < 0.001); positive predictive values (detected cancer patients per 100 recalls) were 24.7 % and 25.5 %, respectively (P = 0.97). Using 2D + 3D, double-reading radiologists detected 27 additional invasive cancers (P < 0.001).ConclusionDouble reading of 2D + 3D significantly improves the cancer detection rate in mammography screening.Key Points• Tomosynthesis-based screening was successfully implemented in a large prospective screening trial. • Double reading of tomosynthesis-based examinations significantly reduced false-positive interpretations. • Double reading of tomosynthesis significantly increased the detection of invasive cancers.

[1]  David Gur,et al.  Localized detection and classification of abnormalities on FFDM and tomosynthesis examinations rated under an FROC paradigm. , 2011, AJR. American journal of roentgenology.

[2]  Breast screening: the use of consensus opinion for all recalls. , 2004, Breast.

[3]  R. Edward Hendrick,et al.  Performance comparison of single-view digital breast tomosynthesis plus single-view digital mammography with two-view digital mammography , 2013, European Radiology.

[4]  A. Verbeek,et al.  A remarkable reduction of breast cancer deaths in screened versus unscreened women: a case-referent study , 2010, Cancer Causes & Control.

[5]  Unni Haakenaasen,et al.  Digital breast tomosynthesis (DBT): initial experience in a clinical setting , 2012, Acta radiologica.

[6]  Tao Wu,et al.  A new generation FFDM/tomosynthesis fusion system with selenium detector , 2010, Medical Imaging.

[7]  L. Tabár,et al.  Swedish two-county trial: impact of mammographic screening on breast cancer mortality during 3 decades. , 2011, Radiology.

[8]  P. Skaane,et al.  Observer variability in screen-film mammography versus full-field digital mammography with soft-copy reading , 2008, European Radiology.

[9]  Per Skaane,et al.  Screen-film mammography versus full-field digital mammography with soft-copy reading: randomized trial in a population-based screening program--the Oslo II Study. , 2004, Radiology.

[10]  David Gur,et al.  Dose reduction in digital breast tomosynthesis (DBT) screening using synthetically reconstructed projection images: an observer performance study. , 2012, Academic radiology.

[11]  Per Skaane,et al.  Screening-detected breast cancers: discordant independent double reading in a population-based screening program. , 2009, Radiology.

[12]  S. Moss,et al.  Effectiveness and cost-effectiveness of double reading of mammograms in breast cancer screening: findings of a systematic review. , 2001, Breast.

[13]  David Gur,et al.  Digital breast tomosynthesis: observer performance study. , 2009, AJR. American journal of roentgenology.

[14]  T M Svahn,et al.  Breast tomosynthesis and digital mammography: a comparison of diagnostic accuracy. , 2012, The British journal of radiology.

[15]  Melanie Pinet,et al.  Increase in cancer detection and recall rates with independent double interpretation of screening mammography. , 2003, AJR. American journal of roentgenology.

[16]  S. Ciatto,et al.  Second reading of screening mammograms increases cancer detection and recall rates. Results in the Florence screening programme , 2005, Journal of medical screening.

[17]  S. Ciatto,et al.  The role of arbitration of discordant reports at double reading of screening mammograms , 2005, Journal of medical screening.

[18]  Alan C. Evans,et al.  The pathological and radiological features of screen-detected breast cancers diagnosed following arbitration of discordant double reading opinions. , 2005, Clinical radiology.

[19]  C E Metz,et al.  Gains in Accuracy from Replicated Readings of Diagnostic Images , 1992, Medical decision making : an international journal of the Society for Medical Decision Making.

[20]  Paolo Peterlongo,et al.  Prospective study of breast tomosynthesis as a triage to assessment in screening , 2012, Breast Cancer Research and Treatment.

[21]  Gisella Gennaro,et al.  Digital breast tomosynthesis versus digital mammography: a clinical performance study , 2010, European Radiology.

[22]  I Andersson,et al.  The diagnostic accuracy of dual-view digital mammography, single-view breast tomosynthesis and a dual-view combination of breast tomosynthesis and digital mammography in a free-response observer performance study. , 2010, Radiation protection dosimetry.

[23]  Eugenio Paci,et al.  Summary of the Evidence of Breast Cancer Service Screening Outcomes in Europe and First Estimate of the Benefit and Harm Balance Sheet , 2012, Journal of medical screening.

[24]  P E Shile,et al.  Variability in the interpretation of screening mammograms by US radiologists. , 1996, Academic radiology.

[25]  H. Fenlon,et al.  Consensus review of discordant findings maximizes cancer detection rate in double-reader screening mammography: Irish National Breast Screening Program experience. , 2009, Radiology.

[26]  Craig A. Beam,et al.  Variability in the interpretation of screening mammograms by US radiologists. Findings from a national sample. , 1996, Archives of internal medicine.

[27]  Federica Zanca,et al.  Two-view and single-view tomosynthesis versus full-field digital mammography: high-resolution X-ray imaging observer study. , 2012, Radiology.

[28]  Andriy I. Bandos,et al.  Comparison of digital mammography alone and digital mammography plus tomosynthesis in a population-based screening program. , 2013, Radiology.

[29]  Kenneth G. A. Gilhuijs,et al.  Breast tomosynthesis in clinical practice: initial results , 2009, European Radiology.

[30]  David Gur,et al.  Time to diagnosis and performance levels during repeat interpretations of digital breast tomosynthesis: preliminary observations. , 2010, Academic radiology.

[31]  C P Lawinski,et al.  A comparison of the accuracy of film-screen mammography, full-field digital mammography, and digital breast tomosynthesis. , 2012, Clinical radiology.

[32]  Anders Tingberg,et al.  Breast tomosynthesis and digital mammography: a comparison of breast cancer visibility and BIRADS classification in a population of cancers with subtle mammographic findings , 2008, European Radiology.

[33]  Tor D Tosteson,et al.  Digital breast tomosynthesis: initial experience in 98 women with abnormal digital screening mammography. , 2007, AJR. American journal of roentgenology.

[34]  Per Skaane,et al.  Population-based mammography screening: comparison of screen-film and full-field digital mammography with soft-copy reading--Oslo I study. , 2003, Radiology.

[35]  EUROSCREENWorking Group,et al.  Summary of the evidence of breast cancer service screening outcomes in Europe and first estimate of the benefit and harm balance sheet , 2012 .

[36]  Harry J de Koning,et al.  Independent double reading of screening mammograms in The Netherlands: effect of arbitration following reader disagreements. , 2004, Radiology.

[37]  Peilin Wu,et al.  Preliminary Observations , 1830, The Medico-chirurgical review.

[38]  D Gur,et al.  Observer variation and the performance accuracy gained by averaging ratings of abnormality. , 2000, Medical physics.