Association of volume and volume-independent factors with accuracy in screening mammogram interpretation.

BACKGROUND Early detection of breast cancer is associated with the accurate reading of screening mammograms, but factors that influence reading accuracy are not well understood. We thus investigated whether reading volume and other factors were independently associated with accuracy in reading screening mammograms in a population of U.S. radiologists. METHODS A random selection of 110 of 292 radiologists who agreed to participate, if selected, interpreted screening mammograms from 148 randomly selected women. Original index mammograms (i.e., mediolateral oblique and craniocaudal views of each breast) were used; comparison original mammograms were provided when available. Radiologist-level and facility-level factors were surveyed. Two standard metrics of screening accuracy, both based on receiver operating characteristic curves, were analyzed. The influence of volume on accuracy after controlling for other factors was assessed with multiple regression analysis. RESULTS Current reading volume was not statistically significantly associated with interpretive accuracy. More recently trained radiologists interpreted mammograms more accurately than those trained earlier (-0.76% [95% confidence interval (CI) = -1.75% to -0.02%] reduction in sensitivity per year since residency). Facility-level factors that were statistically significantly and independently associated with better accuracy were the number of diagnostic breast imaging examinations and image-guided breast interventional procedures performed (0.55% [95% CI = 0.11% to 2.40%] increase in accuracy per examination or procedure offered), being classified as a comprehensive breast diagnostic and/or screening center or freestanding mammography center (1.39% [95% CI = 0.15% to 3.82%] higher than a hospital radiology department or multispecialty medical clinic), and being a facility that practiced double reading (1.61% [95% CI = 1.99% to 11.65%]) higher than in a facility without such practice). CONCLUSIONS Individual radiologists' current reading volume was not statistically significantly associated with accuracy in reading screening mammograms, but several other factors were. Expertise reflects a complex multifactorial process that needs further clarification.

[1]  W Zucchini,et al.  On the statistical analysis of ROC curves. , 1989, Statistics in medicine.

[2]  C. McArdle,et al.  Impact of variability among surgeons on postoperative morbidity and mortality and ultimate survival. , 1991, BMJ.

[3]  E. Conant,et al.  How experience and training influence mammography expertise. , 1999, Academic radiology.

[4]  H L Kundel,et al.  Nature of expertise in searching mammograms for breast masses , 1996, Medical Imaging.

[5]  J. Birkmeyer,et al.  Hospital volume and surgical mortality in the United States. , 2002, The New England journal of medicine.

[6]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[7]  C. Beam,et al.  Variability in the interpretation of screening mammograms by US radiologists. Findings from a national sample. , 1996, Archives of internal medicine.

[8]  C. Metz ROC Methodology in Radiologic Imaging , 1986, Investigative radiology.

[9]  S. Edge,et al.  Provider case volume and outcome in the evaluation and treatment of patients with mammogram‐detected breast carcinoma , 2002, Cancer.

[10]  A R Feinstein,et al.  Context bias. A problem in diagnostic radiology. , 1996, JAMA.

[11]  Mitchell H. Gail,et al.  A family of nonparametric statistics for comparing diagnostic markers with paired or unpaired data , 1989 .

[12]  Helen C. Cowley,et al.  Improving the accuracy of mammography: volume and outcome relationships. , 2002, Journal of the National Cancer Institute.

[13]  A. Pollock,et al.  Breast cancer management: is volume related to quality? Clinical Advisory Panel. , 1997, British Journal of Cancer.

[14]  C. Round,et al.  Influence of clinician workload and patterns of treatment on survival from breast cancer , 1995, The Lancet.

[15]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[16]  Dev P. Chakraborty,et al.  Medical Imaging 2002: Image Perception, Observer Performance, and Technology Assessment , 2002 .

[17]  D. Mark,et al.  Patient and hospital characteristics related to in-hospital mortality after lung cancer resection. , 1992, Chest.

[18]  Craig A. Beam,et al.  Variability in the interpretation of screening mammograms by US radiologists. Findings from a national sample. , 1996, Archives of internal medicine.

[19]  H S Luft,et al.  Effects of Surgeon Volume and Hospital Volume on Quality of Care in Hospitals , 1987, Medical care.

[20]  Andrew D. A. Maidment,et al.  Comparison of receiver operating characteristic curves on the basis of optimal operating points. , 1996, Academic radiology.

[21]  C. Gillis,et al.  Survival outcome of care by specialist surgeons in breast cancer: a study of 3786 patients in the west of Scotland , 1996, BMJ.

[22]  D. McClish Analyzing a Portion of the ROC Curve , 1989, Medical decision making : an international journal of the Society for Medical Decision Making.

[23]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[24]  Craig A. Beam,et al.  Two-stage ROC curve regression model when sampling a population of diagnosticians , 2002, SPIE Medical Imaging.

[25]  E A Sickles,et al.  Standardized abnormal interpretation and cancer detection ratios to assess reading volume and reader performance in a breast screening program. , 2000, Radiology.

[26]  Craig A. Beam,et al.  Reader strategies: variability and error- methodology, findings, and health policy implications from a study of the U.S. population of mammographers , 2002, SPIE Medical Imaging.

[27]  C. Begg,et al.  Impact of hospital volume on operative mortality for major cancer surgery. , 1998, JAMA.

[28]  C. Metz,et al.  A receiver operating characteristic partial area index for highly sensitive diagnostic tests. , 1996, Radiology.

[29]  Colin B Begg,et al.  Variations in morbidity after radical prostatectomy. , 2002, The New England journal of medicine.

[30]  J A Hanley,et al.  The use of the 'binormal' model for parametric ROC analysis of quantitative diagnostic tests. , 1996, Statistics in medicine.

[31]  J. Elmore,et al.  Does diagnostic accuracy in mammography depend on radiologists' experience? , 1998, Journal of women's health.