Development of an online, publicly accessible naive Bayesian decision support tool for mammographic mass lesions based on the American College of Radiology (ACR) BI-RADS lexicon

AbstractPurposeTo develop and validate a decision support tool for mammographic mass lesions based on a standardized descriptor terminology (BI-RADS lexicon) to reduce variability of practice.Materials and methodsWe used separate training data (1,276 lesions, 138 malignant) and validation data (1,177 lesions, 175 malignant). We created naïve Bayes (NB) classifiers from the training data with tenfold cross-validation. Our “inclusive model” comprised BI-RADS categories, BI-RADS descriptors, and age as predictive variables; our “descriptor model” comprised BI-RADS descriptors and age. The resulting NB classifiers were applied to the validation data. We evaluated and compared classifier performance with ROC-analysis.ResultsIn the training data, the inclusive model yields an AUC of 0.959; the descriptor model yields an AUC of 0.910 (P < 0.001). The inclusive model is superior to the clinical performance (BI-RADS categories alone, P < 0.001); the descriptor model performs similarly. When applied to the validation data, the inclusive model yields an AUC of 0.935; the descriptor model yields an AUC of 0.876 (P < 0.001). Again, the inclusive model is superior to the clinical performance (P < 0.001); the descriptor model performs similarly.ConclusionWe consider our classifier a step towards a more uniform interpretation of combinations of BI-RADS descriptors. We provide our classifier at www.ebm-radiology.com/nbmm/index.html.Key Points• We provide a decision support tool for mammographic masses atwww.ebm-radiology.com/nbmm/index.html. • Our tool may reduce variability of practice in BI-RADS category assignment. • A formal analysis of BI-RADS descriptors may enhance radiologists’ diagnostic performance.

[1]  T. Forshaw Everything you always wanted to know , 1977 .

[2]  A. Feinstein,et al.  Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. , 1978, The New England journal of medicine.

[3]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[4]  Eugene Charniak,et al.  Bayesian Networks without Tears , 1991, AI Mag..

[5]  M. Slattery,et al.  A comprehensive evaluation of family history and breast cancer risk. The Utah Population Database. , 1993, JAMA.

[6]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[7]  C. Floyd,et al.  Breast cancer: prediction with artificial neural network based on BI-RADS standardized lexicon. , 1995, Radiology.

[8]  Pedro M. Domingos,et al.  Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier , 1996, ICML.

[9]  C. Floyd,et al.  Breast imaging reporting and data system standardized mammography lexicon: observer variability in lesion description. , 1996, AJR. American journal of roentgenology.

[10]  M. Nadel,et al.  Coding mammograms using the classification "probably benign finding--short interval follow-up suggested". , 1999, AJR. American journal of roentgenology.

[11]  P. Langenberg,et al.  Breast Imaging Reporting and Data System: inter- and intraobserver variability in feature analysis and final assessment. , 2000, AJR. American journal of roentgenology.

[12]  Bianca Zadrozny,et al.  Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers , 2001, ICML.

[13]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[14]  L. Liberman,et al.  Breast imaging reporting and data system (BI-RADS). , 2002, Radiologic clinics of North America.

[15]  Rebecca S Lewis,et al.  Does training in the Breast Imaging Reporting and Data System (BI-RADS) improve biopsy recommendations or feature analysis agreement with experienced breast imagers at mammography? , 2002, Radiology.

[16]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[17]  M.K. Markey,et al.  Bayesian networks of BI-RADS/spl trade/ descriptors for breast lesion classification , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[18]  P. Bossuyt,et al.  Sources of Variation and Bias in Studies of Diagnostic Accuracy , 2004, Annals of Internal Medicine.

[19]  Joseph Y. Lo,et al.  Bayesian networks of BI-RADS™ descriptors for breast lesion Classification , 2004 .

[20]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[21]  E. Feuer,et al.  SEER Cancer Statistics Review, 1975-2003 , 2006 .

[22]  V. McCormack,et al.  Breast Density and Parenchymal Patterns as Markers of Breast Cancer Risk: A Meta-analysis , 2006, Cancer Epidemiology Biomarkers & Prevention.

[23]  M. Mainiero,et al.  BI-RADS lexicon for US and mammography: interobserver variability and positive predictive value. , 2006, Radiology.

[24]  M. Elter,et al.  The prediction of breast cancer biopsy outcomes using two CAD approaches that both emphasize an intelligible decision process. , 2007, Medical physics.

[25]  BIRADS mammography: exercises. , 2007, European journal of radiology.

[26]  Jean B. Cormack,et al.  Diagnostic accuracy of digital versus film mammography: exploratory analysis of selected population subgroups in DMIST. , 2008, Radiology.

[27]  F. Schmidt Meta-Analysis , 2008 .

[28]  C. D. Page,et al.  Probabilistic computer model developed from clinical data in national mammography database format to classify mammographic findings. , 2009, Radiology.

[29]  E. Burnside,et al.  The ACR BI-RADS experience: learning from history. , 2009, Journal of the American College of Radiology : JACR.

[30]  M. Elter,et al.  CADx of mammographic masses and clustered microcalcifications: a review. , 2009, Medical physics.

[31]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[32]  Martin Krapcho,et al.  SEER Cancer Statistics Review, 1975–2009 (Vintage 2009 Populations) , 2012 .

[33]  M J M Broeders,et al.  A dedicated BI-RADS training programme: effect on the inter-observer variation among screening radiologists. , 2012, European journal of radiology.

[34]  Mireille J. M. Broeders,et al.  Breast cancer risk prediction model: a nomogram based on common mammographic screening findings , 2013, European Radiology.

[35]  Miguel Ángel Guevara-López,et al.  An evaluation of image descriptors combined with clinical data for breast cancer diagnosis , 2013, International Journal of Computer Assisted Radiology and Surgery.

[36]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[37]  G. Collins,et al.  External validation of multivariable prediction models: a systematic review of methodological conduct and reporting , 2014, BMC Medical Research Methodology.

[38]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .