A deep learning method for classifying mammographic breast density categories

PURPOSE Mammographic breast density is an established risk marker for breast cancer. Radiologists assess it visually during routine mammogram reading, using four qualitative Breast Imaging Reporting and Data System (BI-RADS) breast density categories. It is particularly difficult for radiologists to consistently distinguish the two most common and most variably assigned BI-RADS categories, "scattered density" and "heterogeneously dense". The aim of this work was to investigate a deep learning-based breast density classifier that consistently distinguishes these two categories, as a potential computerized tool to assist radiologists in assigning a BI-RADS category in the current clinical workflow.

METHODS We constructed a convolutional neural network (CNN)-based model and used a large digital mammogram dataset (22,000 images) to evaluate classification performance between the two aforementioned breast density categories. All images were collected from a cohort of 1,427 women who underwent standard digital mammography screening from 2005 to 2016 at our institution. The ground-truth density categories were based on standard clinical assessment made by board-certified breast imaging radiologists. We evaluated two training strategies for this task: training from scratch using only digital mammogram images, and transfer learning from a model pretrained on a large nonmedical imaging dataset. To further measure classification performance, the CNN classifier was also tested on a refined version of the mammogram dataset from which some potentially inaccurately labeled images had been removed. Receiver operating characteristic (ROC) curves and the area under the curve (AUC) were used to measure the accuracy of the classifier.

RESULTS The AUC was 0.9421 when the CNN model was trained from scratch on our own mammogram images, and accuracy increased gradually as the number of training samples increased. Using the pretrained model followed by fine-tuning with as few as 500 mammogram images led to an AUC of 0.9265. After the potentially inaccurately labeled images were removed, the AUC increased to 0.9882 without the pretrained model and 0.9857 with it, both significantly higher (P < 0.001) than when using the full imaging dataset.

CONCLUSIONS Our study demonstrated high classification accuracy between two difficult-to-distinguish breast density categories that are routinely assessed by radiologists. We anticipate that our approach will help enhance current clinical assessment of breast density and better support consistent density notification to patients in breast cancer screening.
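
As an illustration of the ROC/AUC evaluation named in METHODS, the following is a minimal Python sketch of how an AUC can be computed from a binary classifier's output scores. It is not the authors' code. The function names `roc_points` and `roc_auc`, the label encoding (1 for "heterogeneously dense", 0 for "scattered density"), and the toy scores are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def roc_points(labels: Sequence[int], scores: Sequence[float]) -> List[Tuple[float, float]]:
    """Return (false positive rate, true positive rate) pairs of the ROC curve.

    Assumed encoding: label 1 = "heterogeneously dense", label 0 = "scattered density".
    An image is predicted positive when its score is >= the current threshold.
    """
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = sum(1 for y in labels if y == 0)
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one example of each class")

    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
        fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
        points.append((fp / n_neg, tp / n_pos))
    return points


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, integrated with the trapezoidal rule."""
    points = roc_points(labels, scores)
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


if __name__ == "__main__":
    # Toy data for illustration only; these are not values from the study.
    toy_labels = [0, 0, 1, 1, 0, 1]
    toy_scores = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70]
    print(f"AUC = {roc_auc(toy_labels, toy_scores):.4f}")
```

An AUC of 1.0 means every positive image scores above every negative image, and 0.5 corresponds to chance-level ranking. In practice an established library routine would likely be used instead of this quadratic-time sketch.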
