Representation learning for mammography mass lesion classification with convolutional neural networks

BACKGROUND AND OBJECTIVE The automatic classification of breast imaging lesions is currently an unsolved problem. This paper describes an innovative representation learning framework for breast cancer diagnosis in mammography that integrates deep learning techniques to automatically learn discriminative features avoiding the design of specific hand-crafted image-based feature detectors. METHODS A new biopsy proven benchmarking dataset was built from 344 breast cancer patients' cases containing a total of 736 film mammography (mediolateral oblique and craniocaudal) views, representative of manually segmented lesions associated with masses: 426 benign lesions and 310 malignant lesions. The developed method comprises two main stages: (i) preprocessing to enhance image details and (ii) supervised training for learning both the features and the breast imaging lesions classifier. In contrast to previous works, we adopt a hybrid approach where convolutional neural networks are used to learn the representation in a supervised way instead of designing particular descriptors to explain the content of mammography images. RESULTS Experimental results using the developed benchmarking breast cancer dataset demonstrated that our method exhibits significant improved performance when compared to state-of-the-art image descriptors, such as histogram of oriented gradients (HOG) and histogram of the gradient divergence (HGD), increasing the performance from 0.787 to 0.822 in terms of the area under the ROC curve (AUC). Interestingly, this model also outperforms a set of hand-crafted features that take advantage of additional information from segmentation by the radiologist. Finally, the combination of both representations, learned and hand-crafted, resulted in the best descriptor for mass lesion classification, obtaining 0.826 in the AUC score. CONCLUSIONS A novel deep learning based framework to automatically address classification of breast mass lesions in mammography was developed.

[1]  Dinggang Shen,et al.  Deep Learning-Based Feature Representation for AD/MCI Classification , 2013, MICCAI.

[2]  Miguel Ángel Guevara-López,et al.  Discovering Mammography-based Machine Learning Classifiers for Breast Cancer Diagnosis , 2012, Journal of Medical Systems.

[3]  Nicolas Pinto,et al.  Why is Real-World Visual Object Recognition Hard? , 2008, PLoS Comput. Biol..

[4]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Christian Igel,et al.  Deep Feature Learning for Knee Cartilage Segmentation Using a Triplanar Convolutional Neural Network , 2013, MICCAI.

[6]  A. Ramli,et al.  Computer-aided detection/diagnosis of breast cancer in mammography and ultrasound: a review. , 2013, Clinical imaging.

[7]  Seong-Whan Lee,et al.  Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis , 2014, NeuroImage.

[8]  Fabio A. González,et al.  Supervised Greedy Layer-Wise Training for Deep Convolutional Networks with Small Datasets , 2015, ICCCI.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Miguel Ángel Guevara-López,et al.  Improving the Mann-Whitney statistical test for feature selection: An approach in breast cancer diagnosis on mammography , 2015, Artif. Intell. Medicine.

[11]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[12]  Maryellen L. Giger,et al.  Breast image feature learning with adaptive deconvolutional networks , 2012, Medical Imaging.

[13]  Seong-Whan Lee,et al.  Latent feature representation with stacked auto-encoder for AD/MCI diagnosis , 2013, Brain Structure and Function.

[14]  Berkman Sahiner,et al.  Computer aided detection of clusters of microcalcifications on full field digital mammograms. , 2006, Medical physics.

[15]  Razvan Pascanu,et al.  Theano: new features and speed improvements , 2012, ArXiv.

[16]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[17]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[18]  Fabio A. González,et al.  A Deep Learning Architecture for Image Representation, Visual Interpretability and Automated Basal-Cell Carcinoma Cancer Detection , 2013, MICCAI.

[19]  E. Burnside,et al.  Computer-aided diagnostic models in breast cancer screening. , 2010, Imaging in medicine.

[20]  Yoshua Bengio,et al.  Maxout Networks , 2013, ICML.

[21]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[22]  Dinggang Shen,et al.  Robust Deep Learning for Improved Classification of AD/MCI Patients , 2014, MLMI.

[23]  Xin-Sheng Zhang,et al.  A New Approach for Clustered MCs Classification with Sparse Features Learning and TWSVM , 2014, TheScientificWorldJournal.

[24]  Miguel Ángel Guevara-López,et al.  An evaluation of image descriptors combined with clinical data for breast cancer diagnosis , 2013, International Journal of Computer Assisted Radiology and Surgery.

[25]  L. Tabár,et al.  Swedish two-county trial: impact of mammographic screening on breast cancer mortality during 3 decades. , 2011, Radiology.

[26]  Yide Ma,et al.  An Efficient Approach for Automated Mass Segmentation and Classification in Mammograms , 2015, Journal of Digital Imaging.

[27]  Miguel Ángel Guevara-López,et al.  A Software Framework for Building Biomedical Machine Learning Classifiers through Grid Computing Resources , 2012, Journal of Medical Systems.

[28]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  Eero P. Simoncelli,et al.  Nonlinear image representation using divisive normalization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Yann LeCun,et al.  Learning Invariant Feature Hierarchies , 2012, ECCV Workshops.

[31]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[32]  Nico Karssemeijer,et al.  Normalization of local contrast in mammograms , 2000, IEEE Transactions on Medical Imaging.

[33]  Xiaoming Liu,et al.  Mass Classification in Mammograms Using Selected Geometry and Texture Features, and a New SVM-Based Feature Selection Method , 2014, IEEE Systems Journal.

[34]  Yi Li,et al.  Is Rotation a Nuisance in Shape Recognition? , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Razvan Pascanu,et al.  Pylearn2: a machine learning research library , 2013, ArXiv.

[36]  Fei-Fei Li,et al.  Detecting Avocados to Zucchinis: What Have We Done, and Where Are We Going? , 2013, 2013 IEEE International Conference on Computer Vision.

[37]  Nico Karssemeijer,et al.  Breast Tissue Segmentation and Mammographic Risk Scoring Using Deep Learning , 2014, Digital Mammography / IWDM.

[38]  Angel Cruz-Roa,et al.  Hybrid image representation learning model with invariant features for basal cell carcinoma detection , 2013, Other Conferences.