Digital mammographic tumor classification using transfer learning from deep convolutional neural networks

Abstract. Convolutional neural networks (CNNs) show potential for computer-aided diagnosis (CADx) by learning features directly from the image data instead of using analytically extracted features. However, CNNs are difficult to train from scratch for medical images due to small sample sizes and variations in tumor presentations. Instead, transfer learning can be used to extract tumor information from medical images via CNNs originally pretrained for nonmedical tasks, alleviating the need for large datasets. Our database includes 219 breast lesions (607 full-field digital mammographic images). We compared support vector machine classifiers based on the CNN-extracted image features and our prior computer-extracted tumor features in the task of distinguishing between benign and malignant breast lesions. Five-fold cross validation (by lesion) was conducted with the area under the receiver operating characteristic (ROC) curve as the performance metric. Results show that classifiers based on CNN-extracted features (with transfer learning) perform comparably to those using analytically extracted features [area under the ROC curve (AUC)=0.81]. Further, the performance of ensemble classifiers based on both types was significantly better than that of either classifier type alone (AUC=0.86 versus 0.81, p=0.022). We conclude that transfer learning can improve current CADx methods while also providing standalone classifiers without large datasets, facilitating machine-learning methods in radiomics and precision medicine.

[1]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[2]  Hayit Greenspan,et al.  A comparative study for chest radiograph image retrieval using binary texture and deep learning classification , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[3]  K Doi,et al.  Computerized detection of clustered microcalcifications in digital mammograms using a shift-invariant artificial neural network. , 1994, Medical physics.

[4]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[5]  M. Giger,et al.  Anniversary paper: History and status of CAD and quantitative image analysis: the role of Medical Physics and AAPM. , 2008, Medical physics.

[6]  Huai Li,et al.  Artificial convolution neural network for medical image pattern recognition , 1995, Neural Networks.

[7]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[8]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Kurt Hornik,et al.  The Design and Analysis of Benchmark Experiments , 2005 .

[10]  Hayit Greenspan,et al.  Chest pathology detection using deep learning with non-medical training , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[11]  M. Giger,et al.  Automated computerized classification of malignant and benign masses on digitized mammograms. , 1998, Academic radiology.

[12]  Li Lan,et al.  Evaluation of computer-aided diagnosis on a large clinical full-field digital mammographic dataset. , 2008, Academic radiology.

[13]  M. Giger,et al.  Breast image analysis for risk assessment, detection, diagnosis, and treatment of cancer. , 2013, Annual review of biomedical engineering.

[14]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[15]  H. Chan,et al.  Advances in computer-aided diagnosis for breast cancer , 2006, Current opinion in obstetrics & gynecology.

[16]  M. Giger,et al.  Volumetric texture analysis of breast lesions on contrast‐enhanced magnetic resonance images , 2007, Magnetic resonance in medicine.

[17]  Angel Cruz-Roa,et al.  Mitosis detection in breast cancer pathology images by combining handcrafted and convolutional neural network features , 2014, Journal of medical imaging.

[18]  Maryellen L. Giger,et al.  Breast image feature learning with adaptive deconvolutional networks , 2012, Medical Imaging.