Lung nodule malignancy classification in chest computed tomography images using transfer learning and convolutional neural networks

Lung cancer accounts for more than 1.5 million deaths worldwide, and it corresponded to 26% of all deaths due to cancer in 2017. However, lung computer-aided diagnosis systems developed to identify lung cancer at early stages are increasing survival rates. This study explores the performance of deep transfer learning from non-medical images on lung nodule malignancy classification tasks in order to improve such systems. Initially, the 1018 chest computed tomography (CT) examinations and medical annotations from the LIDC/IDRI were processed. Then, several convolutional neural networks (VGG16, VGG19, MobileNet, Xception, InceptionV3, ResNet50, InceptionResNetV2, DenseNet169, DenseNet201, NASNetMobile and NASNetLarge) were built, trained on the ImageNet dataset, converted into feature extractors and applied on the LIDC/IDRI nodule images. Following this, each set of deep features was submitted to 10-fold cross-validations with naive Bayes, multilayer perceptron, support vector machine (SVM), K-nearest neighbors KNN and random forest classifiers. Finally, the evaluation metrics accuracy (ACC), area under the curve (AUC), true positive rate (TPR), precision (PPV) and F1-score of each cross-validation average result were computed and compared. The results showed that the deep feature extractor based on the ResNet50 and the SVM RBF classifier, achieved an AUC metric of 93.1% (the highest value not only among the evaluated combinations, but also among the related works in the literature evaluated), a TPR of 85.38%, an ACC of 88.41%, a PPV of 73.48% and an F1-score of 78.83%. Based on these results, deep transfer learning proves to be a relevant strategy to extract representative features from lung nodule CT images.

[1]  He Ma,et al.  Lung nodule classification using local kernel regression models with out-of-sample extension , 2018, Biomed. Signal Process. Control..

[2]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[3]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[6]  Hui Li,et al.  Digital mammographic tumor classification using transfer learning from deep convolutional neural networks , 2016, Journal of medical imaging.

[7]  Wei Shen,et al.  Multi-scale Convolutional Neural Networks for Lung Nodule Classification , 2015, IPMI.

[8]  P. Cortez,et al.  Modelo de Contorno Ativo Crisp: nova técnica de segmentação dos pulmões em imagens de TC , 2011 .

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Natalia F. Gusarova,et al.  Classification of pulmonary nodules on computed tomography scans. Evaluation of the effectiveness of application of textural features extracted using wavelet transform of image , 2016, 2016 18th Conference of Open Innovations Association and Seminar on Information Security and Protection of Information Technology (FRUCT-ISPIT).

[12]  Ulas Bagci,et al.  Risk Stratification of Lung Nodules Using 3D CNN-Based Multi-task Learning , 2017, IPMI.

[13]  P. Lambin,et al.  Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach , 2014, Nature Communications.

[14]  Guixia Kang,et al.  3D multi-view convolutional neural networks for lung nodule classification , 2017, PloS one.

[15]  João Manuel R. S. Tavares,et al.  Novel and powerful 3D adaptive crisp active contour method applied in the segmentation of CT lung images , 2017, Medical Image Anal..

[16]  Edward Y. Chang,et al.  Transfer representation learning for medical image analysis , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[17]  Joel J. P. C. Rodrigues,et al.  Detection of subtype blood cells using deep learning , 2018, Cognitive Systems Research.

[18]  Yan Liu,et al.  Deep residual learning for image steganalysis , 2018, Multimedia Tools and Applications.

[19]  Hiroyuki Yoshida,et al.  Deep transfer learning of virtual endoluminal views for the detection of polyps in CT colonography , 2016, SPIE Medical Imaging.

[20]  J. Rathmell,et al.  Did death certificates and a death review process agree on lung cancer cause of death in the National Lung Screening Trial? , 2016, Clinical trials.

[21]  Qian Wang,et al.  Automatic lung nodule classification with radiomics approach , 2016, SPIE Medical Imaging.

[22]  Xiaohui Xie,et al.  DeepLung: 3D Deep Convolutional Nets for Automated Pulmonary Nodule Detection and Classification , 2017, bioRxiv.

[23]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[24]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[25]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[27]  Quoc V. Le,et al.  Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[28]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[29]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[30]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[31]  Wei Qian,et al.  Content-based image retrieval for Lung Nodule Classification Using Texture Features and Learned Distance Metric , 2017, Journal of Medical Systems.

[32]  Richard C. Pais,et al.  The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. , 2011, Medical physics.

[33]  A. Jemal,et al.  Cancer statistics, 2017 , 2017, CA: a cancer journal for clinicians.

[34]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Marios Anthimopoulos,et al.  Multi-source Transfer Learning with Convolutional Neural Networks for Lung Pattern Analysis , 2016, IEEE journal of biomedical and health informatics.

[36]  Niranjan Khandelwal,et al.  A Combination of Shape and Texture Features for Classification of Pulmonary Nodules in Lung CT Images , 2016, Journal of Digital Imaging.

[37]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Victor Hugo C. De Albuquerque,et al.  Health of Things Algorithms for Malignancy Level Classification of Lung Nodules , 2018, IEEE Access.

[39]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Robertas Damaševičius,et al.  Brain Computer Interface Systems for Neurorobotics: Methods and Applications , 2017, BioMed research international.

[41]  Andino Maseleno,et al.  Optimal feature-based multi-kernel SVM approach for thyroid disease classification , 2018, The Journal of Supercomputing.

[42]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[44]  Veronica Swallow,et al.  Insights into Living with Kidney Disease , 2017, BioMed research international.

[45]  Hyo-Eun Kim,et al.  A novel approach for tuberculosis screening based on deep convolutional neural networks , 2016, SPIE Medical Imaging.

[46]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[47]  Wei Shen,et al.  Multi-crop Convolutional Neural Networks for lung nodule malignancy suspiciousness classification , 2017, Pattern Recognit..

[48]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  N. Arunkumar,et al.  Optimized cuttlefish algorithm for diagnosis of Parkinson’s disease , 2018, Cognitive Systems Research.

[50]  Pedro Pedrosa Rebouças Filho,et al.  New approach to detect and classify stroke in skull CT images via analysis of brain tissue densities , 2017, Comput. Methods Programs Biomed..

[51]  Hong Zhao,et al.  Texture Feature Analysis for Computer-Aided Diagnosis on Pulmonary Nodules , 2015, Journal of Digital Imaging.

[52]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[53]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[54]  Amy Loutfi,et al.  A review of unsupervised feature learning and deep learning for time-series modeling , 2014, Pattern Recognit. Lett..