Pre-Training Autoencoder for Lung Nodule Malignancy Assessment Using CT Images

Lung cancer late diagnosis has a large impact on the mortality rate numbers, leading to a very low five-year survival rate of 5%. This issue emphasises the importance of developing systems to support a diagnostic at earlier stages. Clinicians use Computed Tomography (CT) scans to assess the nodules and the likelihood of malignancy. Automatic solutions can help to make a faster and more accurate diagnosis, which is crucial for the early detection of lung cancer. Convolutional neural networks (CNN) based approaches have shown to provide a reliable feature extraction ability to detect the malignancy risk associated with pulmonary nodules. This type of approach requires a massive amount of data to model training, which usually represents a limitation in the biomedical field due to medical data privacy and security issues. Transfer learning (TL) methods have been widely explored in medical imaging applications, offering a solution to overcome problems related to the lack of training data publicly available. For the clinical annotations experts with a deep understanding of the complex physiological phenomena represented in the data are required, which represents a huge investment. In this direction, this work explored a TL method based on unsupervised learning achieved when training a Convolutional Autoencoder (CAE) using images in the same domain. For this, lung nodules from the Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) were extracted and used to train a CAE. Then, the encoder part was transferred, and the malignancy risk was assessed in a binary classification—benign and malignant lung nodules, achieving an Area Under the Curve (AUC) value of 0.936. To evaluate the reliability of this TL approach, the same architecture was trained from scratch and achieved an AUC value of 0.928. The results reported in this comparison suggested that the feature learning achieved when reconstructing the input with an encoder-decoder based architecture can be considered an useful knowledge that might allow overcoming labelling constraints.

[1]  Tareq Abed Mohammed,et al.  Understanding of a convolutional neural network , 2017, 2017 International Conference on Engineering and Technology (ICET).

[2]  James C. Gee,et al.  Transfer Learning Approach to Predict Biopsy-Confirmed Malignancy of Lung Nodules from Imaging Data: A Pilot Study , 2018, RAMBO+BIA+TIA@MICCAI.

[3]  Nasser M. Nasrabadi,et al.  Multi-Level Feature Abstraction from Convolutional Neural Networks for Multimodal Biometric Identification , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[4]  Hongli Lin,et al.  Measuring Interobserver Disagreement in Rating Diagnostic Characteristics of Pulmonary Nodule Using the Lung Imaging Database Consortium and Image Database Resource Initiative. , 2017, Academic radiology.

[5]  Liu Lu,et al.  Benign and Malignant Solitary Pulmonary Nodules Classification Based on CNN and SVM , 2018, ICMVA.

[6]  Wei Shen,et al.  Multi-scale Convolutional Neural Networks for Lung Nodule Classification , 2015, IPMI.

[7]  Jan Kautz,et al.  Loss Functions for Image Restoration With Neural Networks , 2017, IEEE Transactions on Computational Imaging.

[8]  Alexander Wong,et al.  Lung Nodule Classification Using Deep Features in CT Images , 2015, 2015 12th Conference on Computer and Robot Vision.

[9]  M. Gaga,et al.  Lung nodules: A comprehensive review on current approach and management , 2019, Annals of thoracic medicine.

[10]  Qiang Zhang,et al.  Classification of Benign and Malignant Pulmonary Nodules Based on Deep Learning , 2018, 2018 5th International Conference on Information Science and Control Engineering (ICISCE).

[11]  Massimo Bellomi,et al.  Radiomics: the facts and the challenges of image analysis , 2018, European Radiology Experimental.

[12]  Sven Kabus,et al.  Agreement of CAD features with expert observer ratings for characterization of pulmonary nodules in CT using the LIDC-IDRI database , 2009, Medical Imaging.

[13]  Moulay A. Akhloufi,et al.  Deep Learning for Lung Cancer Nodules Detection and Classification in CT Scans , 2020 .

[14]  Suane Pires P. da Silva,et al.  Lung Nodule Classification via Deep Transfer Learning in CT Lung Images , 2018, 2018 IEEE 31st International Symposium on Computer-Based Medical Systems (CBMS).

[15]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[16]  Kenji Suzuki,et al.  A deep CNN based transfer learning method for false positive reduction , 2018, Multimedia Tools and Applications.

[17]  Moacir Antonelli Ponti,et al.  Unsupervised Representation Learning Using Convolutional and Stacked Auto-Encoders: A Domain and Cross-Domain Feature Space Analysis , 2018, 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI).

[18]  Yanning Zhang,et al.  Fusing texture, shape and deep model-learned information at decision level for automated classification of lung nodules on chest CT , 2018, Inf. Fusion.

[19]  Richard C. Pais,et al.  The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. , 2011, Medical physics.

[20]  Victor Hugo C. de Albuquerque,et al.  Lung nodule malignancy classification in chest computed tomography images using transfer learning and convolutional neural networks , 2018, Neural Computing and Applications.

[21]  Dennis Wollersheim,et al.  Pulmonary nodule classification with deep residual networks , 2017, International Journal of Computer Assisted Radiology and Surgery.

[22]  Qingzeng Song,et al.  Using Deep Learning for Classification of Lung Nodules on Computed Tomography Images , 2017, Journal of healthcare engineering.

[23]  Jaime S. Cardoso,et al.  Machine Learning Interpretability: A Survey on Methods and Metrics , 2019, Electronics.

[24]  Xin Geng,et al.  Classification of Lung Nodule Malignancy Risk on Computed Tomography Images Using Convolutional Neural Network: A Comparison Between 2D and 3D Strategies , 2016, ACCV Workshops.

[25]  Hongxun Yao,et al.  Dimensionality reduction strategy based on auto-encoder , 2015, ICIMCS '15.

[26]  Shiqian Ma,et al.  Highly accurate model for prediction of lung nodule malignancy with CT scans , 2018, Scientific Reports.

[27]  D. Shen,et al.  Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans , 2016, Scientific Reports.