Highly accurate model for prediction of lung nodule malignancy with CT scans

Computed tomography (CT) examinations are commonly used to predict lung nodule malignancy in patients, which are shown to improve noninvasive early diagnosis of lung cancer. It remains challenging for computational approaches to achieve performance comparable to experienced radiologists. Here we present NoduleX, a systematic approach to predict lung nodule malignancy from CT data, based on deep learning convolutional neural networks (CNN). For training and validation, we analyze >1000 lung nodules in images from the LIDC/IDRI cohort. All nodules were identified and classified by four experienced thoracic radiologists who participated in the LIDC project. NoduleX achieves high accuracy for nodule malignancy classification, with an AUC of ~0.99. This is commensurate with the analysis of the dataset by experienced radiologists. Our approach, NoduleX, provides an effective framework for highly accurate nodule malignancy prediction with the model trained on a large patient population. Our results are replicable with software available at http://bioinformatics.astate.edu/NoduleX.

[1]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[2]  Anne Graham,et al.  The US experience , 2001 .

[3]  R. F. Wagner,et al.  Assessment methodologies and statistical issues for computer-aided diagnosis of lung nodules in computed tomography: contemporary research topics relevant to the lung image database consortium. , 2004, Academic radiology.

[4]  E. V. van Beek,et al.  The Lung Image Database Consortium (LIDC): a comparison of different size metrics for pulmonary nodule measurements. , 2007, Academic radiology.

[5]  Heber MacMahon,et al.  The Lung Image Database Consortium (LIDC): ensuring the integrity of expert-defined "truth". , 2007, Academic radiology.

[6]  Zaid J. Towfic,et al.  The Lung Image Database Consortium (LIDC) data collection process for nodule detection and annotation , 2007, SPIE Medical Imaging.

[7]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[8]  B. Zheng,et al.  Computerized comprehensive data analysis of lung imaging database consortium (LIDC). , 2010, Medical physics.

[9]  Jame Abraham,et al.  Reduced lung cancer mortality with low-dose computed tomographic screening , 2011 .

[10]  C. Gatsonis,et al.  Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening , 2012 .

[11]  Richard C. Pais,et al.  The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. , 2011, Medical physics.

[12]  Amanda Amy Harris Houk,et al.  Google for Research , 2012 .

[13]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[14]  A. Jemal,et al.  Cancer statistics, 2014 , 2014, CA: a cancer journal for clinicians.

[15]  P. Lambin,et al.  Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach , 2014, Nature Communications.

[16]  Masoom A. Haider,et al.  Discovery Radiomics for Computed Tomography Cancer Detection , 2015, ArXiv.

[17]  Johanna Uthoff,et al.  Improved pulmonary nodule classification utilizing quantitative lung parenchyma features , 2015, Journal of medical imaging.

[18]  Alexander Wong,et al.  Lung Nodule Classification Using Deep Features in CT Images , 2015, 2015 12th Conference on Computer and Robot Vision.

[19]  Anthony P. Reeves,et al.  Automated pulmonary nodule CT image characterization in lung cancer screening , 2015, International Journal of Computer Assisted Radiology and Surgery.

[20]  Wei Shen,et al.  Multi-scale Convolutional Neural Networks for Lung Nodule Classification , 2015, IPMI.

[21]  Jerry F. Magnan,et al.  Lung nodule malignancy classification using only radiologist-quantified image features as inputs to statistical learning algorithms: probing the Lung Image Database Consortium dataset with two statistical learning methods , 2016, Journal of medical imaging.

[22]  Ronald M. Summers,et al.  Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique , 2016 .

[23]  Kirk E. Smith,et al.  Quantitative Computed Tomography Classification of Lung Nodules: Initial Comparison of 2- and 3-Dimensional Analysis , 2016, Journal of computer assisted tomography.

[24]  Angélique Stéphanou,et al.  Towards the Design of a Patient-Specific Virtual Tumour , 2016, Comput. Math. Methods Medicine.

[25]  Giovanni Montana,et al.  Recurrent Convolutional Networks for Pulmonary Nodule Detection in CT Imaging , 2016, ArXiv.

[26]  Ricardo A. M. Valentim,et al.  Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy , 2016, BioMedical Engineering OnLine.

[27]  G. L. F. D. Silva,et al.  Taxonomic indexes for differentiating malignancy of lung nodules on CT images , 2016 .

[28]  Jörg Denzinger,et al.  Lung nodule detection in CT images using deep convolutional neural networks , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[29]  Alhayat Ali Mekonnen,et al.  Benign-Malignant Lung Nodule Classification with Geometric and Appearance Histogram Features , 2016, ArXiv.

[30]  P. Massion,et al.  The Pursuit of Noninvasive Diagnosis of Lung Cancer , 2016, Seminars in Respiratory and Critical Care Medicine.

[31]  D. Shen,et al.  Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans , 2016, Scientific Reports.

[32]  Wei Li,et al.  Pulmonary Nodule Classification with Deep Convolutional Neural Networks on Computed Tomography Images , 2016, Comput. Math. Methods Medicine.

[33]  Erich P Huang,et al.  RECIST 1.1-Update and clarification: From the RECIST committee. , 2016, European journal of cancer.

[34]  Jun Wang,et al.  Prediction of malignant and benign of lung tumor using a quantitative radiomic method , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[35]  Bai Ying Lei,et al.  Bridging Computational Features Toward Multiple Semantic Features with Multi-task Regression: A Study of CT Pulmonary Nodules , 2016, MICCAI.

[36]  Samuel H. Hawkins,et al.  Deep Feature Transfer Learning in Combination with Traditional Features Predicts Survival Among Patients with Lung Adenocarcinoma , 2016, Tomography.

[37]  R. Gillies,et al.  Radiological Image Traits Predictive of Cancer Status in Pulmonary Nodules , 2016, Clinical Cancer Research.

[38]  Samuel H. Hawkins,et al.  Predicting Malignant Nodules from Screening CT Scans , 2016, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[39]  Mei Xie,et al.  Automatic Categorization and Scoring of Solid, Part-Solid and Non-Solid Pulmonary Nodules in CT Images with Convolutional Neural Network , 2017, Scientific Reports.

[40]  D. Aberle Implementing lung cancer screening: the US experience. , 2017, Clinical radiology.

[41]  Hao Chen,et al.  Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge , 2016, Medical Image Anal..

[42]  Wei Shen,et al.  Multi-crop Convolutional Neural Networks for lung nodule malignancy suspiciousness classification , 2017, Pattern Recognit..

[43]  Bai Ying Lei,et al.  Automatic Scoring of Multiple Semantic Attributes With Multi-Task Feature Leverage: A Study on Pulmonary Nodules in CT Images , 2017, IEEE Transactions on Medical Imaging.