Highly Robust Prediction of Lung Nodule Malignancy by Deep Learning Model: A Multiracial, Multinational Study

Purpose Although statistical models have been employed to detect and classify lung nodules using deep learning-extracted and clinical features, there is a lack of model validation in independent, multinational datasets from computed tomography (CT) scans and patient clinical information. To this end, we developed a deep learning-based algorithm to predict the malignancy of pulmonary nodules and validated its performance in three independent datasets containing multiracial and multinational populations. Methods In this study, a convolutional neural network-based algorithm to predict lung nodule malignancy was built based on CT scans and patient-wise clinical features (i.e. sex, spiculation, and nodule location). The model consists of three steps: (1) a deep learning algorithm to automatically extract features from CT scans, (2) clinical features were concatenated with the nodule features after dimension reduction by the principal component analysis (PCA), and (3) a multivariate logistic regression model was employed to classify the malignancy of the lung nodules. The model was trained by a dataset containing 1,556 nodules from 813 patients from the National Lung Screening Trial (NLST). The performance of the model was evaluated on three independent, multi-institutional datasets LIDC and Infervision Multi-Center (IMC) dataset, which contains 562 nodules from 293 patients, and 2044 nodules from 589 patients, respectively. The model accuracy was measured by the area under curve (AUC) of receiver operating characteristic (ROC) analysis. Results The study shows that the AUCs of ROCs on the NLST dataset, LIDC dataset, and IMC dataset are 0.91, 0.86, and 0.95, respectively. The inclusion of clinical features does not significantly improve the model performance. Quantitatively, the summed-up weight on the prediction accuracy of the 10 nodule features extracted by the deep learning algorithm equals to 0.091, while the weight of patient sex, nodule spiculation, and location is 0.031, 0.052, and 0.008, respectively. Conclusion The convolutional neural network-based model for lung nodule classification could be generalized to multiple datasets containing diverse populations. The addition of three patient clinical features to the nodule features extracted by deep learning does not boost the performance of the model.

[1]  B. van Ginneken,et al.  Observer variability for Lung-RADS categorisation of lung cancer screening CTs: impact on patient management , 2018, European Radiology.

[2]  Stephen R. Baker,et al.  The causes of medical malpractice suits against radiologists in the United States. , 2013, Radiology.

[3]  S. Lam,et al.  Probability of cancer in pulmonary nodules detected on first screening CT. , 2013, The New England journal of medicine.

[4]  N J Wald,et al.  UK Lung Cancer RCT Pilot Screening Trial: baseline findings from the screening arm provide evidence for the potential implementation of lung cancer screening , 2015, Thorax.

[5]  Paul F. Pinsky,et al.  Evaluation of reader variability in the interpretation of follow-up CT scans at lung cancer screening. , 2011, Radiology.

[6]  Shiqian Ma,et al.  Highly accurate model for prediction of lung nodule malignancy with CT scans , 2018, Scientific Reports.

[7]  L. Tanoue,et al.  Reduced lung cancer mortality with low-dose computed tomographic screening , 2011 .

[8]  G. Corrado,et al.  End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography , 2019, Nature Medicine.

[9]  David E. Newman-Toker,et al.  Serious misdiagnosis-related harms in malpractice claims: The “Big Three” – vascular events, infections, and cancers , 2019, Diagnosis.

[10]  Arash Naeim,et al.  Cost-effectiveness of CT screening in the National Lung Screening Trial. , 2014, The New England journal of medicine.

[11]  Colleen Bouzan,et al.  Cost-Effectiveness of Computed Tomography Screening for Lung Cancer in the United States , 2011, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[12]  C. Gatsonis,et al.  Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening , 2012 .