Prediction of Obstructive Lung Disease from Chest Radiographs via Deep Learning Trained on Pulmonary Function Data

Background Chronic obstructive pulmonary disease (COPD), the third leading cause of death worldwide, is often underdiagnosed. Purpose To develop machine learning methods to predict COPD using chest radiographs and a convolutional neural network (CNN) trained with near-concurrent pulmonary function test (PFT) data. Comparison is made to natural language processing (NLP) of the associated radiologist text reports. Materials and Methods This IRB-approved single-institution retrospective study uses 6749 two-view chest radiograph exams (2012–2017, 4436 unique subjects, 54% female, 46% male), same-day associated radiologist text reports, and PFT exams acquired within 180 days. The Image Model (Resnet18 pre-trained with ImageNet CNN) is trained using frontal and lateral radiographs and PFTs with 10% of the subjects for validation and 19% for testing. The NLP Model is trained using radiologist text reports and PFTs. The primary metric of model comparison is the area under the receiver operating characteristic curve (AUC). Results The Image Model achieves an AUC of 0.814 for prediction of obstructive lung disease (FEV1/FVC <0.7) from chest radiographs and performs better than the NLP Model (AUC 0.704, p<0.001) from radiologist text reports where FEV1 = forced expiratory volume in 1 second and FVC = forced vital capacity. The Image Model performs better for prediction of severe or very severe COPD (FEV1 <0.5) with an AUC of 0.837 versus the NLP model AUC of 0.770 (p<0.001). Conclusion A CNN Image Model trained on physiologic lung function data (PFTs) can be applied to chest radiographs for quantitative prediction of obstructive lung disease with good accuracy.

[1]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[2]  Andy Kitchen,et al.  Chest Radiographs in Congestive Heart Failure: Visualizing Neural Network Learning. , 2019, Radiology.

[3]  G. Kloecker,et al.  Lung cancer screening rates: Data from the lung cancer screening registry. , 2018 .

[4]  J. Hurst,et al.  The unmet global burden of COPD , 2018, Global health, epidemiology and genomics.

[5]  Wei Wei,et al.  Thoracic Disease Identification and Localization with Limited Supervision , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[7]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[8]  Li Yao,et al.  Learning to diagnose from scratch by exploiting dependencies among labels , 2017, ArXiv.

[9]  Ahmedin Jemal,et al.  Lung Cancer Screening With Low-Dose Computed Tomography in the United States—2010 to 2015 , 2017, JAMA oncology.

[10]  T. Henry,et al.  Pulmonary Function Tests for the Radiologist. , 2017, Radiographics : a review publication of the Radiological Society of North America, Inc.

[11]  P. Lakhani,et al.  Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks. , 2017, Radiology.

[12]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[13]  J. Zulueta,et al.  Lung cancer screening in patients with chronic obstructive pulmonary disease. , 2016, Annals of translational medicine.

[14]  F. Martinez,et al.  Screening, Case-Finding, and Outcomes for Adults With Unrecognized COPD. , 2016, JAMA.

[15]  Meilan K. Han,et al.  Undiagnosed Obstructive Lung Disease in the United States. Associated Factors and Long-term Mortality. , 2015, Annals of the American Thoracic Society.

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Christopher Joseph Pal,et al.  Brain tumor segmentation with Deep Neural Networks , 2015, Medical Image Anal..

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[20]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[21]  R. Elashoff,et al.  Comparison of the variability of the annual rates of change in FEV1 determined from serial measurements of the pre- versus post-bronchodilator FEV1 over 5 years in mild to moderate COPD: Results of the lung health study , 2012, Respiratory Research.

[22]  Geoffrey E. Hinton,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[23]  Claudio Pedone,et al.  Comorbidities of chronic obstructive pulmonary disease , 2011, Current opinion in pulmonary medicine.

[24]  D. Mannino,et al.  Pre- and post-bronchodilator lung function as predictors of mortality in the Lung Health Study , 2011, Respiratory research.

[25]  E. Regan,et al.  Genetic Epidemiology of COPD (COPDGene) Study Design , 2011, COPD.

[26]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[27]  B. Make,et al.  Comorbidities in chronic obstructive pulmonary disease. , 2008, Proceedings of the American Thoracic Society.

[28]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[29]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[30]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[31]  A. Jemal,et al.  Cancer statistics, 2018 , 2018, CA: a cancer journal for clinicians.

[32]  Gustavo Carneiro,et al.  Combining deep learning and level set for the automated segmentation of the left ventricle of the heart from cardiac cine magnetic resonance , 2017, Medical Image Anal..

[33]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[34]  L. Tanoue,et al.  Reduced lung cancer mortality with low-dose computed tomographic screening , 2011 .

[35]  F. Martinez,et al.  Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. , 2007, American journal of respiratory and critical care medicine.

[36]  C. Lenfant,et al.  Global Initiative for chronic obstructive lung disease. Global strategy for the diagnosis, management and prevention of chronic obstructive pulmonary disease , 2006 .

[37]  A. Svejgaard,et al.  [Genetic epidemiology]. , 2003, Ugeskrift for laeger.