Automated identification of thoracic pathology from chest radiographs with enhanced training pipeline

Chest x-rays are the most common radiology studies for diagnosing lung and heart disease. Hence, a system for automated pre-reporting of pathologic findings on chest x-rays would greatly enhance radiologists’ productivity. To this end, we investigate a deep-learning framework with novel training schemes for classification of different thoracic pathology labels from chest x-rays. We use the currently largest publicly available annotated dataset ChestX-ray14 of 112,120 chest radiographs of 30,805 patients. Each image was annotated with either a 'NoFinding' class, or one or more of 14 thoracic pathology labels. Subjects can have multiple pathologies, resulting in a multi-class, multi-label problem. We encoded labels as binary vectors using k-hot encoding. We study the ResNet34 architecture, pre-trained on ImageNet, where two key modifications were incorporated into the training framework: (1) Stochastic gradient descent with momentum and with restarts using cosine annealing, (2) Variable image sizes for fine-tuning to prevent overfitting. Additionally, we use a heuristic algorithm to select a good learning rate. Learning with restarts was used to avoid local minima. Area Under receiver operating characteristics Curve (AUC) was used to quantitatively evaluate diagnostic quality. Our results are comparable to, or outperform the best results of current state-of-the-art methods with AUCs as follows: Atelectasis:0.81, Cardiomegaly:0.91, Consolidation:0.81, Edema:0.92, Effusion:0.89, Emphysema: 0.92, Fibrosis:0.81, Hernia:0.84, Infiltration:0.73, Mass:0.85, Nodule:0.76, Pleural Thickening:0.81, Pneumonia:0.77, Pneumothorax:0.89 and NoFinding:0.79. Our results suggest that, in addition to using sophisticated network architectures, a good learning rate, scheduler and a robust optimizer can boost performance.

[1]  Thomas Villmann,et al.  Exploratory Observation Machine (XOM) with Kullback-Leibler Divergence for Dimensionality Reduction and Visualization , 2010, ESANN.

[2]  B HuberMarkus,et al.  Texture feature ranking with relevance learning to classify interstitial lung disease patterns , 2012 .

[3]  T. Nattkemper,et al.  Breast MRI data analysis by LLE , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[4]  T. Twellmann,et al.  Detection of suspicious lesions in dynamic contrast enhanced MRI data , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[5]  Phil Hoole,et al.  A Segmentation and Analysis Method for MRI Data of the Human Vocal Tract , 2003, Bildverarbeitung für die Medizin.

[6]  Axel Wismüller,et al.  Classification of small lesions on dynamic breast MRI: Integrating dimension reduction and out-of-sample extension into CADx methodology , 2014, Artif. Intell. Medicine.

[7]  Thomas Villmann,et al.  Neighbor embedding XOM for dimension reduction and visualization , 2011, Neurocomputing.

[8]  Axel Wismüller,et al.  Adaptive local dissimilarity measures for discriminative dimension reduction of labeled data , 2010, Neurocomputing.

[9]  Anke Meyer-Bäse,et al.  Segmentation and classification of dynamic breast magnetic resonance image data , 2006, J. Electronic Imaging.

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  M. Reiser,et al.  Cluster analysis of signal-intensity time course in dynamic breast MRI: does unsupervised vector quantization help to evaluate small mammographic lesions? , 2006, European Radiology.

[12]  Axel Wismüller,et al.  Computer-Aided Diagnosis for Phase-Contrast X-ray Computed Tomography: Quantitative Characterization of Human Patellar Cartilage with High-Dimensional Geometric Features , 2014, Journal of Digital Imaging.

[13]  Axel Wismüller,et al.  A Multivariate Granger Causality Concept towards Full Brain Functional Connectivity , 2016, PloS one.

[14]  Frank Hutter,et al.  SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[15]  Axel Wismüller,et al.  A Neural Network Approach to Functional MRI Pattern Analysis — Clustering of Time-Series by Hierarchical Vector Quantization , 1998 .

[16]  Axel Wismüller,et al.  A Framework for Exploring Non-Linear Functional Connectivity and Causality in the Human Brain: Mutual Connectivity Analysis (MCA) of Resting-State Functional MRI with Convergent Cross-Mapping and Non-Metric Clustering , 2014, ArXiv.

[17]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[18]  Anke Meyer-Bäse,et al.  Fully automated biomedical image segmentation by self-organized model adaptation , 2004, Neural Networks.

[19]  Axel Wismüller,et al.  Segmentation with neural networks , 2000 .

[20]  Axel Wismüller,et al.  Classification of small lesions in dynamic breast MRI: eliminating the need for precise lesion segmentation through spatio-temporal analysis of contrast enhancement , 2012, Machine Vision and Applications.

[21]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[22]  Axel Wismüller A Computational Framework for Nonlinear Dimensionality Reduction and Clustering , 2009, WSOM.

[23]  Axel Wismüller,et al.  Classification of interstitial lung disease patterns with topological texture features , 2010, Medical Imaging.

[24]  Axel Wismüller The exploration machine: a novel method for analyzing high-dimensional data in computer-aided diagnosis , 2009, Medical Imaging.

[25]  Thomas Martinetz,et al.  Medical image compression using topology-preserving neural networks , 2005, Eng. Appl. Artif. Intell..

[26]  Wei Wei,et al.  Thoracic Disease Identification and Localization with Limited Supervision , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27]  M.Kleinberg Jon,et al.  Advances in Self-Organizing Maps, 7th International Workshop, WSOM 2009, St. Augustine, FL, USA, June 8-10, 2009. Proceedings , 2009, WSOM.

[28]  Anke Meyer-Baese,et al.  Model-free functional MRI analysis using cluster-based methods , 2003, SPIE Defense + Commercial Sensing.

[29]  M. Reiser,et al.  Classification of Small Contrast Enhancing Breast Lesions in Dynamic Magnetic Resonance Imaging Using a Combination of Morphological Criteria and Dynamic Analysis Based on Unsupervised Vector-Quantization , 2008, Investigative radiology.

[30]  Helge J. Ritter,et al.  The deformable feature map - a novel neurocomputing algorithm for adaptive plasticity in pattern analysis , 2002, Neurocomputing.

[31]  Axel Wismüller,et al.  Computer-Aided Diagnosis in Phase Contrast Imaging X-Ray Computed Tomography for Quantitative Characterization of ex vivo Human Patellar Cartilage , 2013, IEEE Transactions on Biomedical Engineering.

[32]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[33]  Igor Pantic,et al.  Gray Level Co-Occurrence Matrix Texture Analysis of Germinal Center Light Zone Lymphocyte Nuclei: Physiology Viewpoint with Focus on Apoptosis , 2012, Microscopy and Microanalysis.

[34]  Christian Kroos,et al.  Analysis of tongue configuration in multi-speaker, multi-volume MRI data , 2000 .

[35]  A. Meyer-Base,et al.  Stability analysis of a self-organizing neural network with feedforward and feedback dynamics , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[36]  Peter Hastreiter,et al.  Bildverarbeitung für die Medizin 2003 , 2003 .

[37]  Michel Verleysen,et al.  Recent Advances in Nonlinear Dimensionality Reduction, Manifold and Topological Learning , 2010, ESANN.

[38]  Axel Wismüller,et al.  Texture feature ranking with relevance learning to classify interstitial lung disease patterns , 2012, Artif. Intell. Medicine.

[39]  Axel Wismüller,et al.  Prediction of Biomechanical Properties of Trabecular Bone in MR Images With Geometric Features and Support Vector Regression , 2011, IEEE Transactions on Biomedical Engineering.

[40]  H. Ritter,et al.  The Deformable Feature Map — Adaptive Plasticity for Function Approximation , 1998 .

[41]  Axel Wismüller,et al.  The Exploration Machine - A Novel Method for Data Visualization , 2009, WSOM.