论文信息 - Learning Invariant Feature Representation to Improve Generalization across Chest X-ray Datasets

Learning Invariant Feature Representation to Improve Generalization across Chest X-ray Datasets

Chest radiography is the most common medical image examination for screening and diagnosis in hospitals. Automatic interpretation of chest X-rays at the level of an entry-level radiologist can greatly benefit work prioritization and assist in analyzing a larger population. Subsequently, several datasets and deep learning-based solutions have been proposed to identify diseases based on chest X-ray images. However, these methods are shown to be vulnerable to shift in the source of data: a deep learning model performing well when tested on the same dataset as training data, starts to perform poorly when it is tested on a dataset from a different source. In this work, we address this challenge of generalization to a new source by forcing the network to learn a source-invariant representation. By employing an adversarial training strategy, we show that a network can be forced to learn a source-invariant representation. Through pneumonia-classification experiments on multi-source chest X-ray datasets, we show that this algorithm helps in improving classification accuracy on a new source of X-ray dataset.

[1] Tanveer F. Syeda-Mahmood,et al. Bimodal Network Architectures for Automatic Generation of Image Annotation from Text , 2018, MICCAI.

[2] Peter Szolovits,et al. MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[3] Hao Chen,et al. Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-ray Segmentation , 2018, MLMI@MICCAI.

[4] Sepp Hochreiter,et al. Self-Normalizing Neural Networks , 2017, NIPS.

[5] Yifan Yu,et al. CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison , 2019, AAAI.

[6] Kathryn J Fowler,et al. Assessing Radiology Research on Artificial Intelligence: A Brief Guide for Authors, Reviewers, and Readers-From the Radiology Editorial Board. , 2019, Radiology.

[7] David Lopez-Paz,et al. Invariant Risk Minimization , 2019, ArXiv.

[8] Andrew Y. Ng,et al. CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[9] Samy Bengio,et al. Understanding deep learning requires rethinking generalization , 2016, ICLR.

[10] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Victor S. Lempitsky,et al. Unsupervised Domain Adaptation by Backpropagation , 2014, ICML.

[12] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL 2006.

[13] Anup Pillai,et al. Chest X-ray Report Generation through Fine-Grained Label Learning , 2020, MICCAI.

[14] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[15] David S. Melnick,et al. International evaluation of an AI system for breast cancer screening , 2020, Nature.

[16] A McAllesterDavid. Some PAC-Bayesian Theorems , 1999 .

[17] Neal Lewis,et al. SPOT the Drug! An Unsupervised Pattern Matching Method to Extract Drug Names from Very Large Clinical Corpora , 2012, 2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology.

[18] Silvio Savarese,et al. Learning Transferrable Representations for Unsupervised Domain Adaptation , 2016, NIPS.

[19] Roger G. Mark,et al. MIMIC-CXR: A large publicly available database of labeled chest radiographs , 2019, ArXiv.

[20] Ronald M. Summers,et al. ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[21] André Elisseeff,et al. Stability and Generalization , 2002, J. Mach. Learn. Res..

[22] Richard S. Zemel,et al. Generative Moment Matching Networks , 2015, ICML.

[23] Gábor Lugosi,et al. Introduction to Statistical Learning Theory , 2004, Advanced Lectures on Machine Learning.

[24] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL.

[25] David A. McAllester. Some PAC-Bayesian theorems , 1998, COLT' 98.

[26] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[27] Yiming Yang,et al. MMD GAN: Towards Deeper Understanding of Moment Matching Network , 2017, NIPS.

[28] Maxim Raginsky,et al. Information-theoretic analysis of generalization capability of learning algorithms , 2017, NIPS.

[29] Tatsuya Harada,et al. Domain Generalization Using a Mixture of Multiple Latent Domains , 2019, AAAI.