Fast few-shot transfer learning for disease identification from chest x-ray images using autoencoder ensemble

We propose a fast few-shot learning framework that uses transfer learning to identify different lung and chest diseases and conditions from chest x-rays. Our model can be trained with as few as five training examples, making it potentially applicable for diagnosis of rare diseases. In this work, we divide different chest diseases into two disjoint categories: (i) base classes (with large training set) and (ii) novel classes (with a few training examples per class). Our method consists of two steps, namely feature extraction and classification. For the feature extraction, we employ a deep convolutional neural network, customized for chest x-rays. We train the feature extractor with data only from base classes. So the novel classes are unseen to the feature extractor during training. However, we use the feature extractor for extracting features from the data of novel classes resulting in transfer learning. Our classifier, on the other hand, uses the data only from the novel classes for training. We introduce the idea of autoencoder ensemble to design the classifier. Only a few feature vectors from each of the novel classes are used for training the classifier making it a few-shot learner. Incorporating new novel classes require training only the classifier which makes the entire process extremely fast. The performance of the classifier is evaluated on the test data from the novel classes. Experiments show _ 18% improvement in the F1 score compared to the baseline on identifying the novel diseases from publicly available chest x-ray dataset.

[1]  Nassir Navab,et al.  'Squeeze & Excite' Guided Few-Shot Segmentation of Volumetric Images , 2019, Medical Image Anal..

[2]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[3]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Nico Karssemeijer,et al.  Large scale deep learning for computer aided detection of mammographic lesions , 2017, Medical Image Anal..

[5]  Jose Dolz,et al.  Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning , 2018, ArXiv.

[6]  Hao Chen,et al.  DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Nir Ailon,et al.  Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[8]  Santi Puch,et al.  Few-shot Learning with Deep Triplet Networks for Brain Imaging Modality Recognition , 2019, DART/MIL3ID@MICCAI.

[9]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[10]  Wenqing Sun,et al.  Computer aided lung cancer diagnosis with deep learning algorithms , 2016, SPIE Medical Imaging.

[11]  Quanming Yao,et al.  Few-shot Learning: A Survey , 2019, ArXiv.

[12]  Yu Tsao,et al.  Speech enhancement based on deep denoising autoencoder , 2013, INTERSPEECH.

[13]  Ayman M. Eldeib,et al.  Breast cancer classification using deep belief networks , 2016, Expert Syst. Appl..

[14]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[15]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[18]  E. Finkelstein,et al.  Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes , 2017, JAMA.

[19]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[20]  Peijun Du,et al.  Novel segmented stacked autoencoder for effective dimensionality reduction and feature extraction in hyperspectral imaging , 2016, Neurocomputing.