Synthetic COVID-19 Chest X-ray Dataset for Computer-Aided Diagnosis

We introduce a new dataset called Synthetic COVID-19 Chest X-ray Dataset 1 for training machine learning models. The dataset consists of 21,295 synthetic COVID-19 chest X-ray images to be used for computer-aided diagnosis. These images, generated via an unsupervised domain adaptation approach, are of high quality. We find that the synthetic images not only improve performance of various deep learning architectures when used as additional training data under heavy imbalance conditions (skew > 90), but also detect the target class with high confidence. We also find that comparable performance can also be achieved when trained only on synthetic images. Further, salient features of the synthetic COVID-19 images indicate that the distribution is significantly different from Non-COVID-19 classes, enabling a proper decision boundary. We hope the availability of such high fidelity chest X-ray images of COVID-19 will encourage advances in the development of diagnostic and/or management tools.

[1]  Ali Narin,et al.  Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks , 2020, Pattern Analysis and Applications.

[2]  Antonio Pertusa,et al.  PadChest: A large chest x-ray image dataset with multi-label annotated reports , 2019, Medical Image Anal..

[3]  A. Ben Hamza,et al.  Synthesis of COVID-19 chest X-rays using unpaired image-to-image translation , 2020, Social Network Analysis and Mining.

[4]  A. Ben Hamza,et al.  Melanoma detection using adversarial training and deep transfer learning , 2020, Physics in medicine and biology.

[5]  Till Döhmen,et al.  DeepCOVIDExplainer: Explainable COVID-19 Predictions Based on Chest X-ray Images , 2020, ArXiv.

[6]  Roger G. Mark,et al.  MIMIC-CXR: A large publicly available database of labeled chest radiographs , 2019, ArXiv.

[7]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[8]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[9]  Kevin A. Schneider,et al.  Automatic Detection of Coronavirus Disease (COVID-19) in X-ray and CT Images: A Machine Learning Based Approach , 2020, Biocybernetics and Biomedical Engineering.

[10]  Q. Tao,et al.  Correlation of Chest CT and RT-PCR Testing in Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases , 2020, Radiology.

[11]  K. Yuen,et al.  Imaging Profile of the COVID-19 Infection: Radiologic Findings and Literature Review , 2020, Radiology. Cardiothoracic imaging.

[12]  Y. Hu,et al.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, The Lancet.

[13]  Abdul Hafeez,et al.  COVID-ResNet: A Deep Learning Framework for Screening of COVID19 from Radiographs , 2020, ArXiv.

[14]  D. Zhu,et al.  COVID-MobileXpert: On-Device COVID-19 Screening using Snapshots of Chest X-Ray , 2020, ArXiv.

[15]  Joseph Paul Cohen,et al.  COVID-19 Image Data Collection: Prospective Predictions Are the Future , 2020, ArXiv.

[16]  Alexander Wong,et al.  COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest Radiography Images , 2020, ArXiv.