Open resource of clinical data from patients with pneumonia for the prediction of COVID-19 outcomes via deep learning

Data from patients with coronavirus disease 2019 (COVID-19) are essential for guiding clinical decision making, for furthering the understanding of this viral disease, and for diagnostic modelling. Here, we describe an open resource containing data from 1,521 patients with pneumonia (including COVID-19 pneumonia) consisting of chest computed tomography (CT) images, 130 clinical features (from a range of biochemical and cellular analyses of blood and urine samples) and laboratory-confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) clinical status. We show the utility of the database for prediction of COVID-19 morbidity and mortality outcomes using a deep learning algorithm trained with data from 1,170 patients and 19,685 manually labelled CT slices. In an independent validation cohort of 351 patients, the algorithm discriminated between negative, mild and severe cases with areas under the receiver operating characteristic curve of 0.944, 0.860 and 0.884, respectively. The open database may have further uses in the diagnosis and management of patients with COVID-19.
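The three reported areas under the curve, one each for the negative, mild and severe classes, suggest a one-vs-rest evaluation, although the abstract does not say how they were computed. The following is a minimal sketch of that calculation under that assumption, using only the Python standard library. The function names (`binary_auc`, `one_vs_rest_auc`) and the toy probabilities are hypothetical and purely illustrative; they are not taken from the database or from the authors' code.

```python
from typing import Dict, List, Sequence


def binary_auc(scores: Sequence[float], is_positive: Sequence[bool]) -> float:
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for s, p in zip(scores, is_positive) if p]
    neg = [s for s, p in zip(scores, is_positive) if not p]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                wins += 1.0
            elif sp == sn:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def one_vs_rest_auc(
    probs: Sequence[Sequence[float]],
    labels: Sequence[str],
    classes: Sequence[str],
) -> Dict[str, float]:
    """Compute one AUC per class, treating that class as positive
    and all remaining classes as negative."""
    result: Dict[str, float] = {}
    for i, cls in enumerate(classes):
        class_scores = [row[i] for row in probs]
        class_truth = [y == cls for y in labels]
        result[cls] = binary_auc(class_scores, class_truth)
    return result


# Toy example (hypothetical values, NOT from the described cohort).
classes: List[str] = ["negative", "mild", "severe"]
labels: List[str] = ["negative", "negative", "mild", "mild", "severe", "severe"]
probs: List[List[float]] = [
    [0.8, 0.1, 0.1],
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.5, 0.3, 0.2],
    [0.1, 0.3, 0.6],
    [0.1, 0.2, 0.7],
]

aucs = one_vs_rest_auc(probs, labels, classes)
for name in classes:
    print(f"{name}: AUC = {aucs[name]:.3f}")
```

The pairwise comparison is quadratic in the number of cases, which is adequate for a validation cohort of a few hundred patients; a rank-based formulation gives the same value more efficiently on larger cohorts.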
