Machine learning based early warning system enables accurate mortality risk prediction for COVID-19

Soaring cases of coronavirus disease (COVID-19) are pummeling the global health system. Overwhelmed health facilities have endeavored to mitigate the pandemic, but mortality of COVID-19 continues to increase. Here, we present a mortality risk prediction model for COVID-19 (MRPMC) that uses patients’ clinical data on admission to stratify patients by mortality risk, which enables prediction of physiological deterioration and death up to 20 days in advance. This ensemble model is built using four machine learning methods including Logistic Regression, Support Vector Machine, Gradient Boosted Decision Tree, and Neural Network. We validate MRPMC in an internal validation cohort and two external validation cohorts, where it achieves an AUC of 0.9621 (95% CI: 0.9464–0.9778), 0.9760 (0.9613–0.9906), and 0.9246 (0.8763–0.9729), respectively. This model enables expeditious and accurate mortality risk stratification of patients with COVID-19, and potentially facilitates more responsive health systems that are conducive to high risk COVID-19 patients.

[1]  Limin Ou,et al.  Development and Validation of a Clinical Risk Score to Predict the Occurrence of Critical Illness in Hospitalized Patients With COVID-19. , 2020, JAMA internal medicine.

[2]  J. Marrero,et al.  Comparison of imputation methods for missing laboratory data in medicine , 2013, BMJ Open.

[3]  Zunyou Wu,et al.  Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. , 2020, JAMA.

[4]  K. Yuen,et al.  Clinical Characteristics of Coronavirus Disease 2019 in China , 2020, The New England journal of medicine.

[5]  P. Lambin,et al.  Development of a Clinical Decision Support System for Severity Risk Prediction and Triage of COVID-19 Patients at Hospital Admission: an International Multicenter Study , 2020, medRxiv.

[6]  D. Morrow,et al.  COVID-19 and Disruptive Modifications to Cardiac Critical Care Delivery , 2020, Journal of the American College of Cardiology.

[7]  Raymond Y Huang,et al.  AI Augmentation of Radiologist Performance in Distinguishing COVID-19 from Pneumonia of Other Etiology on Chest CT , 2020, Radiology.

[8]  J. Xiang,et al.  Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study , 2020, The Lancet.

[9]  Jeffrey Dean,et al.  Machine Learning in Medicine , 2019, The New England journal of medicine.

[10]  Yaling Shi,et al.  A Tool to Early Predict Severe Corona Virus Disease 2019 (COVID-19) : A Multicenter Study using the Risk Nomogram in Wuhan and Guangdong, China , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[11]  Spiros C. Denaxas,et al.  A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service , 2019, The Lancet. Digital health.

[12]  Xin Zhou,et al.  Risk Factors Associated With Acute Respiratory Distress Syndrome and Death in Patients With Coronavirus Disease 2019 Pneumonia in Wuhan, China , 2020, The Journal of Emergency Medicine.

[13]  S. Singh-Carlson INTERNATIONAL PERSPECTIVE. , 2015, Canadian oncology nursing journal = Revue canadienne de nursing oncologique.

[14]  D. Morrow,et al.  Disruptive Modifications to Cardiac Critical Care Delivery During the Covid-19 Pandemic: An International Perspective , 2020, Journal of the American College of Cardiology.

[15]  Gary S Collins,et al.  Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness , 2020, BMJ.

[16]  B. Dai,et al.  Identification and Validation of Stromal Immunotype Predict Survival and Benefit from Adjuvant Chemotherapy in Patients with Muscle-Invasive Bladder Cancer , 2018, Clinical Cancer Research.

[17]  M. Chua,et al.  SARS-CoV-2 Transmission in Patients With Cancer at a Tertiary Care Hospital in Wuhan, China , 2020, JAMA oncology.

[18]  Jun Yu Li,et al.  Clinical characteristics, outcomes, and risk factors for mortality in patients with cancer and COVID-19 in Hubei, China: a multicentre, retrospective, cohort study , 2020, The Lancet Oncology.

[19]  Dennis Andersson,et al.  A retrospective cohort study , 2018 .

[20]  L. Gostin,et al.  The Novel Coronavirus Originating in Wuhan, China: Challenges for Global Health Governance. , 2020, JAMA.

[21]  Mohammad Pourhomayoun,et al.  Predicting Mortality Risk in Patients with COVID-19 Using Artificial Intelligence to Help Medical Decision-Making , 2020, medRxiv.

[22]  Chuan Liu,et al.  Machine learning-based CT radiomics method for predicting hospital stay in patients with pneumonia associated with SARS-CoV-2 infection: a multicenter study. , 2020, Annals of translational medicine.

[23]  Richard D Riley,et al.  Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal , 2020 .

[24]  Y. Hu,et al.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, The Lancet.

[25]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[26]  Stef van Buuren,et al.  Flexible Imputation of Missing Data, Second Edition , 2018 .

[27]  Z. Fayad,et al.  Artificial intelligence–enabled rapid diagnosis of patients with COVID-19 , 2020, Nature Medicine.

[28]  Yan Zhao,et al.  Clinical Characteristics of 138 Hospitalized Patients With 2019 Novel Coronavirus-Infected Pneumonia in Wuhan, China. , 2020, JAMA.

[29]  Jian Sun,et al.  ACP risk grade: a simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China , 2020, medRxiv.

[30]  P. Lambin,et al.  Development of a clinical decision support system for severity risk prediction and triage of COVID-19 patients at hospital admission: an international multicentre study , 2020, European Respiratory Journal.

[31]  R. C. Macridis A review , 1963 .

[32]  G. Heinze,et al.  Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal , 2020, BMJ.

[33]  Yu Shi,et al.  Host susceptibility to severe COVID-19 and establishment of a host risk score: findings of 487 cases outside Wuhan , 2020, Critical Care.

[34]  Peter Bühlmann,et al.  MissForest - non-parametric missing value imputation for mixed-type data , 2011, Bioinform..

[35]  Yaling Shi,et al.  A Tool to Early Predict Severe 2019-Novel Coronavirus Pneumonia (COVID-19) : A Multicenter Study using the Risk Nomogram in Wuhan and Guangdong, China , 2020, medRxiv.

[36]  Zhiquan Hu,et al.  Clinical characteristics and risk factors associated with COVID-19 disease severity in patients with cancer in Wuhan, China: a multicentre, retrospective, cohort study , 2020, The Lancet Oncology.

[37]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[38]  Yu Zhou,et al.  Predicting COVID-19 malignant progression with AI techniques , 2020, medRxiv.

[39]  Wen Yin,et al.  Association of radiologic findings with mortality of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, medRxiv.

[40]  Sharon J Peacock,et al.  Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review. , 2020, JAMA.

[41]  A. Cho AI systems aim to sniff out coronavirus outbreaks. , 2020, Science.

[42]  Arturo Gonzalez-Izquierdo,et al.  UK phenomics platform for developing and validating electronic health record phenotypes: CALIBER , 2019, J. Am. Medical Informatics Assoc..