Systematic evaluation and external validation of 22 prognostic models among hospitalised adults with COVID-19: an observational cohort study

Background The number of proposed prognostic models for COVID-19, which aim to predict disease outcomes, is growing rapidly. It is not known whether any are suitable for widespread clinical implementation. We addressed this question by independent and systematic evaluation of their performance among hospitalised COVID-19 cases. Methods We conducted an observational cohort study to assess candidate prognostic models, identified through a living systematic review. We included consecutive adults admitted to a secondary care hospital with PCR-confirmed or clinically diagnosed community-acquired COVID-19 (1st February to 30th April 2020). We reconstructed candidate models as per their original descriptions and evaluated performance for their original intended outcomes (clinical deterioration or mortality) and time horizons. We assessed discrimination using the area under the receiver operating characteristic curve (AUROC), and calibration using calibration plots, slopes and calibration-in-the-large. We calculated net benefit compared to the default strategies of treating all and no patients, and against the most discriminating predictor in univariable analyses, based on a limited subset of a priori candidates. Results We tested 22 candidate prognostic models among a cohort of 411 participants, of whom 180 (43.8%) and 115 (28.0%) met the endpoints of clinical deterioration and mortality, respectively. The highest AUROCs were achieved by the NEWS2 score for prediction of deterioration over 24 hours (0.78; 95% CI 0.73-0.83), and a novel model for prediction of deterioration <14 days from admission (0.78; 0.74-0.82). Calibration appeared generally poor for models that used probability outcomes. In univariable analyses, admission oxygen saturation on room air was the strongest predictor of in-hospital deterioration (AUROC 0.76; 0.71-0.81), while age was the strongest predictor of in-hospital mortality (AUROC 0.76; 0.71-0.81). No prognostic model demonstrated consistently higher net benefit than using the most discriminating univariable predictors to stratify treatment, across a range of threshold probabilities. Conclusions Oxygen saturation on room air and patient age are strong predictors of deterioration and mortality among hospitalised adults with COVID-19, respectively. None of the prognostic models evaluated offer incremental value for patient stratification to these univariable predictors.

[1]  L. Mombaerts,et al.  An interpretable mortality prediction model for COVID-19 patients , 2020, Nature Machine Intelligence.

[2]  W. Lim,et al.  Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study , 2003, Thorax.

[3]  Andrew J. Vickers,et al.  A simple, step-by-step guide to interpreting decision curve analysis , 2019, Diagnostic and Prognostic Research.

[4]  Wen Yin,et al.  Association of radiologic findings with mortality of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, medRxiv.

[5]  J. McDevitt,et al.  Clinical Decision Support Tool and Rapid Point-of-Care Platform for Determining Disease Severity in Patients with COVID-19 , 2020, medRxiv.

[6]  Ni Yao,et al.  Comparing Rapid Scoring Systems in Mortality Prediction of Critically Ill Patients With Novel Coronavirus Disease , 2020, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[7]  A clinical risk score to identify patients with COVID-19 at high risk of critical care admission or death: An observational cohort study , 2020, Journal of Infection.

[8]  Patrick Royston,et al.  Multiple imputation using chained equations: Issues and guidance for practice , 2011, Statistics in medicine.

[9]  Xilong Deng,et al.  Prognostic factors for COVID-19 pneumonia progression to severe symptom based on the earlier clinical features: a retrospective analysis , 2020, medRxiv.

[10]  M. Kuo,et al.  Frequency and Distribution of Chest Radiographic Findings in COVID-19 Positive Patients , 2019, Radiology.

[11]  Nuno Ferreira,et al.  Estimation of risk factors for COVID-19 mortality - preliminary results , 2020, medRxiv.

[12]  G. Collins,et al.  PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies , 2019, Annals of Internal Medicine.

[13]  Vasa Curcin,et al.  Evaluation and improvement of the National Early Warning Score (NEWS2) for COVID-19: a multi-hospital study , 2021, BMC Medicine.

[14]  Xingdong Chen,et al.  Early prediction of mortality risk among severe COVID-19 patients using machine learning , 2020, medRxiv.

[15]  A. A. Kramer,et al.  TOWARD A COVID-19 SCORE-RISK ASSESSMENTS AND REGISTRY , 2020, medRxiv.

[16]  Lee-Jen Wei,et al.  Remdesivir for the Treatment of Covid-19 - Preliminary Report. , 2020, The New England journal of medicine.

[17]  Riccardo Miotto,et al.  Machine Learning to Predict Mortality and Critical Events in COVID-19 Positive New York City Patients , 2020, medRxiv.

[18]  A. Peleg,et al.  Remdesivir for the Treatment of Covid-19 - Preliminary Report. , 2020, The New England journal of medicine.

[19]  Jing Xu,et al.  Prediction for Progression Risk in Patients with COVID-19 Pneumonia: the CALL Score , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[20]  G. Heinze,et al.  Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal , 2020, BMJ.

[21]  Yu Shi,et al.  Host susceptibility to severe COVID-19 and establishment of a host risk score: findings of 487 cases outside Wuhan , 2020, Critical Care.

[22]  Mike Clarke,et al.  A minimal common outcome measure set for COVID-19 clinical research , 2020, The Lancet Infectious Diseases.

[23]  Partha Chakrabarti,et al.  A Machine Learning Model Reveals Older Age and Delayed Hospitalization as Predictors of Mortality in Patients with COVID-19 , 2020, medRxiv.

[24]  L. Lind,et al.  Rapid Emergency Medicine score: a new prognostic tool for in‐hospital mortality in nonsurgical emergency department patients , 2004, Journal of internal medicine.

[25]  Johannes B. Reitsma,et al.  Individual Participant Data (IPD) Meta-analyses of Diagnostic and Prognostic Modeling Studies: Guidance on Their Use , 2015, PLoS medicine.

[26]  Giacomo Grasselli,et al.  Critical Care Utilization for the COVID-19 Outbreak in Lombardy, Italy: Early Experience and Forecast During an Emergency Response. , 2020, JAMA.

[27]  Jennifer L. Bell,et al.  Effect of Dexamethasone in Hospitalized Patients with COVID-19: Preliminary Report , 2020, medRxiv.

[28]  R. Kolamunnage-Dona,et al.  Time-dependent ROC curve analysis in medical research: current methods and applications , 2017, BMC Medical Research Methodology.

[29]  Omar Yaxmehen Bello-Chavolla,et al.  Predicting mortality due to SARS-CoV-2: A mechanistic score relating obesity and diabetes to COVID-19 outcomes in Mexico , 2020, The Journal of clinical endocrinology and metabolism.

[30]  Mohammad Pourhomayoun,et al.  Predicting Mortality Risk in Patients with COVID-19 Using Artificial Intelligence to Help Medical Decision-Making , 2020, medRxiv.

[31]  Shinichi Tokuno,et al.  Prediction of the clinical outcome of COVID-19 patients using T lymphocyte subsets with 340 cases from Wuhan, China: a retrospective cohort study and a web visualization tool , 2020, medRxiv.

[32]  P. Horby,et al.  Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: prospective observational cohort study , 2020, BMJ.

[33]  M. Lipsitch,et al.  The demand for inpatient and ICU beds for COVID-19 in the US: lessons from Chinese cities , 2020, medRxiv.

[34]  Lena Osterhagen,et al.  Multiple Imputation For Nonresponse In Surveys , 2016 .

[35]  Gary S Collins,et al.  Sample size considerations for the external validation of a multivariable prognostic model: a resampling study , 2015, Statistics in medicine.

[36]  Gary B. Smith,et al.  The ability of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death. , 2013, Resuscitation.

[37]  Gary S Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration , 2015, Annals of Internal Medicine.

[38]  Yaling Shi,et al.  A Tool to Early Predict Severe Corona Virus Disease 2019 (COVID-19) : A Multicenter Study using the Risk Nomogram in Wuhan and Guangdong, China , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[39]  Jian-feng Xie,et al.  Development and external validation of a prognostic multivariable model on admission for hospitalized patients with COVID-19 , 2020, medRxiv.

[40]  T. Rea,et al.  Assessment of Clinical Criteria for Sepsis: For the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[41]  G. Rothblum,et al.  Performing risk stratification for COVID-19 when individual level data is not available, the experience of a large healthcare organization , 2020, medRxiv.

[42]  Jian Sun,et al.  ACP risk grade: a simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China , 2020, medRxiv.

[43]  Nicola Sverzellati,et al.  Well-aerated Lung on Admitting Chest CT to Predict Adverse Outcome in COVID-19 Pneumonia , 2020, Radiology.

[44]  Xian-gao Jiang,et al.  Towards an Artificial Intelligence Framework for Data-Driven Prediction of Coronavirus Clinical Severity , 2020 .

[45]  Jian Sun,et al.  Development and validation of an early warning score (EWAS) for predicting clinical deterioration in patients with coronavirus disease 2019 , 2020, medRxiv.

[46]  X. Qi,et al.  Machine learning-based CT radiomics model for predicting hospital stay in patients with pneumonia associated with SARS-CoV-2 infection: A multicenter study , 2020, medRxiv.

[47]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[48]  Stephane Tran Ba,et al.  Holistic AI-Driven Quantification, Staging and Prognosis of COVID-19 Pneumonia , 2020 .

[49]  Yu Zhou,et al.  Predicting COVID-19 malignant progression with AI techniques , 2020, medRxiv.

[50]  Eun Ji Kim,et al.  Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. , 2020, JAMA.

[51]  T. Hallett,et al.  Report 17: Clinical characteristics and predictors of outcomes of hospitalised patients with COVID-19 in a London NHS Trust: a retrospective cohort study , 2020 .

[52]  C. Li,et al.  Risk prediction for poor outcome and death in hospital in-patients with COVID-19: derivation in Wuhan, China and external validation in London, UK , 2020, medRxiv.

[53]  A. DAS,et al.  Predicting community mortality risk due to CoVID-19 using machine learning and development of a prediction tool , 2020, medRxiv.

[54]  J. Hewitt,et al.  The effect of frailty on survival in patients with COVID-19 (COPE): a multicentre, European, observational cohort study , 2020, The Lancet Public Health.

[55]  G. Collins,et al.  Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement , 2015, BMJ : British Medical Journal.

[56]  Lei Liu,et al.  Risk assessment of progression to severe conditions for patients with COVID-19 pneumonia: a single-center retrospective study , 2020, medRxiv.

[57]  C. Subbe,et al.  Validation of a modified Early Warning Score in medical admissions. , 2001, QJM : monthly journal of the Association of Physicians.