A Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Prediction Model From Standard Laboratory Tests

Abstract Background With the limited availability of testing for the presence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and concerns surrounding the accuracy of existing methods, other means of identifying patients are urgently needed. Previous studies showing a correlation between certain laboratory tests and diagnosis suggest an alternative method based on an ensemble of tests. Methods We have trained a machine learning model to analyze the correlation between SARS-CoV-2 test results and 20 routine laboratory tests collected within a 2-day period around the SARS-CoV-2 test date. We used the model to compare SARS-CoV-2 positive and negative patients. Results In a cohort of 75 991 veteran inpatients and outpatients who tested for SARS-CoV-2 in the months of March through July 2020, 7335 of whom were positive by reverse transcription polymerase chain reaction (RT-PCR) or antigen testing, and who had at least 15 of 20 lab results within the window period, our model predicted the results of the SARS-CoV-2 test with a specificity of 86.8%, a sensitivity of 82.4%, and an overall accuracy of 86.4% (with a 95% confidence interval of [86.0%, 86.9%]). Conclusions Although molecular-based and antibody tests remain the reference standard method for confirming a SARS-CoV-2 diagnosis, their clinical sensitivity is not well known. The model described herein may provide a complementary method of determining SARS-CoV-2 infection status, based on a fully independent set of indicators, that can help confirm results from other tests as well as identify positive cases missed by molecular testing.

[1]  P. Maes,et al.  A Recurrent Mutation at Position 26340 of SARS-CoV-2 Is Associated with Failure of the E Gene Quantitative Reverse Transcription-PCR Utilized in a Commercial Dual-Target Diagnostic Assay , 2020, Journal of Clinical Microbiology.

[2]  Y. Shoenfeld,et al.  COVID-19 as part of the hyperferritinemic syndromes: the role of iron depletion therapy , 2020, Immunologic Research.

[3]  K. Hupart,et al.  Eosinopenia and COVID-19 , 2020, The Journal of the American Osteopathic Association.

[4]  F. Zheng,et al.  Clinical characteristics of coronavirus disease 2019 in patients aged 80 years and older , 2020, Journal of Integrative Medicine.

[5]  L. Kucirka,et al.  Variation in False-Negative Rate of Reverse Transcriptase Polymerase Chain Reaction–Based SARS-CoV-2 Tests by Time Since Exposure , 2020, Annals of Internal Medicine.

[6]  Junjie Yang,et al.  Genome-wide variations of SARS-CoV-2 infer evolution relationship and transmission route , 2020, medRxiv.

[7]  F. A. Lagunas-Rangel Neutrophil‐to‐lymphocyte ratio and lymphocyte‐to‐C‐reactive protein ratio in patients with severe coronavirus disease 2019 (COVID‐19): A meta‐analysis , 2020, Journal of medical virology.

[8]  L. Wang,et al.  C-reactive protein levels in the early stage of COVID-19 , 2020, Médecine et Maladies Infectieuses.

[9]  Jin Gu,et al.  Novel Coronavirus Pneumonia (COVID-19) Progression Course in 17 Discharged Patients: Comparison of Clinical and Thin-Section CT Features During Recovery , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[10]  Richard D Riley,et al.  Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal , 2020, BMJ.

[11]  Taojiao Wang,et al.  Clinical and immunologic features in severe and moderate Coronavirus Disease 2019. , 2020, The Journal of clinical investigation.

[12]  P. Peng,et al.  Comparison of Hospitalized Patients With ARDS Caused by COVID-19 and H1N1 , 2020, Chest.

[13]  Qiurong Ruan,et al.  Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China , 2020, Intensive Care Medicine.

[14]  Jun Liu,et al.  Relation Between Chest CT Findings and Clinical Conditions of Coronavirus Disease (COVID-19) Pneumonia: A Multicenter Study. , 2020, AJR. American journal of roentgenology.

[15]  K. Yuen,et al.  Clinical Characteristics of Coronavirus Disease 2019 in China , 2020, The New England journal of medicine.

[16]  Ting Yu,et al.  Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a single-centered, retrospective, observational study , 2020, The Lancet Respiratory Medicine.

[17]  Lei Liu,et al.  Evaluating the accuracy of different respiratory specimens in the laboratory diagnosis and monitoring the viral shedding of 2019-nCoV infections , 2020, medRxiv.

[18]  L. Chen,et al.  [Analysis of clinical features of 29 patients with 2019 novel coronavirus pneumonia]. , 2020, Zhonghua jie he he hu xi za zhi = Zhonghua jiehe he huxi zazhi = Chinese journal of tuberculosis and respiratory diseases.

[19]  R. Eggo,et al.  Effectiveness of airport screening at detecting travellers infected with novel coronavirus (2019-nCoV) , 2020, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[20]  Ting Yu,et al.  Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study , 2020, The Lancet.

[21]  Amber Young,et al.  A data-driven approach to predicting diabetes and cardiovascular disease with machine learning , 2019, BMC Medical Informatics and Decision Making.

[22]  Shu Yang Flexible Imputation of Missing Data, 2nd ed. , 2019, Journal of the American Statistical Association.

[23]  B. Rath,et al.  Can we distinguish respiratory viral infections based on clinical features? A prospective pediatric cohort compared to systematic literature review , 2018, Reviews in medical virology.

[24]  Stef van Buuren,et al.  Flexible Imputation of Missing Data, Second Edition , 2018 .

[25]  Matjaž Kukar,et al.  An application of machine learning to haematological diagnosis , 2017, Scientific Reports.

[26]  Matjaž Kukar,et al.  Application of machine learning for hematological diagnosis , 2017 .

[27]  S. Kim,et al.  High-sensitivity C-reactive protein/albumin ratio as a predictor of in-hospital mortality in older adults admitted to the emergency department , 2017, Clinical and experimental emergency medicine.

[28]  F. He,et al.  The platelet-to-lymphocyte ratio, superior to the neutrophil-to-lymphocyte ratio, correlates with hepatitis C virus infection. , 2016, International journal of infectious diseases : IJID : official publication of the International Society for Infectious Diseases.

[29]  Mark Holodniy,et al.  Evaluation of Praedico™, A Next Generation Big Data Biosurveillance Application , 2015, Online Journal of Public Health Informatics.

[30]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[31]  Shiao-Chi Wu,et al.  Comparison of different comorbidity measures for use with administrative data in predicting short- and long-term mortality , 2010, BMC health services research.

[32]  John M. Chambers,et al.  Software for Data Analysis: Programming with R , 2008 .

[33]  Kristin J Leuschner,et al.  Resources and Capabilities of the Department of Veterans Affairs to Provide Timely and Accessible Care to Veterans. , 2016, Rand health quarterly.

[34]  John M. Chambers,et al.  Software for data analysis , 2008 .