Estimating the undetected infections in the Covid-19 outbreak by harnessing capture-recapture methods

A major open question, affecting the policy makers decisions, is the estimation of the true size of COVID-19 infections. Most of them are undetected, because of a large number of asymptomatic cases. We provide an efficient, easy to compute and robust lower bound estimator for the number of undetected cases. A "modified" version of the Chao estimator is proposed, based on the cumulative time-series distribution of cases and deaths. Heterogeneity has been accounted for by assuming a geometrical distribution underlying the data generation process. An (approximated) analytical variance formula has been properly derived to compute reliable confidence intervals at 95%. An application to Austrian situation is provided and results from other European Countries are mentioned in the discussion.

[1]  Imperial College COVID-19 Response Team,et al.  Estimating the number of infections and the impact of non-pharmaceutical interventions on COVID-19 in European countries: technical description update , 2020, 2004.11342.

[2]  P. Colaneri,et al.  Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy , 2020, Nature Medicine.

[3]  R. Ranjan Estimating the Final Epidemic Size for COVID-19 Outbreak using Improved Epidemiological Models , 2020, medRxiv.

[4]  S. Battiato,et al.  Estimation of Unreported Novel Coronavirus (SARS-CoV-2) Infections from Reported Deaths: A Susceptible–Exposed–Infectious–Recovered–Dead Model , 2020, Journal of clinical medicine.

[5]  S. Bhatt,et al.  Report 13: Estimating the number of infections and the impact of non-pharmaceutical interventions on COVID-19 in 11 European countries , 2020 .

[6]  K. Khan,et al.  Estimating the number of undetected COVID-19 cases exported internationally from all of China , 2020, medRxiv.

[7]  G. Onder,et al.  Case-Fatality Rate and Characteristics of Patients Dying in Relation to COVID-19 in Italy. , 2020, JAMA.

[8]  Victoria Ng,et al.  Estimation of COVID-19 outbreak size in Italy , 2020, The Lancet Infectious Diseases.

[9]  H. Brüssow Faculty Opinions recommendation of Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia. , 2020 .

[10]  Cheng-Shang Chang,et al.  A Time-dependent SIR model for COVID-19 , 2020, ArXiv.

[11]  N. Linton,et al.  Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19) , 2020, International Journal of Infectious Diseases.

[12]  Jing Zhao,et al.  Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia , 2020, The New England journal of medicine.

[13]  Wei Zhang,et al.  Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019‐nCoV , 2020, Journal of evidence-based medicine.

[14]  L. Yang,et al.  Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak , 2020, bioRxiv.

[15]  Robert K. Colwell,et al.  Thirty years of progeny from Chao’s inequality: Estimating and comparing richness with incidence data and incomplete sampling , 2017 .

[16]  D. Böhning Ratio Plot and Ratio Regression with Applications to Social and Medical Sciences , 2016 .

[17]  A flexible ratio regression approach for zero‐truncated capture–recapture counts , 2016, Biometrics.

[18]  Capture–recapture estimation based upon the geometric distribution allowing for heterogeneity , 2013 .

[19]  Anne Chao,et al.  Estimating population size for sparse data in capture-recapture experiments , 1989 .

[20]  A. Chao Estimating the population size for capture-recapture data with unequal catchability. , 1987, Biometrics.