Exact inference for disease prevalence based on a test with unknown specificity and sensitivity

To make informative public policy decisions in battling the ongoing COVID-19 pandemic, it is important to know the disease prevalence in a population. There are two intertwined difficulties in estimating this prevalence based on testing results from a group of subjects. First, the test is prone to measurement error with unknown sensitivity and specificity. Second, the prevalence tends to be low at the initial stage of the pandemic and we may not be able to determine if a positive test result is a false positive due to the imperfect specificity of the test. The statistical inference based on large sample approximation or conventional bootstrap may not be sufficiently reliable and yield confidence intervals that do not cover the true prevalence at the nominal level. In this paper, we have proposed a set of 95% confidence intervals, whose validity is guaranteed and doesn't depend on the sample size in the unweighted setting. For the weighted setting, the proposed inference is equivalent to a class of hybrid bootstrap methods, whose performance is also more robust to the sample size than those based on asymptotic approximations. The methods are used to reanalyze data from a study investigating the antibody prevalence in Santa Clara county, California, which was the motivating example of this research, in addition to several other seroprevalence studies where authors had tried to correct their estimates for test performance. Extensive simulation studies have been conducted to examine the finite-sample performance of the proposed confidence intervals.

[1]  J. Reiczigel,et al.  Exact confidence limits for prevalence of a disease with an imperfect diagnostic test , 2010, Epidemiology and Infection.

[2]  L. Tian,et al.  Exact inference on the random‐effects model for meta‐analyses with few studies , 2019, Biometrics.

[3]  O. Brady,et al.  Epidemiological and clinical characteristics of the COVID-19 epidemic in Brazil , 2020, Nature Human Behaviour.

[4]  P. Weihe,et al.  Seroprevalence of SARS-CoV-2–Specific Antibodies, Faroe Islands , 2020, Emerging infectious diseases.

[5]  W O Johnson,et al.  Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. , 2000, Preventive veterinary medicine.

[6]  L. Tian,et al.  Adjusting Coronavirus Prevalence Estimates for Laboratory Test Kit Error , 2020, American journal of epidemiology.

[7]  I S Chan,et al.  Test‐Based Exact Confidence Intervals for the Difference of Two Binomial Proportions , 1999, Biometrics.

[8]  Z. Dai,et al.  Prevalence and severity of corona virus disease 2019 (COVID-19): A systematic review and meta-analysis , 2020, Journal of Clinical Virology.

[9]  T. Lai,et al.  Hybrid resampling methods for confidence intervals , 1998 .

[10]  George Turabelidze,et al.  Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States, March 23-May 12, 2020. , 2020, JAMA internal medicine.

[11]  Gregory M. Goldgof,et al.  SARS-CoV-2 seroprevalence and neutralizing activity in donor and patient blood , 2020, Nature Communications.

[12]  R. Cousins,et al.  A Unified Approach to the Classical Statistical Analysis of Small Signals , 1997, physics/9711021.

[13]  C. Sempos,et al.  Adjusting Coronavirus Prevalence Estimates for Laboratory Test Kit Error , 2020, medRxiv.

[14]  J. Ioannidis,et al.  COVID-19 antibody seroprevalence in Santa Clara County, California , 2021, International journal of epidemiology.

[15]  Benjamin Reiser,et al.  Construction of confidence regions in the ROC space after the estimation of the optimal Youden index‐based cut‐off point , 2013, Biometrics.

[16]  C. Signorelli,et al.  COVID-19 in Italy: impact of containment measures and prevalence estimates of infection in the general population , 2020, Acta bio-medica : Atenei Parmensis.

[17]  Jingjing Yin,et al.  Joint inference about sensitivity and specificity at the optimal cut-off point associated with Youden index , 2014, Comput. Stat. Data Anal..

[18]  H. Ullum,et al.  Estimation of SARS-CoV-2 Infection Fatality Rate by Real-time Antibody Screening of Blood Donors , 2020, medRxiv.

[19]  J. Ioannidis,et al.  The infection fatality rate of COVID-19 inferred from seroprevalence data , 2020, medRxiv.

[20]  Lu Tian,et al.  Exact inference for the random‐effect model for meta‐analyses with rare events , 2019, Statistics in medicine.

[21]  B. Horta,et al.  SARS-CoV-2 antibody prevalence in Brazil: results from two successive nationwide serological household surveys , 2020, The Lancet Global Health.

[22]  Zeynep Ceylan Estimation of COVID-19 prevalence in Italy, Spain, and France , 2020, Science of The Total Environment.

[23]  Michael Woodroofe,et al.  ON THE UNIFIED METHOD WITH NUISANCE PARAMETERS , 2009 .