Bayesian Analysis of Tests with Unknown Specificity and Sensitivity

When testing for a rare disease, prevalence estimates can be highly sensitive to uncertainty in the specificity and sensitivity of the test. Bayesian inference is a natural way to propagate these uncertainties, with hierarchical modeling capturing variation in these parameters across experiments. Another concern is the people in the sample not being representative of the general population. Statistical adjustment cannot without strong assumptions correct for selection bias in an opt-in sample, but multilevel regression and poststratification can at least adjust for known differences between sample and population. We demonstrate these models with code in R and Stan and discuss their application to a controversial recent study of COVID-19 antibodies in a sample of people from the Stanford University area. Wide posterior intervals make it impossible to evaluate the quantitative claims of that study regarding the number of unreported infections. For future studies, the methods described here should facilitate more accurate estimates of disease prevalence from imperfect tests performed on non-representative samples.

[1]  H. Rue,et al.  Bayesian bivariate meta‐analysis of diagnostic test studies with interpretable priors , 2015, Statistics in medicine.

[2]  J. Ioannidis,et al.  COVID-19 antibody seroprevalence in Santa Clara County, California , 2020, medRxiv.

[3]  David Hemenway,et al.  The Myth of Millions of Annual Self-Defense Gun Uses: A Case Study of Survey Overestimates of Rare Events , 1997 .

[4]  Sander Greenland,et al.  Bayesian perspectives for epidemiologic research: III. Bias analysis via missing-data methods. , 2009, International journal of epidemiology.

[5]  P. Levy Measurement Error and Misclassification in Statistics and Epidemiology: Impacts and Bayesian Adjustments , 2004 .

[6]  Ying Liu,et al.  Simulation-efficient shortest probability intervals , 2013, Stat. Comput..

[7]  Dusan Petrovic,et al.  Appendix: Seroprevalence of anti-SARS-COV-2 IgG antibodies in a population-based sample from Geneva, Switzerland , 2020 .

[8]  A. Gelman,et al.  Deep Interactions with MRP: Election Turnout and Voting Patterns Among Small Electoral Subgroups , 2013 .

[10]  Sebastian Bonhoeffer,et al.  Estimating seroprevalence with imperfect serological tests: a cutoff-free approach , 2020 .

[11]  P. Gustafson,et al.  Conservative prior distributions for variance parameters in hierarchical models , 2006 .

[12]  A. Gelman,et al.  Bayesian Analysis of Serial Dilution Assays , 2004, Biometrics.

[13]  Devin Caughey,et al.  Public Opinion in Subnational Politics , 2019, The Journal of Politics.

[14]  D. W. Maybury,et al.  A note on COVID-19 seroprevalence studies: a meta-analysis using hierarchical modelling , 2020, medRxiv.

[15]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[16]  Lisa M. Schwartz,et al.  PSYCHOLOGICAL SCIENCE IN THE PUBLIC INTEREST Helping Doctors and Patients Make Sense of Health Statistics , 2022 .