Correlation Improves Group Testing

Population-wide screening to identify and isolate infectious individuals is a powerful tool for controlling COVID-19 and other infectious diseases. Testing an entire population, however, requires significant resources. Group testing can enable large-scale screening by testing more people with fewer resources, but dilution degrades its sensitivity, reducing its effectiveness as an infection control measure. Analysis of this tradeoff typically assumes that pooled samples are independent. Building on recent empirical results in the literature, we argue that this assumption significantly underestimates the true benefits of group testing. Indeed, placing samples from a social group or household into the same pool correlates a pool's samples. As a result, a positive pool likely contains multiple positive samples, increasing a pooled test's sensitivity and also tending to reduce the number of pools that require follow-up tests. We prove that under a general correlation structure, pooling correlated samples together ("correlated pooling") achieves higher sensitivity and requires fewer tests per positive identified compared to independently pooling the samples ("naive pooling") using the same pool size within the two-stage Dorfman procedure, the most widely-used group testing method. To the best of our knowledge, our work is the first to theoretically characterize correlation's effect on sensitivity, and the first to study its effect on test usage under a realistic test error model. Under a representative starting prevalence of 1%, simulation results estimate that correlated pooling requires 12.9% fewer tests than naive pooling to control an epidemic. Thus, we argue that correlation is an important consideration for policy-makers designing infection control interventions: it makes screening more attractive for infection control and it suggests that sample collection should maximize correlation.

[1]  Hae-Young Kim,et al.  Comparison of Group Testing Algorithms for Case Identification in the Presence of Test Error , 2007, Biometrics.

[2]  A. Regev,et al.  Using viral load and epidemic dynamics to optimize pooled testing in resource constrained settings , 2020, medRxiv.

[3]  Eric Song,et al.  Saliva is more sensitive for SARS-CoV-2 detection in COVID-19 patients than nasopharyngeal swabs , 2020, medRxiv.

[4]  J. Norris Appendix: probability and measure , 1997 .

[5]  Arya Mazumdar,et al.  Group Testing Schemes From Codes and Designs , 2015, IEEE Transactions on Information Theory.

[6]  T. Cacoullos,et al.  Exercises In Probability , 1988 .

[7]  R. Moseley,et al.  Screening of Blood Donations for Zika Virus Infection - Puerto Rico, April 3-June 11, 2016. , 2016, MMWR. Morbidity and mortality weekly report.

[8]  Joshua M Tebbs,et al.  Two‐Stage Hierarchical Group Testing for Multiple Infections with Application to the Infertility Prevention Project , 2013, Biometrics.

[9]  Zuo-jun Shen,et al.  Optimizing and evaluating PCR-based pooled screening during COVID-19 pandemics , 2021, Scientific Reports.

[10]  L. Wein,et al.  Pooled testing for HIV prevalence estimation: exploiting the dilution effect. , 2015, Statistics in medicine.

[11]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[12]  M. Burns,et al.  Modelling the limit of detection in real-time quantitative PCR , 2008 .

[13]  Bastien Mallein,et al.  Group testing as a strategy for COVID-19 epidemiological monitoring and community surveillance , 2020, PLoS Comput. Biol..

[14]  Meningococcal disease. Secondary attack rate and chemoprophylaxis in the United States, 1974. , 1976, JAMA.

[15]  Michael G Hudgens,et al.  Group Testing for Case Identification with Correlated Responses , 2012, Biometrics.

[16]  L. Malone,et al.  Secondary Attack Rate of Tuberculosis in Urban Households in Kampala, Uganda , 2011, PloS one.

[17]  Cassidy Mentus,et al.  Analysis and Applications of Adaptive Group Testing Methods for COVID-19 , 2020, medRxiv.

[18]  M. Yassour,et al.  Lessons from applied large-scale pooling of 133,816 SARS-CoV-2 RT-PCR tests , 2020, Science Translational Medicine.

[19]  David C Christiani,et al.  Work-related COVID-19 transmission in six Asian countries/areas: A follow-up study , 2020, PloS one.

[20]  B. Healy,et al.  The impact of false positive COVID-19 results in an area of low prevalence. , 2020, Clinical medicine.

[21]  James S Koopman,et al.  Stochastic effects on endemic infection levels of disseminating versus local contacts. , 2002, Mathematical biosciences.

[22]  G. Wong,et al.  Rapid Response to an Outbreak in Qingdao, China , 2020, The New England journal of medicine.

[23]  A. Sterrett On the Detection of Defective Members of Large Populations , 1957 .

[24]  Wilfred Ndifon,et al.  A strategy for finding people infected with SARS-CoV-2: optimizing pooled testing at low prevalence , 2020, medRxiv.

[25]  Joshua M Tebbs,et al.  Regression models for group testing data with pool dilution effects. , 2013, Biostatistics.

[26]  Francesco Checchi,et al.  Variability in Intrahousehold Transmission of Ebola Virus, and Estimation of the Household Secondary Attack Rate , 2017, The Journal of infectious diseases.

[27]  Claire Donnat,et al.  Statistical Modeling for Practical Pooled Testing During the COVID-19 Pandemic , 2021, Statistical Science.

[28]  Ao Wang,et al.  Group Testing in a Pandemic: The Role of Frequent Testing, Correlated Risk, and Machine Learning , 2020 .

[29]  M. Salit,et al.  Testing at scale during the COVID-19 pandemic , 2021, Nature Reviews Genetics.

[30]  Christopher D. Pilcher,et al.  Optimizing Screening for Acute Human Immunodeficiency Virus Infection with Pooled Nucleic Acid Amplification Tests , 2008, Journal of Clinical Microbiology.

[31]  Ellsworth M. Campbell,et al.  Participation in Fraternity and Sorority Activities and the Spread of COVID-19 Among Residential University Communities — Arkansas, August 21–September 5, 2020 , 2021, MMWR. Morbidity and mortality weekly report.

[32]  Douglas R. Bish,et al.  Optimal Risk-Based Group Testing , 2019, Manag. Sci..

[33]  H. M. Finucan The Blood Testing Problem , 1964 .

[34]  S. Lohse,et al.  Pooling of samples for testing for SARS-CoV-2 in asymptomatic people , 2020, The Lancet Infectious Diseases.

[35]  A. Bateman,et al.  Assessing the dilution effect of specimen pooling on the sensitivity of SARS‐CoV‐2 PCR tests , 2020, Journal of medical virology.

[36]  Lawrence M. Wein,et al.  Pooled Testing for HIV Screening: Capturing the Dilution Effect , 2018, Oper. Res..

[37]  Milton Sobel,et al.  Group testing with a new goal, estimation , 1975 .

[38]  R. Phatarfod,et al.  The use of a square array scheme in blood testing. , 1994, Statistics in medicine.

[39]  M. Sobel,et al.  Group testing to eliminate efficiently all defectives in a binomial sample , 1959 .

[40]  J. Gaudino,et al.  Sensitivity and specificity of pooled versus individual sera in a human immunodeficiency virus antibody prevalence study , 1989, Journal of clinical microbiology.

[41]  Salinas,et al.  The Effect of Correlation and False Negatives in Pool Testing Strategies for COVID-19 (preprint) , 2020 .

[42]  Dmitry M. Malioutov,et al.  Boolean compressed sensing: LP relaxation for group testing , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[43]  Amin Karbasi,et al.  Compressed sensing with probabilistic measurements: A group testing solution , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[44]  Amar S Basu,et al.  Digital Assays Part I: Partitioning Statistics and Digital PCR , 2017, SLAS technology.

[45]  D. Pigott,et al.  Crowding and the shape of COVID-19 epidemics , 2020, Nature medicine.

[46]  George Atia,et al.  Boolean Compressed Sensing and Noisy Group Testing , 2009, IEEE Transactions on Information Theory.

[47]  Venkatesh Saligrama,et al.  Learning immune-defectives graph through group tests , 2015, 2015 IEEE International Symposium on Information Theory (ISIT).

[48]  Ely Porat,et al.  Search Methodologies , 2022 .

[49]  Mikhail B. Malyutov,et al.  Search for Sparse Active Inputs: A Review , 2013, Information Theory, Combinatorics, and Search Theory.

[50]  Byron J. T. Morgan,et al.  Bayesian Models for Limiting Dilution Assay and Group Test Data , 1996 .

[51]  Jens Niklas Eberhardt,et al.  Multi-Stage Group Testing Improves Efficiency of Large-Scale COVID-19 Screening , 2020, Journal of Clinical Virology.

[52]  R Brookmeyer,et al.  Feasibility of pooling sera for HIV-1 viral RNA to diagnose acute primary HIV-1 infection and estimate HIV incidence , 2000, AIDS.

[53]  Jos'e M. Mart'inez,et al.  Group Testing With Nested Pools , 2020, IEEE Transactions on Information Theory.

[54]  Thomas D. Schmittgen,et al.  Real-Time Quantitative PCR , 2002 .

[55]  L. Fang,et al.  Household secondary attack rate of COVID-19 and associated determinants in Guangzhou, China: a retrospective cohort study , 2020, The Lancet Infectious Diseases.

[56]  Yael Mandel-Gutfreund,et al.  Evaluation of COVID-19 RT-qPCR test in multi-sample pools , 2020, medRxiv.

[57]  T. Bärnighausen,et al.  Simulation of pooled-sample analysis strategies for COVID-19 mass testing , 2020, Bulletin of the World Health Organization.

[58]  David M. Warshauer,et al.  Pooling Nasopharyngeal/Throat Swab Specimens To Increase Testing Capacity for Influenza Viruses by PCR , 2011, Journal of Clinical Microbiology.

[59]  David Eppstein,et al.  Improved Combinatorial Group Testing Algorithms for Real-World Problem Sizes , 2005, SIAM J. Comput..

[60]  Amin Karbasi,et al.  Graph-Constrained Group Testing , 2010, IEEE Transactions on Information Theory.

[61]  D. Du,et al.  Combinatorial Group Testing and Its Applications , 1993 .

[62]  Andrei Z. Broder,et al.  A Note on Double Pooling Tests , 2020, ArXiv.

[63]  K. Taniguchi,et al.  Assessment of secondary attack rate and effectiveness of antiviral prophylaxis among household contacts in an influenza A(H1N1)v outbreak in Kobe, Japan, May-June 2009. , 2009, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[64]  Douglas R. Bish,et al.  A methodology for deriving the sensitivity of pooled testing, based on viral load progression and pooling dilution , 2019, Journal of Translational Medicine.

[65]  G. Dorsey,et al.  PCR-Based Pooling of Dried Blood Spots for Detection of Malaria Parasites: Optimization and Application to a Cohort of Ugandan Children , 2010, Journal of Clinical Microbiology.

[66]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[67]  D. Mak,et al.  Secondary attack rate of pandemic influenza A(H1N1) 2009 in Western Australian households, 29 May-7 August 2009. , 2011, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[68]  G. Biele,et al.  An analysis of SARS-CoV-2 viral load by patient age , 2020, medRxiv.

[69]  Christopher D Pilcher,et al.  Real-time, universal screening for acute HIV infection in a routine HIV counseling and testing population. , 2002, JAMA.

[70]  W. Swallow,et al.  Robustness of Group Testing in the Estimation of Proportions , 1999, Biometrics.

[71]  Mohan Sabyasachi Paragkumar Dhirajlal Praveen Archisman Bairwa Behera Chavda Kulkarni Mohapatra Ram Combating the COVID-19 pandemic in a resource-constrained setting: insights from initial response in India , 2020, BMJ Global Health.

[72]  Magnus Unemo,et al.  Pooling samples: the key to sensitive, specific and cost-effective genetic diagnosis of Chlamydia trachomatis in low-resource countries. , 2007, Acta dermato-venereologica.

[73]  Robert Roeloffs,et al.  Group Testing in the Presence of Test Error; An Extension of the Dorfman Procedure , 1972 .

[74]  Jos J.A.M. Weusten,et al.  Mathematic modeling of the risk of HBV, HCV, and HIV transmission by window‐phase donations not detected by NAT , 2002, Transfusion.

[75]  M. Halloran,et al.  Household transmission of SARS-CoV-2: a systematic review and meta-analysis of secondary attack rate , 2020, medRxiv.

[76]  F. K. Hwang,et al.  Group testing with a dilution effect , 1976 .

[77]  Markus Eickmann,et al.  Blood Screening for Influenza , 2007, Emerging infectious diseases.

[78]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[79]  Emanuel Knill,et al.  Non-adaptive Group Testing in the Presence of Errors , 1998, Discret. Appl. Math..

[80]  Hrayer Aprahamian,et al.  Sequential prevalence estimation with pooling and continuous test outcomes , 2018, Statistics in medicine.

[81]  Karima R. Nigmatulina,et al.  The scaling of contact rates with population density for the infectious disease models. , 2013, Mathematical biosciences.

[82]  Cheng-Shang Chang,et al.  Positively Correlated Samples Save Pooled Testing Costs , 2020, IEEE Transactions on Network Science and Engineering.

[83]  Estimating a Proportion Based on Group Testing for Correlated Binary Response , 2013 .

[84]  D. Gupta,et al.  Group testing in presence of classification errors. , 1999, Statistics in medicine.

[85]  T. Quinn,et al.  Successful use of pooled sera to determine HIV-1 seroprevalence in Zaire with development of cost-efficiency models. , 1990, AIDS.

[86]  Atri Rudra,et al.  Efficiently decodable non-adaptive group testing , 2010, SODA '10.