Regression Models for Disease Prevalence with Diagnostic Tests on Pools of Serum Samples

Whether the aim is to diagnose individuals or estimate prevalence, many epidemiological studies have demonstrated the successful use of tests on pooled sera. These tests detect whether at least one sample in the pool is positive. Although originally designed to reduce diagnostic costs, testing pools also lowers false positive and negative rates in low prevalence settings and yields more precise prevalence estimates. Current methods are aimed at estimating the average population risk from diagnostic tests on pools. In this article, we extend the original class of risk estimators to adjust for covariates recorded on individual pool members. Maximum likelihood theory provides a flexible estimation method that handles different covariate values in the pool, different pool sizes, and errors in test results. In special cases, software for generalized linear models can be used. Pool design has a strong impact on precision and cost efficiency, with covariate-homogeneous pools carrying the largest amount of information. We perform joint pool and sample size calculations using information from individual contributors to the pool and show that a good design can severely reduce cost and yet increase precision. The methods are illustrated using data from a Kenyan surveillance study of HIV. Compared to individual testing, age-homogeneous, optimal-sized pools of average size seven reduce cost to 44% of the original price with virtually no loss in precision.

[1]  Marcello Pagano,et al.  On the informativeness and accuracy of pooled testing in estimating prevalence of a rare disease: Application to HIV screening , 1995 .

[2]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[3]  C. Farrington Estimating prevalence by group testing using generalized linear models. , 1992, Statistics in medicine.

[4]  J. Gaudino,et al.  Sensitivity and specificity of pooled versus individual sera in a human immunodeficiency virus antibody prevalence study , 1989, Journal of clinical microbiology.

[5]  Marcello Pagano,et al.  Screening for the Presence of a Disease by Pooling Sera Samples , 1994 .

[6]  Asit P. Basu,et al.  Aspects of Statistical Inference , 1996, Technometrics.

[7]  R. Dorfman The Detection of Defective Members of Large Populations , 1943 .

[8]  R Brookmeyer,et al.  Analysis of Multistage Pooling Studies of Biological Specimens for Estimating Disease Incidence and Prevalence , 1999, Biometrics.

[9]  Jacqueline M. Hughes-Oliver,et al.  A Two-Stage Adaptive Group-Testing Procedure for Estimating Small Proportions , 1994 .

[10]  Farah,et al.  Pooling sera to reduce the cost of HIV surveillance: a feasibility study in a rural Kenyan district , 1998, Tropical medicine & international health : TM & IH.

[11]  A. H. Welsh,et al.  Aspects of Statistical Inference: Welsh/Aspects , 1996 .

[12]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[13]  B. Kubanek,et al.  Mini‐pool screening by nucleic acid testing for hepatitis B virus, hepatitis C virus, and HIV: preliminary results , 1998, Transfusion.

[14]  S Zeger,et al.  Evaluation of human immunodeficiency virus seroprevalence in population surveys using pooled sera , 1989, Journal of clinical microbiology.

[15]  X M Tu,et al.  Studies of AIDS and HIV surveillance. Screening tests: can we get more by doing less? , 1994, Statistics in medicine.

[16]  Joseph L. Gastwirth,et al.  Estimation of the prevalence of a rare disease, preserving the anonymity of the subjects by group testing: application to estimating the prevalence of aids antibodies in blood donors , 1989 .