Estimators in capture–recapture studies with two sources

This paper investigates the applications of capture–recapture methods to human populations. Capture–recapture methods are commonly used in estimating the size of wildlife populations but can also be used in epidemiology and social sciences, for estimating prevalence of a particular disease or the size of the homeless population in a certain area. Here we focus on estimating the prevalence of infectious diseases. Several estimators of population size are considered: the Lincoln–Petersen estimator and its modified version, the Chapman estimator, Chao’s lower bound estimator, the Zelterman’s estimator, McKendrick’s moment estimator and the maximum likelihood estimator. In order to evaluate these estimators, they are applied to real, three-source, capture-recapture data. By conditioning on each of the sources of three source data, we have been able to compare the estimators with the true value that they are estimating. The Chapman and Chao estimators were compared in terms of their relative bias. A variance formula derived through conditioning is suggested for Chao’s estimator, and normal 95% confidence intervals are calculated for this and the Chapman estimator. We then compare the coverage of the respective confidence intervals. Furthermore, a simulation study is included to compare Chao’s and Chapman’s estimator. Results indicate that Chao’s estimator is less biased than Chapman’s estimator unless both sources are independent. Chao’s estimator has also the smaller mean squared error. Finally, the implications and limitations of the above methods are discussed, with suggestions for further development.

[1]  B. Cadwell,et al.  Enhancing vaccine safety surveillance: a capture-recapture analysis of intussusception after rotavirus vaccination. , 2001, American journal of epidemiology.

[2]  Estimating the number of people eligible for health service use , 2002 .

[3]  G. Corrao,et al.  Capture-recapture methods to size alcohol related problems in a population , 2000, Journal of epidemiology and community health.

[4]  D. Böhning A simple variance formula for population size estimators by conditioning , 2008 .

[5]  John M. Roberts,et al.  Estimating the prevalence of male clients of prostitute women in Vancouver with a simple capture–recapture method , 2006 .

[6]  A Chao,et al.  The applications of capture‐recapture models to epidemiological data , 2001, Statistics in medicine.

[7]  Anne Chao,et al.  Estimating population size for sparse data in capture-recapture experiments , 1989 .

[8]  Hans C van Houwelingen,et al.  Point and interval estimation of the population size using the truncated Poisson regression model , 2003 .

[9]  A. Chao Estimating the population size for capture-recapture data with unequal catchability. , 1987, Biometrics.

[10]  R. A. Silverman,et al.  Introductory Real Analysis , 1972 .

[11]  Daniel Zelterman,et al.  Robust estimation in truncated discrete distributions with application to capture-recapture experiments , 1988 .

[12]  J. Richardus,et al.  Estimating infectious diseases incidence: validity of capture–recapture analysis and truncated models for incomplete count data , 2007, Epidemiology and Infection.

[13]  Ronald E. LaPorte,et al.  Capture-recapture and multiple-record systems estimation I: History and theoretical development ( Review ) , 1995 .

[14]  R R Regal,et al.  Capture-recapture methods in epidemiology: methods and limitations. , 1995, Epidemiologic reviews.

[15]  V. Vaillant,et al.  How many foodborne outbreaks of Salmonella infection occurred in France in 1995? Application of the capture-recapture method to three surveillance systems. , 2000, American journal of epidemiology.

[16]  G. Seber The estimation of animal abundance and related parameters , 1974 .

[17]  George A. F. Seber,et al.  The Effects of Trap Response on Tag Recapture Estimates , 1970 .

[18]  S. T. Buckland,et al.  Estimating Animal Abundance , 2002 .