Accuracy of alternative approaches to capture-recapture estimates of disease frequency: internal validity analysis of data from five sources.

The authors used "internal validity analysis" to evaluate the performance of various capture-recapture methods. Data from studies with five overlapping, incomplete lists generated subgroups whose known sizes were compared with estimates derived from various four-source capture-recapture analyses. In 15 previously unanalyzed data sets (five subgroups from each of three new studies), the authors observed a trend toward mean underestimation of the known population size by 16-25%. Despite this downward bias, coverage of the 90% confidence intervals associated with the method found to be optimal was acceptable (13/15). The authors conjectured that, with the obvious exception of geographically disparate lists, most data sets used by epidemiologists tend to have a net positive dependence; that is, cases captured by one source are more likely to be captured by some other available source than are cases selected randomly from the population, and this dependence results in a bias toward underestimation. Attempts to ensure that the underlying assumptions of the methods are met, such as minimizing (or adjusting adequately for) the possibility of loss due to death or migration, as was undertaken in one exceptional study, appear likely to improve the behavior of these methods.
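The direction of the bias described above can be illustrated with a deliberately simplified two-source sketch. This is not the authors' four-source analysis: it uses the classical Lincoln-Petersen two-source estimator, and the population size, capture probabilities, and function names below are hypothetical values chosen only for illustration. The counts are expected values rather than simulated draws, so the arithmetic is deterministic.

```python
def lincoln_petersen(n_a, n_b, n_both):
    """Two-source population estimate; assumes the sources are independent."""
    if n_both == 0:
        raise ValueError("no overlap between sources; estimate is undefined")
    return n_a * n_b / n_both


def expected_counts(n_total, p_a, p_b_given_a, p_b_given_not_a):
    """Expected list sizes and overlap for two sources (hypothetical helper).

    p_b_given_a > p_b_given_not_a encodes positive dependence: a case already
    on list A is more likely to appear on list B than a case missed by A.
    """
    n_a = n_total * p_a
    n_both = n_a * p_b_given_a
    n_b = n_both + (n_total - n_a) * p_b_given_not_a
    return n_a, n_b, n_both


TRUE_SIZE = 1000  # assumed known population size, as in an internal validity check

# Independent sources: B captures half of all cases whether or not A did.
independent = lincoln_petersen(*expected_counts(TRUE_SIZE, 0.5, 0.5, 0.5))

# Positive dependence: B captures 70% of A's cases but only 30% of the rest.
dependent = lincoln_petersen(*expected_counts(TRUE_SIZE, 0.5, 0.7, 0.3))

print(f"independent sources: {independent:.0f}")  # recovers the true size
print(f"positive dependence: {dependent:.0f}")    # falls below the true size
```

With these assumed probabilities both lists have the same expected size in the two scenarios; only the overlap differs. The inflated overlap under positive dependence enlarges the denominator, which pulls the estimate below the true size (to roughly 714 of 1,000 here).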
