Capture–recapture analysis with a latent class model allowing for local dependence and observed heterogeneity

In epidemiology, capture-recapture models are commonly used to estimate the size of an unknown population based on several incomplete lists of individuals. The method operates under two main assumptions: independence between the lists (local independence) and homogeneity of capture probabilities of individuals. In practice, these assumptions are rarely satisfied. We introduce a multinomial latent class model that can account for both list dependence and heterogeneity. Parameter estimation is performed by maximizing the conditional likelihood function with the use of the EM algorithm. In addition, a new approach for evaluating the standard errors of the parameter estimates is discussed, which considerably reduces the computational burden associated with the evaluation of the variance of the population size estimate.

[1]  J. Alho Logistic regression in capture-recapture models. , 1990, Biometrics.

[2]  Ronald E. LaPorte,et al.  Capture-recapture and multiple-record systems estimation I: History and theoretical development ( Review ) , 1995 .

[3]  Hans C van Houwelingen,et al.  Point and interval estimation of the population size using the truncated Poisson regression model , 2003 .

[4]  W. Link Nonidentifiability of Population Size from Capture‐Recapture Data with Heterogeneous Detection Probabilities , 2003, Biometrics.

[5]  Francesco Bartolucci,et al.  A Class of Latent Marginal Models for Capture–Recapture Data With Continuous Covariates , 2006 .

[6]  B. Muthén,et al.  Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study , 2007 .

[7]  R. Regal,et al.  Effect of variation in probability of ascertainment by sources ("variable catchability") upon "capture-recapture" estimates of prevalence. , 1993, American journal of epidemiology.

[8]  Dean Harper,et al.  Local dependence latent structure models , 1972 .

[9]  Yan Wang,et al.  A Semiparametric Model for Recapture Experiments , 2003 .

[10]  Dankmar Böhning,et al.  Equivalence of Truncated Count Mixture Distributions and Mixtures of Truncated Count Distributions , 2006, Biometrics.

[11]  A. Agresti Categorical data analysis , 1993 .

[12]  A Chao,et al.  The applications of capture‐recapture models to epidemiological data , 2001, Statistics in medicine.

[13]  Cormack Rm,et al.  Interval estimation for mark-recapture studies of closed populations. , 1992 .

[14]  J. Hagenaars Latent Structure Models with Direct Effects between Indicators , 1988 .

[15]  Dankmar Böhning,et al.  Mixture models for capture-recapture count data , 2005, Stat. Methods Appl..

[16]  Yan Wang,et al.  A latent variable regression model for capture-recapture data , 2009, Comput. Stat. Data Anal..

[17]  P. Yip,et al.  A NOTE ON NONPARAMETRIC INFERENCE FOR CAPTURE-RECAPTURE EXPERIMENTS WITH HETEROGENEOUS CAPTURE PROBABILITIES , 2001 .

[18]  Eugene Zwane,et al.  Population estimation using the multiple system estimator in the presence of continuous covariates , 2005 .

[19]  Peter G M van der Heijden,et al.  A Multiple‐Record Systems Estimation Method that Takes Observed and Unobserved Heterogeneity into Account , 2004, Biometrics.

[20]  Karen Bandeen-Roche,et al.  Building an identifiable latent class model with covariate effects on underlying and measured variables , 2004 .

[21]  Dankmar Böhning,et al.  Editorial – Recent Developments in Capture‐Recapture Methods and Their Applications , 2008, Biometrical journal. Biometrische Zeitschrift.

[22]  Ronald E. LaPorte,et al.  Capture-recapture and multiple-record systems estimation II: Applications in human diseases. International Working Group for Disease Monitoring and Forecasting. , 1995, American journal of epidemiology.

[23]  V W Sidel,et al.  Capture-recapture methods for assessing the completeness of case ascertainment when using multiple information sources. , 1974, Journal of chronic diseases.

[24]  Chris A. Glasbey,et al.  A simulation-based method for model evaluation , 2003 .

[25]  S. Fienberg,et al.  Classical multilevel and Bayesian approaches to population size estimation using multiple lists , 1999 .

[26]  Francesco Bartolucci,et al.  Focused Information Criterion for Capture–Recapture Models for Closed Populations , 2008 .

[27]  Shirley Pledger,et al.  Using Mixtures to Model Heterogeneity in Ecological Capture‐Recapture Studies , 2008, Biometrical journal. Biometrische Zeitschrift.

[28]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[29]  Scott L. Zeger,et al.  Latent Variable Regression for Multiple Discrete Outcomes , 1997 .

[30]  F Merletti,et al.  National Diabetes Programs: Application of capture-recapture to count diabetes? , 1994, Diabetes Care.

[31]  R. Huggins On the statistical analysis of capture experiments , 1989 .