A Multilevel Model for Continuous Time Population Estimation

Statistical methods have been developed and applied to estimate the size of populations that are difficult or too costly to enumerate. In epidemiological settings these are known as multilist methods: individuals are matched across lists, and population size is estimated by modeling counts in incomplete multidimensional contingency tables defined by patterns of presence or absence on the lists. Because multilist methods typically assume that lists are compiled instantaneously, few options are available for estimating the unknown size of a closed population from continuously (longitudinally) compiled lists. Yet in epidemiological settings, continuous time lists are a routine byproduct of administrative functions. Existing methods are based on time-to-event analyses, with population size estimated in a second step. We propose an alternative approach that addresses a twofold epidemiological problem: estimating population size and identifying patient factors related to the duration (in days) between visits to a health care facility. A Bayesian framework is used to model interval lengths because the data are sparse for many patients: many were observed only once or twice. The proposed method is applied to the motivating data to illustrate its applicability. A small simulation study then explores the performance of the estimator under a variety of conditions. Finally, a brief discussion suggests opportunities for continued methodological development in continuous time population estimation.
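
As a point of reference for the instantaneous multilist setting described above, the simplest case has two lists. The sketch below is illustrative only and is not the continuous time model proposed here: it matches individuals across two hypothetical lists, tabulates the incomplete 2 x 2 table of presence/absence patterns, and fills in the unobserved cell under the assumption that the lists are independent (the classical Lincoln-Petersen estimate). The function name `two_list_estimate` and the patient identifiers are assumptions made for the example.

```python
def two_list_estimate(list_a, list_b):
    """Estimate closed-population size from two matched lists.

    Assumes the two lists are independent and that every individual
    carries an identifier that matches exactly across lists.
    """
    a, b = set(list_a), set(list_b)

    # Observed cells of the incomplete 2 x 2 contingency table.
    n11 = len(a & b)   # on both lists
    n10 = len(a - b)   # on list A only
    n01 = len(b - a)   # on list B only

    if n11 == 0:
        raise ValueError("No overlap between lists; estimate is undefined.")

    # Under independence the odds ratio is 1, so the missing cell
    # (individuals on neither list) is estimated by n10 * n01 / n11.
    n00_hat = n10 * n01 / n11

    return {
        "n11": n11,
        "n10": n10,
        "n01": n01,
        "n00_hat": n00_hat,
        "N_hat": n11 + n10 + n01 + n00_hat,
    }


# Hypothetical patient identifiers on two administrative lists.
clinic_list = [1, 2, 3, 4, 5, 6]
pharmacy_list = [4, 5, 6, 7, 8, 9]
result = two_list_estimate(clinic_list, pharmacy_list)
print(result["N_hat"])
```

With more than two lists the same idea is usually expressed as a log-linear model for the observed cells, which allows some dependence between lists to be modeled instead of assumed away.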
