Parameter-expanded data augmentation for Bayesian analysis of capture–recapture models

Data augmentation (DA) is a flexible tool for analyzing closed and open population models of capture–recapture data, especially models that include sources of heterogeneity among individuals. The essential concept underlying DA, as we use the term, is to add “observations” to create a dataset composed of a known number of individuals. This new (augmented) dataset, which includes the unknown number of individuals N in the population, is then analyzed using a new model that includes a reformulation of the parameter N in the conventional model of the observed (unaugmented) data. In the context of capture–recapture models, we add a set of “all zero” encounter histories which are not, in practice, observable. The model of the augmented dataset is a zero-inflated version of either a binomial or a multinomial base model. Thus, our use of DA provides a general approach for analyzing both closed and open population models of all types. In doing so, this approach provides a unified framework for the analysis of a wide range of models that are treated as unrelated “black boxes” and named procedures in the classical literature. As a practical matter, analysis of the augmented dataset by MCMC is greatly simplified compared to other methods that require specialized algorithms. For example, complex capture–recapture models of an augmented dataset can be fitted with popular MCMC software packages (WinBUGS or JAGS) by providing a concise statement of the model’s assumptions that usually involves only a few lines of pseudocode. In this paper, we review the basic technical concepts of data augmentation, and we provide examples of analyses of closed-population models (M0, Mh, distance sampling, and spatial capture–recapture models) and open-population models (Jolly–Seber) with individual effects.
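To make the zero-inflated binomial formulation concrete, the following is a minimal sketch of data augmentation for the simplest closed-population model (M0). It is not the WinBUGS or JAGS code referred to above: it is a hand-written Gibbs sampler in Python with NumPy, and the function name `fit_m0_da`, the uniform priors on the inclusion probability `psi` and the detection probability `p`, the augmented size `M = 300`, and the simulated data are all assumptions made for illustration. Each of the `M` individuals in the augmented dataset carries a latent indicator `z_i ~ Bernoulli(psi)`, its capture count is `y_i ~ Binomial(K, p * z_i)`, and population size is recovered as `N = sum(z)`.

```python
import numpy as np


def fit_m0_da(y, K, M, n_iter=2000, burn_in=500, seed=0):
    """Gibbs sampler for model M0 under data augmentation (illustrative sketch).

    y : capture counts (each >= 1) of the n observed individuals
    K : number of sampling occasions
    M : size of the augmented dataset (assumed to exceed the true N)
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=int)
    n = y.size
    if M < n:
        raise ValueError("M must be at least the number of observed individuals")

    # Augment the data with M - n all-zero encounter histories.
    y_aug = np.concatenate([y, np.zeros(M - n, dtype=int)])
    observed = y_aug > 0          # these individuals certainly have z = 1
    total_caps = int(y_aug.sum())

    psi, p = 0.5, 0.5             # arbitrary starting values
    N_draws, p_draws = [], []
    for it in range(n_iter):
        # z_i | rest: an all-zero history is a real but undetected individual
        # with probability psi(1-p)^K / (psi(1-p)^K + 1 - psi).
        q0 = (1.0 - p) ** K
        prob_real = psi * q0 / (psi * q0 + 1.0 - psi)
        z = observed | (rng.random(M) < prob_real)
        N = int(z.sum())

        # Conjugate updates under the assumed Uniform(0, 1) priors.
        psi = rng.beta(1 + N, 1 + M - N)
        p = rng.beta(1 + total_caps, 1 + K * N - total_caps)

        if it >= burn_in:
            N_draws.append(N)
            p_draws.append(p)
    return np.array(N_draws), np.array(p_draws)


# Usage on simulated data (hypothetical values: N = 100, p = 0.3, K = 5).
sim_rng = np.random.default_rng(1)
counts = sim_rng.binomial(5, 0.3, size=100)
counts = counts[counts > 0]       # individuals never captured are unobservable
n_obs = counts.size
N_draws, p_draws = fit_m0_da(counts, K=5, M=300)
print("observed:", n_obs, "posterior mean of N:", N_draws.mean())
```

In this sketch the dimension of the problem is fixed at `M`, so no specialized algorithm is needed to sample N. A common practical check is that the posterior draws of N stay well below `M`; if they pile up near `M`, the augmented dataset is probably too small.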
