Combining information in hierarchical models improves inferences in population ecology and demographic population analyses

Statistical models that include random effects are becoming more common in population ecology all the time. Random effects are two or more effects of some grouping factor that belong together, in the sense that we can imagine they were generated by a common stochastic process. Thus, they are similar, but not identical, and therefore assumed to be exchangeable. Models with random effects contain as a description of this common stochastic process so-called prior distributions, with estimable hyperparameters, for instance the mean and the variance for normally distributed random effects. Models with random effects have intrinsically more than one level; therefore, they are often called hierarchical models (Royle & Dorazio, 2008) or multilevel models (Gelman & Hill, 2007). Interest in hierarchical models often focuses on the hyperparameters, but the realizations from the process described by the prior distributions, that is, the random effects, can also be estimated. The main difference to treating a set of effects as fixed is that in a random effects model, the parameters of a grouping factor are no longer estimated independently. Rather, the assumption that they come from a common prior distribution induces a dependence, which means that the estimate of each is somewhat influenced by the estimate of all other effects comprising that factor. This is often called borrowing strength from the ensemble. One consequence of treating a factor as random is that its effect estimates are pulled in towards the overall mean, which is called shrinkage in the literature. If the assumption of a common stochastic process is reasonable, borrowing strength and shrinkage typically leads to better estimates compared with their fixed effects counterparts (Gelman, 2005; chapter 4 in Kery & Schaub, 2012). Hierarchical models are not at all intrinsically Bayesian; rather, they can be analysed using likelihood or Bayesian methods (Royle & Dorazio, 2008), though Bayesian analysis is often easier, especially for ecologists using MCMC engines like WinBUGS (Lunn et al., 2000) or JAGS (Plummer, 2003). One particularly interesting use of hierarchical model is as a formal way of combining information, for instance, coming from different studies. A good example is the feature paper by Halstead et al. (2012), where estimates of survival probabilities of a snake species from radio-tracking data at multiple study sites in the same region are combined in a hierarchical model that is fitted using the WinBUGS software. Sample size at each site was fairly low. Hence, Halstead and colleagues treated each study site as a replicate and estimated the hyperparameters of the distribution that collects together the sitespecific parameters, which were treated as random effects. Hierarchical modelling thus provides a formal way of combining this information, where study sites with more information (e.g. more snakes or with more precise estimates) get more weight in contributing to the estimates of the overall mean and the among-site variability (the mean and the variance hyperparameters). Site-specific estimates are shrunken towards the mean, where the degree of shrinkage depends on the precision of the site-specific estimates. This is a desirable property, as it avoids possible overinterpretation of patterns that may be due to some idiosyncrasy of a small sample. Borrowing strength typically also helps against boundary estimates that are frequent in survival analysis with small sample size. Thus, not only were Halstead and colleagues able to obtain a formal estimate of the regional mean of the interesting parameters, but arguably they also obtained better site-specific estimates owing to the sharing of information among all sites. In a sense, what they did is a meta-analysis. A further advantage of combining data from several studies is increased precision of the site-specific estimates, because they borrow strength from the entire dataset. Increased precision means to have more power to detect an effect, which is especially advantageous when dealing with rare species, where sample size is typically small to very small. A similar approach was chosen by Papadatou et al. (2012) who estimated survival probabilities of several bat species (i.e. where species was bs_bs_banner

[1]  Roger Pradel,et al.  Comparing survival among species with imperfect detection using multilevel analysis of mark–recapture data: a case study on bats , 2012 .

[2]  Marc Kery,et al.  Introduction to WinBUGS for Ecologists: Bayesian approach to regression, ANOVA, mixed models and related analyses , 2010 .

[3]  Bayesian shared frailty models for regional inference about wildlife survival , 2012 .

[4]  A. Gelman Analysis of variance: Why it is more important than ever? , 2005, math/0504499.

[5]  Michael Schaub,et al.  Bayesian Population Analysis using WinBUGS: A Hierarchical Perspective , 2011 .

[6]  P Besbeas,et al.  Integrating Mark–Recapture–Recovery and Census Data to Estimate Animal Abundance and Demographic Parameters , 2002, Biometrics.

[7]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[8]  Olivier Gimenez,et al.  An assessment of integrated population models: bias, accuracy, and violation of the assumption of independence. , 2010, Ecology.

[9]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[10]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[11]  J. Andrew Royle,et al.  Hierarchical Modeling and Inference in Ecology: The Analysis of Data from Populations, Metapopulations and Communities , 2008 .

[12]  Jessica Gurevitch,et al.  Meta-analysis in ecology , 2001 .