Using information criteria to select the correct variance–covariance structure for longitudinal data in ecology

1. Ecological data sets often use clustered measurements or use repeated sampling in a longitudinal design. Choosing the correct covariance structure is an important step in the analysis of such data, as the covariance describes the degree of similarity among the repeated observations. 2. Three methods for choosing the covariance are: the Akaike information criterion (AIC), the quasi-information criterion (QIC), and the deviance information criterion (DIC). We compared the methods using a simulation study and using a data set that explored effects of forest fragmentation on avian species richness over 15 years. 3. The overall success was 80.6% for the AIC, 29.4% for the QIC and 81.6% for the DIC. For the forest fragmentation study the AIC and DIC selected the unstructured covariance, whereas the QIC selected the simpler autoregressive covariance. Graphical diagnostics suggested that the unstructured covariance was probably correct. 4. We recommend using DIC for selecting the correct covariance structure.

[1]  Arthur J. Reynolds,et al.  Alterable predictors of child well-being in the Chicago longitudinal study , 2004 .

[2]  Christopher Zorn Generalized Estimating Equation Models for Correlated Data: A Review with Applications , 2001 .

[3]  Annette J. Dobson,et al.  An introduction to generalized linear models , 1991 .

[4]  V. Carey,et al.  Criteria for Working–Correlation–Structure Selection in GEE , 2007 .

[5]  T. Blackburn,et al.  Extinction and endemism in the New Zealand avifauna , 2004 .

[6]  T. Donovan,et al.  DETERMINANTS OF WOOD THRUSH NEST SUCCESS: A MULTI-SCALE, MODEL SELECTION APPROACH , 2005 .

[7]  You-Gan Wang,et al.  Working‐correlation‐structure identification in generalized estimating equations , 2009, Statistics in medicine.

[8]  Philip D. Taylor,et al.  Changing importance of habitat structure across multiple spatial scales for three species of insects , 2003 .

[9]  Philip Heidelberger,et al.  Simulation Run Length Control in the Presence of an Initial Transient , 1983, Oper. Res..

[10]  David R. Anderson,et al.  Model selection and inference : a practical information-theoretic approach , 2000 .

[11]  Aki Vehtari Discussion to "Bayesian measures of model complexity and fit" by Spiegelhalter, D.J., Best, N.G., Carlin, B.P., and van der Linde, A. , 2002 .

[12]  M. Lindstrom,et al.  A survey of methods for analyzing clustered binary response data , 1996 .

[13]  H. Akaike A new look at the statistical model identification , 1974 .

[14]  N. Koper,et al.  Effects of Habitat Management for Ducks on Target and Nontarget Species , 2006 .

[15]  Fiona K. A. Schmiegelow,et al.  ARE BOREAL BIRDS RESILIENT TO FOREST FRAGMENTATION? AN EXPERIMENTAL STUDY OF SHORT‐TERM COMMUNITY RESPONSES , 1997 .

[16]  EFFECTS OF NATAL DEPARTURE AND WATER LEVEL ON SURVIVAL OF JUVENILE SNAIL KITES (ROSTRHAMUS SOCIABILIS) IN FLORIDA , 2004 .

[17]  Ian J. Stewart,et al.  A Bayesian hierarchical meta-analysis of growth for the genus Sebastes in the eastern Pacific Ocean , 2007 .

[18]  W. Pan Akaike's Information Criterion in Generalized Estimating Equations , 2001, Biometrics.

[19]  Xin Tu,et al.  A comparison of several approaches for choosing between working correlation structures in generalized estimating equation analysis of longitudinal binary data , 2009, Statistics in medicine.

[20]  J. Ware,et al.  Applied Longitudinal Analysis , 2004 .

[21]  S. Tonidandel,et al.  Robustness of Generalized Estimating Equation (GEE) Tests of Significance against Misspecification of the Error Structure Model , 2004 .

[22]  J. Hardin,et al.  Generalized Estimating Equations , 2002 .

[23]  Manuel K. Schneider,et al.  Quantification of neighbourhood‐dependent plant growth by Bayesian hierarchical modelling , 2006 .

[24]  J. Wiens Spatial Scaling in Ecology , 1989 .

[25]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[26]  P. Diggle,et al.  Analysis of Longitudinal Data , 2003 .

[27]  Cameron L. Aldridge,et al.  Application of random effects to the study of resource selection by animals. , 2006, The Journal of animal ecology.

[28]  Model selection techniques for the covariance matrix for incomplete longitudinal data. , 1995, Statistics in medicine.

[29]  D Hémon,et al.  Assessing the significance of the correlation between two spatial processes. , 1989, Biometrics.

[30]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[31]  A. Brix Bayesian Data Analysis, 2nd edn , 2005 .

[32]  You-Gan Wang,et al.  Applications: A Generalized Estimating Equations Approach for Analysis of the Impact of New Technology on a Trawl Fishery , 2000 .

[33]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .