Testing environmental and genetic effects in the presence of spatial autocorrelation

Spatial autocorrelation is a well-recognized concern for observational data in general, and more specifically for spatial data in ecology. Generalized linear mixed models (GLMMs) with spatially autocorrelated random effects are a potential general framework for handling these spatial correlations. However, as the result of statistical and practical issues, such GLMMs have been fitted through the undocumented use of procedures based on penalized quasi-likelihood approximations (PQL), and under restrictive models of spatial correlation. Alternatively, they are often neglected in favor of simpler but more questionable approaches. In this work we aim to provide practical and validated means of inference under spatial GLMMs, that overcome these limitations. For this purpose, a new software is developed to fit spatial GLMMs. We use it to assess the performance of likelihood ratio tests for fixed effects under spatial autocorrelation, based on Laplace or PQL approximations of the likelihood. Expectedly, the Laplace approximation performs generally slightly better, although a variant of PQL was better in the binary case. We show that a previous implementation of PQL methods in the R language, glmmPQL, is not appropriate for such applications. Finally, we illustrate the efficiency of a bootstrap procedure for correcting the small sample bias of the tests, which applies also to non-spatial models.

[1]  Emmanuel Lesaffre,et al.  Hierarchical Generalized Linear Models: The R Package HGLMMM , 2011 .

[2]  Gilles Guillot,et al.  Dismantling the Mantel tests , 2011, 1112.0651.

[3]  B. Kempenaers,et al.  Identification of a gene associated with avian migratory behaviour , 2011, Proceedings of the Royal Society B: Biological Sciences.

[4]  M. Wall A close look at the spatial structure implied by the CAR and SAR models , 2004 .

[5]  J. Pinheiro,et al.  Efficient Laplacian and Adaptive Gaussian Quadrature Algorithms for Multilevel Generalized Linear Mixed Models , 2006 .

[6]  William J. Browne,et al.  A comparison of the hierarchical likelihood and Bayesian approaches to spatial epidemiological modelling , 2007 .

[7]  Peter J. Diggle,et al.  An Introduction to Model-Based Geostatistics , 2003 .

[8]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[9]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[10]  M. Fortin,et al.  Comparison of the Mantel test and alternative approaches for detecting complex multivariate relationships in the spatial analysis of genetic data , 2010, Molecular ecology resources.

[11]  Richard Field,et al.  Coefficient shifts in geographical ecology: an empirical evaluation of spatial and non-spatial regression , 2009 .

[12]  Michael L. Stein,et al.  Interpolation of spatial data , 1999 .

[13]  Maengseok Noh,et al.  REML estimation for binary data in GLMMs , 2007 .

[14]  G. Robertson,et al.  The spatial distribution of nematode trophic groups across a cultivated ecosystem , 1995 .

[15]  L. Waits,et al.  Landscape genetics: where are we now? , 2010, Molecular ecology.

[16]  P. Dixon,et al.  Accounting for Spatial Pattern When Modeling Organism- Environment Interactions , 2022 .

[17]  Mollie E. Brooks,et al.  Generalized linear mixed models: a practical guide for ecology and evolution. , 2009, Trends in ecology & evolution.

[18]  Y Pawitan,et al.  Multicomponent variance estimation for binary traits in family‐based studies , 2006, Genetic epidemiology.

[19]  R. Baayen,et al.  Mixed-effects modeling with crossed random effects for subjects and items , 2008 .

[20]  Joy Bergelson,et al.  References and Notes Supporting Online Material Adaptation to Climate across the Arabidopsis Thaliana Genome , 2022 .

[21]  AURÉLIE COULON,et al.  Statistical methods in spatial genetics , 2009, Molecular ecology.

[22]  Robert R. Sokal,et al.  An investigation of three-matrix permutation tests , 1992 .

[23]  F. Martellosio,et al.  THE CORRELATION STRUCTURE OF SPATIAL AUTOREGRESSIONS , 2012, Econometric Theory.

[24]  François Rousset,et al.  PARTIAL MANTEL TESTS: REPLY TO CASTELLANO AND BALLETTO , 2002 .

[25]  Jennifer A Hoeting,et al.  Model selection for geostatistical models. , 2006, Ecological applications : a publication of the Ecological Society of America.

[26]  J. Losos,et al.  Niche incumbency, dispersal limitation and climate shape geographical distributions in a species-rich island adaptive radiation , 2013 .

[27]  Bartlett's correction and the bootstrap in normal linear regression models , 1990 .

[28]  Woojoo Lee,et al.  Modifications of REML algorithm for HGLMs , 2012, Stat. Comput..

[29]  J. Gaillard,et al.  Statistical evaluation of parameters estimating autocorrelation and individual heterogeneity in longitudinal studies , 2012 .

[30]  T. Clutton‐Brock,et al.  SHARED SPATIAL EFFECTS ON QUANTITATIVE GENETIC PARAMETERS: ACCOUNTING FOR SPATIAL AUTOCORRELATION AND HOME RANGE OVERLAP REDUCES ESTIMATES OF HERITABILITY IN WILD RED DEER , 2012, Evolution; international journal of organic evolution.

[31]  N. Breslow,et al.  Bias correction in generalised linear mixed models with a single component of dispersion , 1995 .

[32]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[33]  A. Agresti,et al.  Random Effects Modeling of Multiple Binomial Responses Using the Multivariate Binomial Logit‐Normal Distribution , 2000, Biometrics.

[34]  Peter J. Diggle,et al.  Spatial modelling and the prediction of Loa loa risk: decision making under uncertainty. , 2007 .

[35]  Peter L. Ralph,et al.  DISENTANGLING THE EFFECTS OF GEOGRAPHIC AND ECOLOGICAL ISOLATION ON GENETIC DIFFERENTIATION , 2013, Evolution; international journal of organic evolution.

[36]  Moudud Alam,et al.  hglm: A Package for Fitting Hierarchical Generalized Linear Models , 2010, R J..

[37]  Youngjo Lee,et al.  Model selection for multi‐component frailty models , 2007, Statistics in medicine.

[38]  B. Minasny,et al.  The Matérn function as a general model for soil variograms , 2005 .

[39]  Youngjo Lee,et al.  Modelling and analysing correlated non-normal data , 2001 .

[40]  J. Nelder,et al.  Hierarchical generalised linear models: A synthesis of generalised linear models, random-effect models and structured dispersions , 2001 .

[41]  David M. Rocke Bootstrap Bartlett Adjustment in Seemingly Unrelated Regression , 1989 .

[42]  F. Rousset,et al.  ARE PARTIAL MANTEL TESTS ADEQUATE? , 2001, Evolution; international journal of organic evolution.