spind: an R Package to Account for Spatial Autocorrelation in the Analysis of Lattice Data

Abstract spind is an R package aiming to provide a useful toolkit to account for spatial dependence in the analysis of lattice data. Grid-based data sets in spatial modelling often exhibit spatial dependence, i.e. values sampled at nearby locations are more similar than those sampled further apart. spind methods, described here, take this kind of two-dimensional dependence into account and are sensitive to its variation across different spatial scales. Methods presented to account for spatial autocorrelation are based on the two fundamentally different approaches of generalised estimating equations as well as wavelet-revised methods. Both methods are extensions to generalised linear models. spind also provides functions for multi-model inference and scaling by wavelet multiresolution regression. Since model evaluation is essential for assessing prediction accuracy in species distribution modelling, spind additionally supplies users with spatial accuracy measures, i.e. measures that are sensitive to the spatial arrangement of the predictions.

[1]  Ingolf Kühn,et al.  A Wavelet-Based Extension of Generalized Linear Models to Remove the Effect of Spatial Autocorrelation: Wavelet-based Extension of Generalized Linear Models , 2010 .

[2]  N. Gotelli Predicting Species Occurrences: Issues of Accuracy and Scale , 2003 .

[3]  Hanna Tuomisto,et al.  DISSECTING THE SPATIAL STRUCTURE OF ECOLOGICAL DATA AT MULTIPLE SCALES , 2004 .

[4]  Mevin B Hooten,et al.  The basis function approach for modeling autocorrelation in ecological data. , 2016, Ecology.

[5]  G. Carl,et al.  Assessing relative variable importance across different spatial scales: a two‐dimensional wavelet analysis , 2016 .

[6]  Jason Fine,et al.  Estimating equations for association structures , 2004, Statistics in medicine.

[7]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[8]  Colin M Beale,et al.  Regression analysis of spatial data. , 2010, Ecology letters.

[9]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[10]  Adrian G. Barnett,et al.  Using information criteria to select the correct variance–covariance structure for longitudinal data in ecology , 2010 .

[11]  Søren Højsgaard,et al.  The R Package geepack for Generalized Estimating Equations , 2005 .

[12]  A. Jiménez‐Valverde Threshold-dependence as a desirable attribute for discrimination assessment: implications for the evaluation of species distribution models , 2014, Biodiversity and Conservation.

[13]  G. Carl,et al.  Analyzing spatial autocorrelation in species distributions using Gaussian and logit models , 2007 .

[14]  R. Real,et al.  AUC: a misleading measure of the performance of predictive distribution models , 2008 .

[15]  G. Carl,et al.  Spind: a package for computing spatially corrected accuracy measures , 2017 .

[16]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[17]  Alberto Jiménez-Valverde,et al.  Not as good as they seem: the importance of concepts in species distribution modelling , 2008 .

[18]  P. Legendre Spatial Autocorrelation: Trouble or New Paradigm? , 1993 .

[19]  Jack J. Lennon,et al.  Red-shifts and red herrings in geographical ecology , 2000 .

[20]  Stéphane Dray,et al.  Spatial modelling: a comprehensive framework for principal coordinate analysis of neighbour matrices (PCNM) , 2006 .

[21]  Edward C. Chao,et al.  Generalized Estimating Equations , 2003, Technometrics.

[22]  Ingolf Kühn,et al.  Analyzing spatial ecological data using linear regression and wavelet analysis , 2008 .