Model selection for geostatistical models.

We consider the problem of model selection for geospatial data. Spatial correlation is often ignored in the selection of explanatory variables, and this can influence model selection results. For example, the importance of particular explanatory variables may not be apparent when spatial correlation is ignored. To address this problem, we consider the Akaike Information Criterion (AIC) as applied to a geostatistical model. We offer a heuristic derivation of the AIC in this context and provide simulation results that show that using AIC for a geostatistical model is superior to the often-used traditional approach of ignoring spatial correlation in the selection of explanatory variables. These ideas are further demonstrated via a model for lizard abundance. We also apply the principle of minimum description length (MDL) to variable selection for the geostatistical model. The effect of sampling design on the selection of explanatory covariates is also explored. R software to implement the geostatistical model selection methods described in this paper is available in the Supplement.

[1]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[2]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[3]  Richard A. Davis,et al.  Time Series: Theory and Methods , 2013 .

[4]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[5]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[6]  John Eccleston,et al.  Statistics and Computing , 2006 .

[7]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[8]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[9]  R. Haining Spatial Data Analysis in the Social and Environmental Sciences , 1990 .

[10]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[11]  Richard J. Smith Spatial statistics in environmental science , 2000 .

[12]  Noel A Cressie,et al.  Uncertainty and Spatial Linear Models for Ecological Data , 2001 .

[13]  M. Stein,et al.  A Bayesian analysis of kriging , 1993 .

[14]  A. McQuarrie,et al.  Regression and Time Series Model Selection , 1998 .

[15]  David R. Anderson,et al.  Model Selection and Inference: A Practical Information-Theoretic Approach , 2001 .

[16]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[17]  Allan D. Hollander,et al.  Hierarchical representations of species distributions using maps, images and sighting data , 1994 .

[18]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .