Spatially autocorrelated sampling falsely inflates measures of accuracy for presence‐only niche models

Aim: Environmental niche models that use presence-only data have been increasingly employed to model species distributions and test ecological and evolutionary predictions. The ideal method for evaluating the accuracy of a niche model is to train a model with one dataset and then test model predictions against an independent dataset. However, a truly independent dataset is often not available, and instead random subsets of the total data are used for ‘training’ and ‘testing’ purposes. The goal of this study was to determine how spatially autocorrelated sampling affects measures of niche model accuracy when using subsets of a larger dataset for accuracy evaluation.
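The concern with random ‘training’ and ‘testing’ subsets can be illustrated with a minimal sketch. It is not the study's method: the clustered occurrence points are synthetic, and the function names, the cluster layout, and the `x_cut` boundary are all hypothetical choices made for illustration. The sketch compares how far test points lie from their nearest training point under a random split and under a simple spatial block split.

```python
import math
import random


def clustered_points(per_cluster=25, spread=0.5, seed=1):
    """Synthetic occurrence records: 8 tight clusters along the x axis.

    The layout is an assumption standing in for spatially clumped sampling.
    """
    rng = random.Random(seed)
    centers = [(10.0 * k, 50.0) for k in range(1, 9)]
    return [
        (rng.gauss(cx, spread), rng.gauss(cy, spread))
        for cx, cy in centers
        for _ in range(per_cluster)
    ]


def random_split(points, test_fraction=0.25, seed=0):
    """Shuffle the records and hold out a random fraction for testing."""
    rng = random.Random(seed)
    shuffled = list(points)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def block_split(points, x_cut):
    """Hold out every record east of x_cut as a spatially separate test set."""
    train = [p for p in points if p[0] < x_cut]
    test = [p for p in points if p[0] >= x_cut]
    return train, test


def mean_nearest_train_distance(train, test):
    """Average distance from each test point to its closest training point."""
    return sum(min(math.dist(t, s) for s in train) for t in test) / len(test)


points = clustered_points()
rand_train, rand_test = random_split(points)
block_train, block_test = block_split(points, x_cut=65.0)  # hypothetical boundary

d_random = mean_nearest_train_distance(rand_train, rand_test)
d_block = mean_nearest_train_distance(block_train, block_test)
print(f"random split: mean distance to nearest training point = {d_random:.2f}")
print(f"block split:  mean distance to nearest training point = {d_block:.2f}")
```

Under the random split, each test point sits almost on top of a training point from the same cluster, so the two subsets are far from independent. The block split keeps them spatially separated, which is closer in spirit to testing against an independent dataset.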
