Paper information - Testing the predictive performance of distribution models

Testing the predictive performance of distribution models

Distribution models are used to predict the likelihood of occurrence or abundance of a species at locations where census data are not available. An integral part of modelling is the testing of model performance. We compared different schemes and measures for testing model performance using 79 species from the North American Breeding Bird Survey. The four testing schemes we compared featured increasing independence between test and training data: resubstitution, random data hold-out and two spatially segregated data hold-out designs. The different testing measures also addressed different levels of information content in the dependent variable: regression R² for absolute abundance, squared correlation coefficient r² for relative abundance and AUC/Somers' D for presence/absence. We found that higher levels of independence between test and training data lead to lower assessments of prediction accuracy. Even for data collected independently, spatial autocorrelation leads to dependence between random hold-out test data and training data, and thus to inflated measures of model performance. While there is a general awareness of the importance of autocorrelation to model building and hypothesis testing, its consequences via violation of independence between training and testing data have not been addressed systematically and comprehensively before. Furthermore, increasing information content (from correctly classifying presence/absence, to predicting relative abundance, to predicting absolute abundance) leads to decreasing predictive performance. The current tests for presence/absence distribution models are typically overly optimistic because a) the test and training data are not independent and b) the correct classification of presence/absence has a relatively low information content, and thus a lower capability to address ecological and conservation questions, compared to a prediction of abundance.
Meaningful evaluation of model performance requires testing on spatially independent data if the intended application of the model is to predict into new geographic or climatic space, which arguably is the case for most applications of distribution models.
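
The three levels of information content named in the abstract can be made concrete with a small sketch. The code below is illustrative only and is not the authors' implementation: it assumes that regression R² means 1 - SS_res/SS_tot computed against the observed values, that r² is the squared Pearson correlation, and that Somers' D is obtained from AUC as D = 2·AUC - 1. The helper names (`regression_r2`, `squared_correlation`, `auc`, `somers_d`, `random_holdout`, `spatial_holdout`) and the toy data are hypothetical, and the longitude-based split is only one simple way to segregate test and training data spatially, not necessarily one of the two designs used in the paper.

```python
import random
from statistics import mean


def regression_r2(observed, predicted):
    """Assumed definition: 1 - SS_res / SS_tot. Sensitive to absolute values."""
    obs_mean = mean(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - obs_mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def squared_correlation(observed, predicted):
    """Squared Pearson r. Unchanged by linear rescaling of the predictions."""
    mo, mp = mean(observed), mean(predicted)
    cov = sum((o - mo) * (p - mp) for o, p in zip(observed, predicted))
    var_o = sum((o - mo) ** 2 for o in observed)
    var_p = sum((p - mp) ** 2 for p in predicted)
    return cov ** 2 / (var_o * var_p)


def auc(presence, scores):
    """Share of (presence, absence) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(presence, scores) if y]
    neg = [s for y, s in zip(presence, scores) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def somers_d(presence, scores):
    """Somers' D rescales AUC from [0, 1] to [-1, 1]."""
    return 2.0 * auc(presence, scores) - 1.0


def random_holdout(n_sites, test_fraction, seed=0):
    """Random hold-out: test sites are scattered among the training sites."""
    idx = list(range(n_sites))
    random.Random(seed).shuffle(idx)
    n_test = int(round(n_sites * test_fraction))
    return idx[n_test:], idx[:n_test]  # (train, test)


def spatial_holdout(longitudes, boundary):
    """Hypothetical spatial hold-out: train west of a boundary, test east."""
    train = [i for i, x in enumerate(longitudes) if x < boundary]
    test = [i for i, x in enumerate(longitudes) if x >= boundary]
    return train, test


# Toy data (made up): predictions have the right pattern but the wrong scale.
observed = [0, 0, 2, 4, 6]
predicted = [1, 1, 5, 9, 13]           # equals 2 * observed + 1
presence = [o > 0 for o in observed]

print("R2  (absolute abundance):", regression_r2(observed, predicted))
print("r2  (relative abundance):", squared_correlation(observed, predicted))
print("AUC (presence/absence):  ", auc(presence, predicted))
print("Somers' D:               ", somers_d(presence, predicted))
```

On this toy data the predictions rank every occupied site above every unoccupied one (AUC of 1, Somers' D of 1) and track relative abundance perfectly (r² of 1, up to floating-point rounding), yet the regression R² is negative (-2.125) because the absolute values are wrong. That is the sense in which a measure with lower information content can look far more favourable than one with higher information content. The two split helpers show the difference in kind between the schemes: under a random hold-out the test sites sit next to training sites, so spatial autocorrelation can link them, whereas a spatially segregated hold-out keeps them apart.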

B. McGill | V. Bahn
