Spatial cross-validation is not the right way to evaluate map accuracy

Abstract For decades scientists have produced maps of biological, ecological and environmental variables. These studies commonly evaluate the map accuracy through cross-validation with the data used for calibrating the underlying mapping model. Recent studies, however, have argued that cross-validation statistics of most mapping studies are optimistically biased. They attribute these overoptimistic results to a supposed serious methodological flaw in standard cross-validation methods, namely that these methods ignore spatial autocorrelation in the data. They argue that spatial cross-validation should be used instead, and contend that standard cross-validation methods are inherently invalid in a geospatial context because of the autocorrelation present in most spatial data. Here we argue that these studies propagate a widespread misconception of statistical validation of maps. We explain that unbiased estimates of map accuracy indices can be obtained by probability sampling and design-based inference and illustrate this with a numerical experiment on large-scale above-ground biomass mapping. In our experiment, standard cross-validation (i.e., ignoring autocorrelation) led to smaller bias than spatial cross-validation. Standard cross-validation was deficient in case of a strongly clustered dataset that had large differences in sampling density, but less so than spatial cross-validation. We conclude that spatial cross-validation methods have no theoretical underpinning and should not be used for assessing map accuracy, while standard cross-validation is deficient in case of clustered data. Model-free, design-unbiased and valid accuracy assessment is achieved with probability sampling and design-based inference. It is valid without the need to explicitly incorporate or adjust for spatial autocorrelation and perfectly suited for the validation of large scale biological, ecological and environmental maps.

[1]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[2]  R. Henrik Nilsson,et al.  Global diversity and geography of soil fungi , 2014, Science.

[3]  Gerard B. M. Heuvelink,et al.  Updating the 1:50,000 Dutch soil map using legacy soil data: A multinomial logistic regression approach , 2009 .

[4]  Damien Sulla-Menashe,et al.  A global land-cover validation data set, part I: fundamental design principles , 2012 .

[5]  D. Brus Balanced sampling: A versatile sampling approach for statistical soil surveys , 2015 .

[6]  J. Townshend,et al.  Global land change from 1982 to 2016 , 2018, Nature.

[7]  J. Deville,et al.  Efficient balanced sampling: The cube method , 2004 .

[8]  Martin Herold,et al.  A global land-cover validation data set, II: augmenting a stratified sampling design to estimate accuracy by region and land-cover class , 2012 .

[9]  M. Herold,et al.  The Importance of Consistent Global Forest Aboveground Biomass Product Validation , 2019, Surveys in Geophysics.

[10]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[11]  Sassan Saatchi,et al.  Local validation of global biomass maps , 2019, Int. J. Appl. Earth Obs. Geoinformation.

[12]  D. J. Brus,et al.  Sampling for Natural Resource Monitoring , 2006 .

[13]  Budiman Minasny,et al.  Optimizing stratification and allocation for design-based estimation of spatial means using predictions with error , 2015 .

[14]  Carsten F. Dormann,et al.  Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure , 2017 .

[15]  C. Justice,et al.  High-Resolution Global Maps of 21st-Century Forest Cover Change , 2013, Science.

[16]  Niklas L. P. Lundström,et al.  Spatially Balanced Sampling through the Pivotal Method , 2012, Biometrics.

[17]  Gerard B. M. Heuvelink,et al.  Sampling for validation of digital soil maps , 2011 .

[18]  Vincent Bretagnolle,et al.  Spatial leave‐one‐out cross‐validation for variable selection in the presence of spatial autocorrelation , 2014 .

[19]  Stephen V. Stehman,et al.  Basic probability sampling designs for thematic map accuracy assessment , 1999 .

[20]  N. Picard,et al.  Spatial validation reveals poor predictive performance of large-scale ecological mapping models , 2020, Nature Communications.

[21]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[22]  T. Gregoire,et al.  Sampling Strategies for Natural Resources and the Environment , 2004 .

[23]  Measurement of hindered diffusion in complex geometries for high-speed studies of single-molecule forces , 2021, Scientific reports.

[24]  David A. Patterson,et al.  Toward estimation of map accuracy without a probability test sample , 2003, Environmental and Ecological Statistics.

[25]  Thomas Nauss,et al.  Importance of spatial predictor variable selection in machine learning applications - Moving from data reproduction to spatial prediction , 2019, Ecological Modelling.

[26]  Diana H. Wall,et al.  Soil nematode abundance and functional group composition at a global scale , 2019, Nature.

[27]  S. Goetz,et al.  Estimated carbon dioxide emissions from tropical deforestation improved by carbon-density maps , 2012 .

[28]  Dick J. Brus,et al.  Statistical approaches for spatial sample survey: Persistent misconceptions and new developments , 2020, European Journal of Soil Science.

[29]  J. Burt,et al.  Elementary statistics for geographers , 1995 .

[30]  Gifty E. Acquah,et al.  African soil properties and nutrients mapped at 30 m spatial resolution using two-scale ensemble machine learning , 2020, Scientific Reports.

[31]  Luigi Boschetti,et al.  A stratified random sampling design in space and time for regional to global scale burned area product validation. , 2016, Remote sensing of environment.

[32]  N. Fierer,et al.  A global atlas of the dominant bacteria found in soil , 2018, Science.