Dealing with overprediction in species distribution models: How adding distance constraints can improve model accuracy

Abstract Species distribution models can be affected by overprediction when dispersal movement is not incorporated into the modelling process. We compared the efficiency of seven methods that take into account spatial constraints to reduce overprediction when using four algorithms for species distribution models. By using a virtual ecologist approach, we were able to measure the accuracy of each model in predicting actual species distributions. We built 40 virtual species distributions within the Neotropical realm. Then, we randomly sampled 50 occurrences that were used in seven spatially restricted species distribution models (hereafter called M-SDMs) and a non-spatially restricted ecological niche model (ENM). We used four algorithms; Maximum Entropy, Generalized Linear Models, Random Forest, and Support Vector Machine. M-SDM methods were divided into a priori methods, in which spatial restrictions were inserted with environmental variables in the modelling process, and a posteriori methods, in which reachable and suitable areas were overlapped. M-SDM efficiency was obtained by calculating the difference in commission and omission errors between M-SDMs and ENMs. We used linear mixed-effects models to test if differences in commission and omission errors varied among the M-SDMs and algorithms. Our results indicate that overall M-SDMs reduce overprediction with no increase in underprediction compared to ENMs with few exceptions, such as a priori methods combined with the Support Vector Machine algorithm. There is a high variation in modelling performance among species, but there were only a few cases in which overprediction or underprediction increased. We only compared methods that do not require species dispersal data, guaranteeing that they can be applied to less-studied species. We advocate that species distribution modellers should not ignore spatial constraints, especially because they can be included in models at low costs but high benefits in terms of overprediction reduction.

[1]  A. Rozenfeld,et al.  The geographic scaling of biotic interactions , 2013 .

[2]  J. Nichols,et al.  Dynamic occupancy models for analyzing species' range dynamics across large geographic scales , 2013, Ecology and evolution.

[3]  M. Fortin,et al.  Measuring ecological niche overlap from occurrence and spatial environmental data , 2012 .

[4]  Damaris Zurell,et al.  Outstanding Challenges in the Transferability of Ecological Models. , 2018, Trends in ecology & evolution.

[5]  M. Carrete,et al.  Incorporating spatial constraints in different periods of the annual cycle improves species distribution model performance for a highly mobile bird species , 2014 .

[6]  M. Turner Landscape ecology: what is the state of the science? , 2005 .

[7]  Ferenc Jordán,et al.  Contribution of habitat patches to network connectivity: Redundancy and uniqueness of topological indices , 2011 .

[8]  J. Hoeting,et al.  FACTORS AFFECTING SPECIES DISTRIBUTION PREDICTIONS: A SIMULATION MODELING EXPERIMENT , 2005 .

[9]  Jennifer A. Miller Species Distribution Modeling , 2010 .

[10]  Karel Mokany,et al.  Appropriateness of full‐, partial‐ and no‐dispersal scenarios in climate change impact modelling , 2013 .

[11]  Jennifer L. Molnar,et al.  Marine Ecoregions of the World: A Bioregionalization of Coastal and Shelf Areas , 2007 .

[12]  D. Rogers,et al.  The effects of species’ range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact? , 2004 .

[13]  Steven R. Beissinger,et al.  Birds track their Grinnellian niche through a century of climate change , 2009, Proceedings of the National Academy of Sciences.

[14]  A. Peterson,et al.  An evaluation of transferability of ecological niche models , 2018, Ecography.

[15]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[16]  Shinichi Nakagawa,et al.  A general and simple method for obtaining R2 from generalized linear mixed‐effects models , 2013 .

[17]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[18]  A. Peterson,et al.  Use of niche models in invasive species risk assessments , 2011, Biological Invasions.

[19]  A. Guisan,et al.  Equilibrium modeling of alpine plant distribution: how far can we go? , 2000 .

[20]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[21]  Jorge Soberón Grinnellian and Eltonian niches and geographic distributions of species. , 2007, Ecology letters.

[22]  M. Wink,et al.  The role of landscape change and paleoclimatic events in shaping the evolutionary history of the Polioptila gnatcatchers (Passeriformes, Polioptilidae) with emphasis on species associated with open habitats , 2018 .

[23]  Gabriel Ramos-Fernández,et al.  Integrating expert knowledge and ecological niche models to estimate Mexican primates’ distribution , 2018, Primates.

[24]  Jorge Soberón,et al.  Niches and distributional areas: Concepts, methods, and assumptions , 2009, Proceedings of the National Academy of Sciences.

[25]  Simon Ferrier,et al.  Evaluating the predictive performance of habitat models developed using logistic regression , 2000 .

[26]  Omri Allouche,et al.  Incorporating distance constraints into species distribution models , 2008 .

[27]  Jane Elith,et al.  What do we gain from simplicity versus complexity in species distribution models , 2014 .

[28]  Piero Calosi,et al.  Thermal tolerance, acclimatory capacity and vulnerability to global climate change , 2008, Biology Letters.

[29]  M. Fortin,et al.  Modeling bird species distribution change in fire prone Mediterranean landscapes: incorporating species dispersal and landscape dynamics , 2012 .

[30]  Jennifer A. Miller,et al.  Incorporating movement in species distribution models: how do simulations of dispersal affect the accuracy and uncertainty of projections? , 2016, Int. J. Geogr. Inf. Sci..

[31]  Jorge Soberón Niche and area of distribution modeling: a population ecology perspective , 2010 .

[32]  A. Peterson,et al.  Species Distribution Modeling and Ecological Niche Modeling: Getting the Concepts Right , 2012 .

[33]  G. Powell,et al.  Terrestrial Ecoregions of the World: A New Map of Life on Earth , 2001 .

[34]  T. Hengl,et al.  Spatial prediction of species’ distributions from occurrence-only records: combining point pattern analysis, ENFA and regression-kriging , 2009 .

[35]  A. Guisan,et al.  MigClim: Predicting plant distribution and dispersal in a changing climate , 2009 .

[36]  Sunil Kumar,et al.  Caveats for correlative species distribution modeling , 2015, Ecol. Informatics.

[37]  J. Diniz‐Filho,et al.  Spatial analysis improves species distribution modelling during range expansion , 2008, Biology Letters.

[38]  Ben Collen,et al.  The Tropical Biodiversity Data Gap: Addressing Disparity in Global Monitoring , 2008 .

[39]  Divya Vasudev,et al.  From dispersal constraints to landscape connectivity: lessons from species distribution modeling , 2015 .

[40]  Susanne A. Fritz,et al.  An Update of Wallace’s Zoogeographic Regions of the World , 2013, Science.

[41]  J. Lawton,et al.  Range, population abundance and conservation. , 1993, Trends in ecology & evolution.

[42]  C. Orme,et al.  Predicting dispersal distance in mammals: a trait-based approach. , 2013, The Journal of animal ecology.

[43]  J. Andrew Royle,et al.  Site‐Occupancy Distribution Modeling to Correct Population‐Trend Estimates Derived from Opportunistic Observations , 2010, Conservation biology : the journal of the Society for Conservation Biology.

[44]  Santiago Saura,et al.  A new habitat availability index to integrate connectivity in landscape conservation planning : Comparison with existing indices and application to a case study , 2007 .

[45]  Jason M. Evans,et al.  Does accounting for imperfect detection improve species distribution models , 2011 .

[46]  J. Franklin Moving beyond static species distribution models in support of conservation biogeography , 2010 .

[47]  Eve McDonald-Madden,et al.  Predicting species distributions for conservation decisions , 2013, Ecology letters.

[48]  A. Peterson,et al.  The crucial role of the accessible area in ecological niche modeling and species distribution modeling , 2011 .

[49]  J. Svenning,et al.  Limited filling of the potential range in European tree species , 2004 .

[50]  M. Lima-Ribeiro,et al.  Predicting where species could go: climate is more important than dispersal for explaining the distribution of a South American turtle , 2018, Hydrobiologia.

[51]  Craig Moritz,et al.  Historical climate modelling predicts patterns of current biodiversity in the Brazilian Atlantic forest , 2008 .

[52]  Jacob van Etten,et al.  Application of consensus theory to formalize expert evaluations of plant species distribution models , 2014 .

[53]  J. Lobo,et al.  Historical bias in biodiversity inventories affects the observed environmental niche of the species , 2008 .

[54]  C. Nóbrega,et al.  Evaluating collinearity effects on species distribution models: An approach based on virtual species simulation , 2018, PloS one.

[55]  Brian J. McGill,et al.  Can niche-based distribution models outperform spatial interpolation? , 2007 .

[56]  Alberto Jiménez-Valverde,et al.  Not as good as they seem: the importance of concepts in species distribution modelling , 2008 .

[57]  C. Rahbek,et al.  Understanding (insect) species distributions across spatial scales , 2010 .

[58]  P. De Marco,et al.  Niche mismatches can impair our ability to predict potential invasions , 2019, Biological Invasions.

[59]  Jorge Soberón,et al.  Creating individual accessible area hypotheses improves stacked species distribution model performance , 2018 .

[60]  Santiago José Elías Velazco,et al.  ENMTML: An R package for a straightforward construction of complex ecological niche models , 2020, Environ. Model. Softw..

[61]  M. M. Weber,et al.  PAPER Have old species reached most environmentally suitable areas? A case study with South American phyllostomid bats , 2014 .

[62]  T. Rangel,et al.  Challenging Wallacean and Linnean shortfalls: knowledge gradients and conservation planning in a biodiversity hotspot , 2006 .

[63]  Walter Jetz,et al.  Global priorities for an effective information basis of biodiversity distributions , 2015, Nature Communications.

[64]  Mark John Costello Biodiversity: The Known, Unknown, and Rates of Extinction , 2015, Current Biology.

[65]  Daniel A Griffith,et al.  Spatial modeling in ecology: the flexibility of eigenfunction spatial analyses. , 2006, Ecology.

[66]  A. Hirzel,et al.  Habitat suitability modelling and niche theory , 2008 .

[67]  W. D. Kissling,et al.  The role of biotic interactions in shaping distributions and realised assemblages of species: implications for species distribution modelling , 2012, Biological reviews of the Cambridge Philosophical Society.

[68]  E. Revilla,et al.  A movement ecology paradigm for unifying organismal movement research , 2008, Proceedings of the National Academy of Sciences.

[69]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[70]  M. Araújo,et al.  The effects of model and data complexity on predictions from species distributions models , 2016 .

[71]  J. Andrew Royle,et al.  Likelihood analysis of species occurrence probability from presence‐only data for modelling species distributions , 2012, Methods in Ecology and Evolution.

[72]  Damaris Zurell,et al.  The virtual ecologist approach: simulating data and observers , 2010 .

[73]  Steven J. Phillips,et al.  Aligning Conservation Priorities Across Taxa in Madagascar with High-Resolution Planning Tools , 2008, Science.

[74]  D. W. Shimwell,et al.  Areography: Geographical Strategies of Species , 1983 .

[75]  A. Peterson,et al.  INTERPRETATION OF MODELS OF FUNDAMENTAL ECOLOGICAL NICHES AND SPECIES' DISTRIBUTIONAL AREAS , 2005 .

[76]  Alberto Jiménez-Valverde,et al.  The uncertain nature of absences and their importance in species distribution modelling , 2010 .

[77]  F. Schurr,et al.  Estimating demographic models for the range dynamics of plant species , 2010 .

[78]  Georgina M. Mace,et al.  Distorted Views of Biodiversity: Spatial and Temporal Bias in Species Occurrence Data , 2010, PLoS biology.