How to best threshold and validate stacked species assemblages? Community optimisation might hold the answer

The popularity of species distribution models (SDMs) and the associated stacked species distribution models (S‐SDMs), as tools for community ecologists, largely increased in recent years. However, while some consensus was reached about the best methods to threshold and evaluate individual SDMs, little agreement exists on how to best assemble individual SDMs into communities, that is, how to build and assess S‐SDM predictions. Here, we used published data of insects and plants collected within the same study region to test (a) if the most established thresholding methods to optimize single species prediction are also the best choice for predicting species assemblage composition, or if community‐based thresholding can be a better alternative, and (b) whether the optimal thresholding method depends on taxa, prevalence distribution and/or species richness. Based on a comparison of different evaluation approaches, we provide guidelines for a robust community cross‐validation framework, to use if spatial or temporal independent data are unavailable. Our results showed that the selection of the “optimal” assembly strategy mostly depends on the evaluation approach rather than taxa, prevalence distribution, regional species pool or species richness. If evaluated with independent data or reliable cross‐validation, community‐based thresholding seems superior compared to single species optimisation. However, many published studies did not evaluate community projections with independent data, often leading to overoptimistic community evaluation metrics based on single species optimisation. The fact that most of the reviewed S‐SDM studies reported over‐fitted community evaluation metrics highlights the importance of developing clear evaluation guidelines for community models. Here, we move a first step in this direction, providing a framework for cross‐validation at the community level.

[1]  L. Maiorano,et al.  Improving spatial predictions of taxonomic, functional and phylogenetic diversity , 2018 .

[2]  N. Zimmermann,et al.  Habitat Suitability and Distribution Models: With Applications in R , 2017 .

[3]  A. Guisan,et al.  Biodiversity Models: What If Unsaturation Is the Rule? , 2017, Trends in ecology & evolution.

[4]  Antoine Guisan,et al.  ecospat: an R package to support spatial analyses and modeling of species niches and distributions , 2017 .

[5]  P. Vittoz,et al.  Assessing and predicting shifts in mountain forest composition across 25 years of climate change , 2017 .

[6]  C. Rahbek,et al.  Spatial predictions at the community level: from current approaches to future frameworks , 2017, Biological reviews of the Cambridge Philosophical Society.

[7]  J. Diez,et al.  When Climate Reshuffles Competitors: A Call for Experimental Macroecology. , 2016, Trends in ecology & evolution.

[8]  N. Zimmermann,et al.  Effects of functional traits on the prediction accuracy of species richness models , 2016 .

[9]  Wilfried Thuiller,et al.  From species distributions to meta-communities. , 2015, Ecology letters.

[10]  A. Guisan,et al.  Predicting richness and composition in mountain insect communities at high resolution: a new test of the SESAM framework , 2015 .

[11]  Philippe Birnbaum,et al.  Accounting for the indirect area effect in stacked species distribution models to map species richness in a montane biodiversity hotspot , 2015 .

[12]  Á. Felicísimo,et al.  How to describe species richness patterns for bryophyte conservation? , 2015, Ecology and evolution.

[13]  Antoine Guisan,et al.  Overcoming limitations of modelling rare species by using ensembles of small models , 2015 .

[14]  Heidi K. Mod,et al.  Biotic interactions boost spatial models of species richness , 2015 .

[15]  Rui F. Fernandes,et al.  Using species richness and functional traits predictions to constrain assemblage predictions from stacked species distribution models , 2015 .

[16]  J. Fromentin,et al.  Mapping diversity indices: not a trivial issue , 2015 .

[17]  G. Langham,et al.  Stacked species distribution models and macroecological models provide congruent projections of avian species richness under climate change , 2015 .

[18]  Jane Elith,et al.  What do we gain from simplicity versus complexity in species distribution models , 2014 .

[19]  Stefan Sperlich,et al.  Generalized Additive Models , 2014 .

[20]  Anna F. Cord,et al.  Remote sensing data can improve predictions of species richness by stacked species distribution models: a case study for Mexican pines , 2014 .

[21]  Brendan A. Wintle,et al.  Predicting species distributions for conservation decisions , 2013, Ecology letters.

[22]  S. Finnegan,et al.  Climate Change and the Past, Present, and Future of Biotic Interactions , 2013, Science.

[23]  Blas M. Benito,et al.  The impact of modelling choices in the predictive performance of richness maps derived from species‐distribution models: guidelines to build better diversity models , 2013 .

[24]  M. White,et al.  Selecting thresholds for the prediction of species occurrence with presence‐only data , 2013 .

[25]  Antoine Guisan,et al.  A new spin on a compositionalist predictive modelling framework for conservation planning: A tropical case study in Ecuador , 2013 .

[26]  N. Pettorelli,et al.  Essential Biodiversity Variables , 2013, Science.

[27]  L. Maiorano,et al.  Climate-based empirical models show biased predictions of butterfly communities along environmental gradients , 2012 .

[28]  C. Plutzar,et al.  Extinction debt of high-mountain plants under twenty-first-century climate change , 2012 .

[29]  Á. Felicísimo,et al.  Do Stacked Species Distribution Models Reflect Altitudinal Diversity Patterns? , 2012, PloS one.

[30]  C. Rahbek,et al.  Communities Under Climate Change , 2011, Science.

[31]  A. Guisan,et al.  Predicting spatial patterns of plant species richness: a comparison of direct macroecological and species stacking modelling approaches , 2011 .

[32]  M. Araújo,et al.  Choice of threshold alters projections of species range shifts under climate change , 2011 .

[33]  Antoine Guisan,et al.  SESAM – a new framework integrating macroecological and species distribution models for predicting spatio‐temporal patterns of species assemblages , 2011 .

[34]  Simon Ferrier,et al.  Predicting impacts of climate change on biodiversity: a role for semi‐mechanistic community‐level modelling , 2011 .

[35]  N. Zimmermann,et al.  Will climate change reduce the efficacy of protected areas for amphibian conservation in Italy , 2011 .

[36]  C. Graham,et al.  New trends in species distribution modelling , 2010 .

[37]  M. Araújo,et al.  Biotic and abiotic variables show little redundancy in explaining tree species distributions , 2010 .

[38]  Antoine Guisan,et al.  Overcoming the rare species modelling paradox: a novel hierarchical framework applied to an Iberian endemic plant. , 2010 .

[39]  M. Visser,et al.  Predicting species distribution and abundance responses to climate change: why it is essential to include biotic interactions across trophic levels , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[40]  Philippe Hartmann,et al.  L'Observatoire de l'écosystème forestier du canton de Vaud: espace de recherche appliquée | The Forest Ecosystem Observatory in Canton Vaud: a field of applied research , 2009 .

[41]  Walter Jetz,et al.  Patterns and causes of species richness: a general simulation model for macroecology. , 2009, Ecology letters.

[42]  M. Kearney,et al.  Mechanistic niche modelling: combining physiological and spatial data to predict species' ranges. , 2009, Ecology letters.

[43]  J. Elith,et al.  Species Distribution Models: Ecological Explanation and Prediction Across Space and Time , 2009 .

[44]  Mathieu Marmion,et al.  Evaluation of consensus methods in predictive species distribution modelling , 2009 .

[45]  Gretchen G. Moisen,et al.  A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and Kappa , 2008 .

[46]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[47]  J. Lobo,et al.  Threshold criteria for conversion of probability of species presence to either–or presence–absence , 2007 .

[48]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[49]  A. Lehmann,et al.  Improving generalized regression analysis for the spatial prediction of forest communities , 2006 .

[50]  Barry R. Noon,et al.  Utility and limitations of species richness metrics for conservation planning , 2006 .

[51]  Antoine Guisan,et al.  Spatial modelling of biodiversity at the community level , 2006 .

[52]  Trevor Hastie,et al.  Making better biogeographical predictions of species’ distributions , 2006 .

[53]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[54]  W. Thuiller,et al.  Predicting species distribution: offering more than simple habitat models. , 2005, Ecology letters.

[55]  M. Sykes,et al.  Climate change threats to plant diversity in Europe. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[56]  T. Dawson,et al.  Selecting thresholds of occurrence in the prediction of species distributions , 2005 .

[57]  Richard Field,et al.  Predictions and tests of climate‐based hypotheses of broad‐scale variation in taxonomic richness , 2004 .

[58]  D. Rogers,et al.  The effects of species’ range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact? , 2004 .

[59]  John Casani,et al.  A New Spin , 2004 .

[60]  B. A. Hawkins,et al.  Water–energy balance and the geographic pattern of species richness of western Palearctic butterflies , 2003 .

[61]  A. Magurran,et al.  Explaining the excess of rare species in natural species abundance distributions , 2003, Nature.

[62]  G. Yohe,et al.  A globally coherent fingerprint of climate change impacts across natural systems , 2003, Nature.

[63]  Michael Drielsma,et al.  Extended statistical approaches to modelling spatial pattern in biodiversity in northeast New South Wales. II. Community-level modelling , 2002, Biodiversity & Conservation.

[64]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[65]  L. Breiman Random Forests , 2001, Machine Learning.

[66]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .

[67]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[68]  S. Pickett,et al.  Ecology: Individuals, populations and communities , 1987 .

[69]  J. Turner,et al.  Does Solar Energy Control Organic Diversity? Butterflies, Moths and the British Climate , 1987 .

[70]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[71]  F. W. Preston The Commonness, And Rarity, of Species , 1948 .

[72]  J. Calabrese,et al.  Stacking species distribution models and adjusting bias by linking them to macroecological models , 2014 .

[73]  Antoine Guisan,et al.  Quels facteurs influencent la richesse en orthoptères des Préalpes vaudoises , 2013 .

[74]  Antoine Guisan,et al.  The accuracy of plant assemblage prediction from species distribution models varies along environmental gradients , 2013 .

[75]  Damaris Zurell,et al.  Collinearity: a review of methods to deal with it and a simulation study evaluating their performance , 2013 .

[76]  Eduardo Pineda,et al.  Assessing the accuracy of species distribution models to predict amphibian species richness patterns. , 2009, The Journal of animal ecology.

[77]  Susan P. Johnson,et al.  Population Ecology , 2004 .

[78]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .