A comparison between Ensemble and MaxEnt species distribution modelling approaches for conservation: A case study with Egyptian medicinal plants

Abstract Understanding the relationship between the geographical distribution of taxa and their environmental conditions is a key concept in ecology and conservation. The use of ensemble modelling methods for species distribution modelling (SDM) have been promoted over single algorithms such as Maximum Entropy (MaxEnt). Nevertheless, we suggest that in cases where data, technical support or computational power are limited, for example in developing countries, single algorithm methods produce robust and accurate distribution maps. We fit SDMs for 114 Egyptian medicinal plant species (nearly all native) with a total of 14,396 occurrence points. The predictive performances of eight single-algorithm methods (maxent, random forest (rf), support-vector machine (svm), maxlike, boosted regression trees (brt), classification and regression trees (cart), flexible discriminant analysis (fda) and generalised linear models (glm)) were compared to an ensemble modelling approach combining all eight algorithms. Predictions were based originally on the current climate, and then projected into the future time slice of 2050 based on four alternate climate change scenarios (A2a and B2a for CMIP3 and RCP 2.6 and RCP 8.5 for CMIP5). Ensemble modelling, MaxEnt and rf achieved the highest predictive performances based on AUC and TSS, while svm and cart had the poorest performance. There is high similarity in habitat suitability between MaxEnt and ensemble predictive maps for both current and future emission scenarios, but lower similarity between rf and ensemble, or rf and MaxEnt. We conclude that single-algorithm modelling methods, particularly MaxEnt, are capable of producing distribution maps of comparable accuracy to ensemble methods. Furthermore, the ease of use, reduced computational time and simplicity of methods like MaxEnt provides support for their use in scenarios when the choice of modelling methods, knowledge or computational power is limited but the need for robust and accurate conservation predictions is urgent.

[1]  Carsten F. Dormann,et al.  Improved species‐occurrence predictions in data‐poor regions: using large‐scale data and bias correction with down‐weighted Poisson regression and Maxent , 2018 .

[2]  Laura J. Pollock,et al.  Understanding co‐occurrence by modelling species simultaneously with a Joint Species Distribution Model (JSDM) , 2014 .

[3]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[4]  S. Lavorel,et al.  Effects of restricting environmental range of data to project current and future species distributions , 2004 .

[5]  Donald W. Marquaridt Generalized Inverses, Ridge Regression, Biased Linear Estimation, and Nonlinear Estimation , 1970 .

[6]  B. A. Hawkins,et al.  Species distribution modelling as a macroecological tool: a case study using New World amphibians , 2012 .

[7]  T. Reader,et al.  Modelling terrestrial reptile species richness, distributions and habitat suitability in Saudi Arabia , 2020 .

[8]  Yujun Yi,et al.  Maxent modeling for predicting the potential distribution of endangered medicinal plant (H. riparia Lour) in Yunnan, China , 2016 .

[9]  K. H. Batanouny,et al.  Wild medicinal plants in Egypt : an inventory to support conservation and sustainable use , 1999 .

[10]  Sunil Kumar,et al.  Predicting impacts of climate change on medicinal asclepiads of Pakistan using Maxent modeling , 2013 .

[11]  J. Calabrese,et al.  Stacking species distribution models and adjusting bias by linking them to macroecological models , 2014 .

[12]  Antoine Guisan,et al.  Measuring the relative effect of factors affecting species distribution model predictions , 2014 .

[13]  Eric Young,et al.  Predicting the future of species diversity: macroecological theory, climate change, and direct tests of alternative forecasting methods , 2009 .

[14]  G. De’ath MULTIVARIATE REGRESSION TREES: A NEW TECHNIQUE FOR MODELING SPECIES–ENVIRONMENT RELATIONSHIPS , 2002 .

[15]  Robert P. Anderson,et al.  Ecological Niches and Geographic Distributions , 2011 .

[16]  G. Langham,et al.  Stacked species distribution models and macroecological models provide congruent projections of avian species richness under climate change , 2015 .

[17]  Peter D. Wilson,et al.  Which species distribution models are more (or less) likely to project broad-scale, climate-induced shifts in species ranges? , 2016 .

[18]  Roger A. Baldwin,et al.  Use of Maximum Entropy Modeling in Wildlife Research , 2009, Entropy.

[19]  R. Halvorsen A strict maximum likelihood explanation of MaxEnt, and some implications for distribution modelling , 2013 .

[20]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[21]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[22]  Trevor Hastie,et al.  Generalized linear and generalized additive models in studies of species distributions: setting the scene , 2002 .

[23]  J. Engler,et al.  Mapping Species Distributions with MAXENT Using a Geographically Biased Sample of Presence Data: A Performance Assessment of Methods for Correcting Sampling Bias , 2014, PloS one.

[24]  Miroslav Dudík,et al.  Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation , 2008 .

[25]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[26]  Nicholas E. Young,et al.  Using the Maxent program for species distribution modelling to assess invasion risk , 2015 .

[27]  Robert P. Anderson,et al.  Maximum entropy modeling of species geographic distributions , 2006 .

[28]  S. Lek,et al.  Ensemble modelling of species distribution: the effects of geographical and environmental ranges , 2011 .

[29]  J. Andrew Royle,et al.  Likelihood analysis of species occurrence probability from presence‐only data for modelling species distributions , 2012, Methods in Ecology and Evolution.

[30]  T. Rangel,et al.  Partitioning and mapping uncertainties in ensembles of forecasts of species turnover under climate change , 2009 .

[31]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[32]  S. B. E. Din,et al.  A guide to the reptiles and amphibians of Egypt , 2006 .

[33]  Narkis S. Morales,et al.  MaxEnt’s parameter configuration and small samples: are we paying attention to recommendations? A systematic review , 2016, bioRxiv.

[34]  Patrick A. Zollner,et al.  Improving the forecast for biodiversity under climate change , 2016, Science.

[35]  N. Raes,et al.  Using species distribution modeling to improve conservation and land use planning of Yunnan, China , 2012 .

[36]  J. Elith,et al.  A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD , 2019, Diversity and Distributions.

[37]  M. Papes,et al.  Tracking a Medically Important Spider: Climate Change, Ecological Niche Modeling, and the Brown Recluse (Loxosceles reclusa) , 2011, PloS one.

[38]  Mauro Fois,et al.  Using species distribution models at local scale to guide the search of poorly known species: Review, methodological issues and future directions , 2018, Ecological Modelling.

[39]  Wenguo Wang,et al.  Modeling impacts of climate change on the geographic distribution of medicinal plant Fritillaria cirrhosa D. Don , 2018 .

[40]  C. Graham,et al.  The ability of climate envelope models to predict the effect of climate change on species distributions , 2006 .

[41]  H. Reiss,et al.  Species distribution modelling of marine benthos: a North Sea case study , 2011 .

[42]  J. Elith,et al.  Species Distribution Models: Ecological Explanation and Prediction Across Space and Time , 2009 .

[43]  C. Graham,et al.  New trends in species distribution modelling , 2010 .

[44]  Mathieu Marmion,et al.  Evaluation of consensus methods in predictive species distribution modelling , 2009 .

[45]  F. Gilbert,et al.  Predicting the distributions of Egypt's medicinal plants and their potential shifts under future climate change , 2017, PloS one.

[46]  David N. Bucklin,et al.  Comparing species distribution models constructed with different subsets of environmental predictors , 2015 .

[47]  F. Gilbert,et al.  Using species distribution models to assess the importance of Egypt’s Protected Areas for the conservation of medicinal plants , 2016 .

[48]  Chandra Prakash Kala,et al.  Influence of climate change on production of secondary chemicals in high altitude medicinal plants: Issues needs immediate attention , 2010 .

[49]  F. Gilbert,et al.  Assessment of the extinction risks of medicinal plants in Egypt under climate change by integrating species distribution models and IUCN Red List criteria , 2019 .

[50]  D. R. Cutler,et al.  Effects of sample survey design on the accuracy of classification tree models in species distribution models , 2006 .

[51]  E. Kaky,et al.  Potential habitat suitability of Iraqi amphibians under climate change , 2020 .

[52]  A. Vanreusel,et al.  Null models reveal preferential sampling, spatial autocorrelation and overfitting in habitat suitability modelling , 2011 .

[53]  M. Araújo,et al.  Individualistic vs community modelling of species distributions under climate change. , 2009 .

[54]  A. Townsend Peterson,et al.  The influence of spatial errors in species occurrence data used in distribution models , 2007 .

[55]  J. Tao,et al.  Maxent modeling for predicting the potential geographical distribution of two peony species under climate change. , 2018, The Science of the total environment.

[56]  J. L. Parra,et al.  Very high resolution interpolated climate surfaces for global land areas , 2005 .

[57]  T. Reader,et al.  Testing the accuracy of species distribution models using species records from a new field survey , 2010 .

[58]  A. Guisan,et al.  An improved approach for predicting the distribution of rare and endangered species from occurrence and pseudo-absence data , 2004 .

[59]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[60]  Shanshan Wu,et al.  Building statistical models to analyze species distributions. , 2006, Ecological applications : a publication of the Ecological Society of America.

[61]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[62]  Trevor Hastie,et al.  Making better biogeographical predictions of species’ distributions , 2006 .

[63]  E. Harvey,et al.  Habitat suitability for marine fishes using presence-only modelling and multibeam sonar , 2010 .

[64]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[65]  M. White,et al.  Measuring and comparing the accuracy of species distribution models with presence–absence data , 2011 .

[66]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[67]  Richard E. Glor,et al.  ENMTools: a toolbox for comparative studies of environmental niche models , 2010 .

[68]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[69]  M. Schwartz,et al.  Advances in climate models from CMIP3 to CMIP5 do not change predictions of future habitat suitability for California reptiles and amphibians , 2016, Climatic Change.

[70]  O. Schmitz,et al.  The limits of direct community modeling approaches for broad-scale predictions of ecological assemblage structure , 2016 .

[71]  B. Huntley,et al.  Do ecological differences between taxonomic groups influence the relationship between species’ distributions and climate? A global meta‐analysis using species distribution models , 2013 .

[72]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[73]  Loutfy Boulos,et al.  Flora of Egypt , 1999 .

[74]  Jane Elith,et al.  Maxent is not a presence–absence method: a comment on Thibaud et al. , 2014 .

[75]  M. Austin,et al.  Improving species distribution models for climate change studies: variable selection and scale , 2011 .

[76]  Antoine Guisan,et al.  SESAM – a new framework integrating macroecological and species distribution models for predicting spatio‐temporal patterns of species assemblages , 2011 .

[77]  A. Lehmann,et al.  Predicting species spatial distributions using presence-only data: a case study of native New Zealand ferns , 2002 .

[78]  A. Peterson,et al.  Effects of sample size on the performance of species distribution models , 2008 .

[79]  R. Tibshirani,et al.  Flexible Discriminant Analysis by Optimal Scoring , 1994 .

[80]  W. Thuiller BIOMOD – optimizing predictions of species distributions and projecting potential future shifts under global change , 2003 .

[81]  J. Elith,et al.  Do they? How do they? WHY do they differ? On finding reasons for differing performances of species distribution models , 2009 .

[82]  Mauro Fois,et al.  Using MaxEnt modeling to predict the potential distribution of the endemic plant Rosa arabica Crép. in Egypt , 2019, Ecol. Informatics.

[83]  J. Andrew Royle,et al.  Presence‐only modelling using MAXENT: when can we trust the inferences? , 2013 .

[84]  Catherine S. Jarnevich,et al.  Ensemble Habitat Mapping of Invasive Plant Species , 2010, Risk analysis : an official publication of the Society for Risk Analysis.

[85]  Niels Raes,et al.  Minimum required number of specimen records to develop accurate species distribution models , 2016 .

[86]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[87]  A. Ellison,et al.  MaxEnt versus MaxLike: empirical comparisons with ant species distributions , 2013 .

[88]  Mark New,et al.  Ensemble forecasting of species distributions. , 2007, Trends in ecology & evolution.

[89]  Francis K. C. Hui,et al.  So Many Variables: Joint Modeling in Community Ecology. , 2015, Trends in ecology & evolution.

[90]  M. Turelli,et al.  Environmental Niche Equivalency versus Conservatism: Quantitative Approaches to Niche Evolution , 2008, Evolution; international journal of organic evolution.

[91]  E. Martínez‐Meyer Climate Change and Biodiversity: Some Considerations in Forecasting Shifts in Species' Potential Distributions , 2005 .

[92]  Matthew J. Smith,et al.  Protected areas network is not adequate to protect a critically endangered East Africa Chelonian: Modelling distribution of pancake tortoise, Malacochersus tornieri under current and future climates , 2013, bioRxiv.

[93]  T. Schoener The Anolis Lizards of Bimini: Resource Partitioning in a Complex Fauna , 1968 .

[94]  S. Gairola,et al.  Traditional knowledge and use of medicinal plants in the Eastern Desert of Egypt: a case study from Wadi El-Gemal National Park , 2013 .

[95]  M. Hooten,et al.  Hierarchical Species Distribution Models , 2016 .

[96]  J. Elith,et al.  Species distribution modeling with R , 2016 .

[97]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[98]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[99]  M. Araújo,et al.  An evaluation of methods for modelling species distributions , 2004 .

[100]  Trevor Hastie,et al.  A statistical explanation of MaxEnt for ecologists , 2011 .

[101]  J. González‐Irusta,et al.  Comparing species distribution models: a case study of four deep sea urchin species , 2015, Hydrobiologia.

[102]  Blas M. Benito,et al.  The impact of modelling choices in the predictive performance of richness maps derived from species‐distribution models: guidelines to build better diversity models , 2013 .

[103]  Emad Kaky,et al.  Allowing for human socioeconomic impacts in the conservation of plants under climate change , 2020, Plant Biosystems - An International Journal Dealing with all Aspects of Plant Biology.

[104]  Francis Gilbert,et al.  Conserving Egypt's reptiles under climate change , 2016 .

[105]  T. Dawson,et al.  Predicting the impacts of climate change on the distribution of species: are bioclimate envelope models useful? , 2003 .

[106]  S. Lek,et al.  Uncertainty in ensemble forecasting of species distribution , 2010 .

[107]  P. I. Miller,et al.  Comparison of five modelling techniques to predict the spatial distribution and abundance of seabirds , 2012 .

[108]  M. Araújo,et al.  Reducing uncertainty in projections of extinction risk from climate change , 2005 .

[109]  L. Hannah Climate Change Biology , 2021 .

[110]  David B. Dunson,et al.  A comprehensive evaluation of predictive performance of 33 species distribution models at species and community levels , 2019, Ecological Monographs.

[111]  Miguel B. Araújo,et al.  sdm: a reproducible and extensible R platform for species distribution modelling , 2016 .

[112]  D. Nogues‐Bravo,et al.  Applications of species distribution modeling to paleobiology , 2011 .

[113]  Antoine Guisan,et al.  Overcoming limitations of modelling rare species by using ensembles of small models , 2015 .