Improving prediction of rare species’ distribution from community data

Species distribution models (SDMs) have been increasingly used to predict the geographic distribution of a wide range of organisms; however, relatively fewer research efforts have concentrated on rare species despite their critical roles in biological conservation. The present study tested whether community data may improve modelling rare species by sharing information among common and rare ones. We chose six SDMs that treat community data in different ways, including two traditional single-species models (random forest and artificial neural network) and four joint species distribution models that incorporate species associations implicitly (multivariate random forest and multi-response artificial neural network) or explicitly (hierarchical modelling of species communities and generalized joint attribute model). In addition, we evaluated two approaches of data arrangement, species filtering and conditional prediction, to enhance the selected models. The model predictions were tested using cross validation based on empirical data collected from marine fisheries surveys, and the effects of community data were evaluated by comparing models for six selected rare species. The results demonstrated that the community data improved the predictions of rare species’ distributions to certain extent but might also be unhelpful in some cases. The rare species could be appropriately predicted in terms of occurrence, whereas their abundance tended to be underestimated by most models. Species filtering and conditional predictions substantially benefited the predictive performances of multiple- and single-species models, respectively. We conclude that both the modelling algorithms and community data need to be carefully selected in order to deliver improvement in modelling rare species. The study highlights the opportunity and challenges to improve prediction of rare species’ distribution by making the most of community data.

[1]  M. Cameletti,et al.  Spatial and Spatio-temporal Bayesian Models with R - INLA , 2015 .

[2]  Neil Reid,et al.  Modelling the influence of biotic factors on species distribution patterns , 2016 .

[3]  A. Lehmann,et al.  Using Niche‐Based Models to Improve the Sampling of Rare Species , 2006, Conservation biology : the journal of the Society for Conservation Biology.

[4]  Erin M. Schliep,et al.  Joint species distribution modelling for spatio‐temporal occurrence and ordinal abundance data , 2018 .

[5]  Anne-Laure Boulesteix,et al.  Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics , 2012, WIREs Data Mining Knowl. Discov..

[6]  BoulesteixAnne-Laure,et al.  Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics , 2012 .

[7]  Timothy G. Howard,et al.  Development and Delivery of Species Distribution Models to Inform Decision-Making , 2019, BioScience.

[8]  M. Araújo,et al.  The importance of biotic interactions for modelling species distributions under climate change , 2007 .

[9]  P. Kinas,et al.  Bayesian spatial predictive models for data-poor fisheries , 2017 .

[10]  Mark J. Costello,et al.  A Systematic Review of Marine-Based Species Distribution Models (SDMs) with Recommendations for Best Practice , 2017, Front. Mar. Sci..

[11]  L. Ancillotto,et al.  Modelling risks posed by wind turbines and power lines to soaring birds: the black stork (Ciconia nigra) in Italy as a case study , 2020, Biodiversity and Conservation.

[12]  Can Ozan Tan,et al.  Methodological issues in building, training, and testing artificial neural networks in ecological applications , 2005, q-bio/0510017.

[13]  G. N. Rao,et al.  Neural networks in fisheries research , 2008 .

[14]  Marc Mangel,et al.  Overcoming the Data Crisis in Biodiversity Conservation. , 2018, Trends in ecology & evolution.

[15]  C. Rahbek,et al.  Spatial predictions at the community level: from current approaches to future frameworks , 2017, Biological reviews of the Cambridge Philosophical Society.

[16]  Michael A. McCarthy,et al.  A comparison of joint species distribution models for presence–absence data , 2018, Methods in Ecology and Evolution.

[17]  Donald A. Jackson,et al.  How many principal components? stopping rules for determining the number of non-trivial axes revisited , 2005, Comput. Stat. Data Anal..

[18]  J. Olden A Species‐Specific Approach to Modeling Biological Communities and Its Potential for Conservation , 2003 .

[19]  Anna Norberg,et al.  How to make more out of community data? A conceptual framework and its implementation as models and software. , 2017, Ecology letters.

[20]  J. Andrew Royle,et al.  Modelling occurrence and abundance of species when detection is imperfect , 2005 .

[21]  David J. Harris Generating realistic assemblages with a joint species distribution model , 2015 .

[22]  David B. Dunson,et al.  A comprehensive evaluation of predictive performance of 33 species distribution models at species and community levels , 2019, Ecological Monographs.

[23]  Gianluca Baio,et al.  Spatial and spatio-temporal models with R-INLA. , 2013, Spatial and spatio-temporal epidemiology.

[24]  S. Ferrier,et al.  Controlled comparison of species- and community-level models across novel climates and communities , 2016, Proceedings of the Royal Society B: Biological Sciences.

[25]  Otso Ovaskainen,et al.  Making more out of sparse data: hierarchical modeling of species communities. , 2011, Ecology.

[26]  J. Franklin,et al.  Effect of species rarity on the accuracy of species distribution models for reptiles and amphibians in southern California , 2009 .

[27]  Ian Phillip Vaughan,et al.  The continuing challenges of testing species distribution models , 2005 .

[28]  Antoine Guisan,et al.  Overcoming limitations of modelling rare species by using ensembles of small models , 2015 .

[29]  J. Blois,et al.  Multiresponse algorithms for community‐level modelling: Review of theory, applications, and comparison to species distribution models , 2017 .

[30]  A. Guisan,et al.  An improved approach for predicting the distribution of rare and endangered species from occurrence and pseudo-absence data , 2004 .

[31]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[32]  Myrna E. Watanabe The Nagoya Protocol: The Conundrum of Defining Digital Sequence Information , 2019, BioScience.

[33]  Francis K. C. Hui,et al.  boral – Bayesian Ordination and Regression Analysis of Multivariate Abundance Data in r , 2016 .

[34]  R. A. Garcia,et al.  Climate change vulnerability assessment of species , 2018, WIREs Climate Change.

[35]  Adam Morawiec Some Statistical Issues , 2004 .

[36]  Binduo Xu,et al.  Comparing the prediction of joint species distribution models with respect to characteristics of sampling data , 2018 .

[37]  Damaris Zurell,et al.  Outstanding Challenges in the Transferability of Ecological Models. , 2018, Trends in ecology & evolution.

[38]  M. Schwartz,et al.  Using species distribution models to predict new occurrences for rare plants , 2009 .

[39]  Antoine Guisan,et al.  Overcoming the rare species modelling paradox: a novel hierarchical framework applied to an Iberian endemic plant. , 2010 .

[40]  M. Payne,et al.  The predictive skill of species distribution models for plankton in a changing climate , 2016, Global change biology.

[41]  Brendan A. Wintle,et al.  Is my species distribution model fit for purpose? Matching data and models to applications , 2015 .

[42]  I A Basheer,et al.  Artificial neural networks: fundamentals, computing, design, and application. , 2000, Journal of microbiological methods.

[43]  P. Hernandez,et al.  The effect of sample size and species characteristics on performance of different species distribution modeling methods , 2006 .

[44]  J. Thorson,et al.  Model‐based inference for estimating shifts in species distribution, area occupied and centre of gravity , 2016 .

[45]  J. Elith,et al.  Species Distribution Models: Ecological Explanation and Prediction Across Space and Time , 2009 .

[46]  Eve McDonald-Madden,et al.  Predicting species distributions for conservation decisions , 2013, Ecology letters.

[47]  Ranadip Pal,et al.  IntegratedMRF: random forest‐based framework for integrating prediction from different data types , 2017, Bioinform..

[48]  David B. Lindenmayer,et al.  MODELING COUNT DATA OF RARE SPECIES: SOME STATISTICAL ISSUES , 2005 .

[49]  Mark R. Segal,et al.  Multivariate random forests , 2011, WIREs Data Mining Knowl. Discov..

[50]  melanie GoGol-prokurat Predicting habitat suitability for rare plants at local spatial scales using a species distribution model. , 2011, Ecological applications : a publication of the Ecological Society of America.

[51]  Steven J. Phillips,et al.  Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. , 2009, Ecological applications : a publication of the Ecological Society of America.

[52]  Francis K. C. Hui,et al.  So Many Variables: Joint Modeling in Community Ecology. , 2015, Trends in ecology & evolution.

[53]  Gianluca Baio,et al.  Erratum to “Spatial and spatio-temporal models with R-INLA” [Spat Spatio-tempor Epidemiol 4 (2013) 33–49] , 2013 .

[54]  James S. Clark,et al.  Generalized joint attribute modeling for biodiversity analysis: median-zero, multivariate, multifarious data , 2017 .

[55]  C. Meynard,et al.  Predicting species distributions: a critical comparison of the most common statistical models using artificial species , 2007 .

[56]  K. Gaston What is rarity , 1997 .

[57]  G. Bogliani,et al.  Identifying hotspots for rare species under climate change scenarios: improving saproxylic beetle conservation in Italy , 2018, Biodiversity and Conservation.

[58]  Szymon Smoliński,et al.  Spatial prediction of demersal fish diversity in the Baltic Sea: comparison of machine learning and regression‐based techniques , 2017 .

[59]  G. Mazza,et al.  An African bat in Europe, Plecotus gaisleri: Biogeographic and ecological insights from molecular taxonomy and Species Distribution Models , 2020, Ecology and evolution.

[60]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[61]  Jakub Stoklosa,et al.  Model-based thinking for community ecology , 2014, Plant Ecology.

[62]  Rosa M. Chefaoui,et al.  Assessing the effects of pseudo-absences on predictive distribution model performance , 2008 .

[63]  S. Cairns,et al.  Seven forms of rarity and their frequency in the flora of the British Isles , 1986 .

[64]  Antoine Guisan,et al.  Measuring the relative effect of factors affecting species distribution model predictions , 2014 .

[65]  David B. Roy,et al.  Uncovering hidden spatial structure in species communities with spatially explicit joint species distribution models , 2016 .

[66]  J. Elith,et al.  A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD , 2019, Diversity and Distributions.

[67]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[68]  A. Magurran,et al.  Explaining the excess of rare species in natural species abundance distributions , 2003, Nature.

[69]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[70]  平館 道子,et al.  Predictive Analysis , 1983, Encyclopedia of GIS.

[71]  D. Lindenmayer,et al.  Modelling the abundance of rare species: statistical models for counts with extra zeros , 1996 .

[72]  W. Thuiller,et al.  Predicting species distribution: offering more than simple habitat models. , 2005, Ecology letters.

[73]  K. Marske,et al.  Persistence of genetic diversity and phylogeographic structure of three New Zealand forest beetles under climate change , 2018, Diversity and Distributions.

[74]  James s. Clark,et al.  Why species tell more about traits than traits about species: predictive analysis. , 2016, Ecology.

[75]  Francis K C Hui,et al.  To mix or not to mix: comparing the predictive performance of mixture models vs. separate species distribution models. , 2013, Ecology.

[76]  David P. Larsen,et al.  Rare species in multivariate analysis for bioassessment: some considerations , 2001, Journal of the North American Benthological Society.

[77]  Robert P. Anderson,et al.  When and how should biotic interactions be considered in models of species niches and distributions? , 2017 .

[78]  Johan Ehrlén,et al.  Predicting changes in the distribution and abundance of species under environmental change , 2015, Ecology letters.