Prediction of potential areas of species distributions based on presence-only data

We introduce a methodology to infer zones of high potential for the habitat of a species, useful for management of biodiversity, conservation, biogeography, ecology, or sustainable use. Inference is based on a set of sites where the presence of the species has been reported. Each site is associated with covariate values, measured on discrete scales. We compute the predictive probability that the species is present at each node of a regular grid. Possible spatial bias for sites of presence is accounted for. Since the resulting posterior distribution does not have a closed form, a Markov chain Monte Carlo (MCMC) algorithm is implemented. However, we also describe an approximation to the posterior distribution, which avoids MCMC. Relevant features of the approach are that specific notions of data acquisition such as sampling intensity and detectability are accounted for, and that available a priori information regarding areas of distribution of the species is incorporated in a clear-cut way. These concepts, arising in the presence-only context, are not addressed in alternative methods. We also consider an uncertainty map, which measures the variability for the predictive probability at each node on the grid. A simulation study is carried out to test and compare our approach with other standard methods. Two case studies are also presented.

[1]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[2]  Anthony N. Pettitt,et al.  A Conditional Autoregressive Gaussian Process for Irregularly Spaced Multivariate Data with Application to Modelling Large Sets of Binary Data , 2002, Stat. Comput..

[3]  Jorge Soberón,et al.  The Importance of Opuntia in Mexico and Routes of Invasion and Impact of Cactoblastis cactorum (Lepidoptera: Pyralidae) , 2001 .

[4]  K. Gaston,et al.  Pattern and Process in Macroecology , 2000 .

[5]  V. D. Oliveira,et al.  Bayesian prediction of clipped Gaussian random fields , 2000 .

[6]  David R. B. Stockwell,et al.  The GARP modelling system: problems and solutions to automated spatial prediction , 1999, Int. J. Geogr. Inf. Sci..

[7]  Peter G. Jones,et al.  FloraMap: a computer tool for predicting the distribution of plants and other organisms in the wild , 2001 .

[8]  A. Peterson,et al.  Sensitivity of distributional prediction algorithms to geographic data completeness , 1999 .

[9]  G. Carpenter,et al.  DOMAIN: a flexible modelling procedure for mapping potential distributions of plants and animals , 1993, Biodiversity & Conservation.

[10]  Jesper Møller,et al.  Estimating Distribution Maps From Atlas Data Using Methods of Statistical Image Analysis , 1995 .

[11]  M. Austin Spatial prediction of species distribution: an interface between ecological theory and statistical modelling , 2002 .

[12]  David G. Green,et al.  Parallel computing in ecological simulation , 1990 .

[13]  V. Sánchez‐Cordero,et al.  Conservatism of ecological niches in evolutionary time , 1999, Science.

[14]  J. Heikkinen,et al.  Fully Bayesian Approach to Image Restoration with an Application in Biogeography , 1994 .

[15]  Dawn M. Kaufman,et al.  THE GEOGRAPHIC RANGE: Size, Shape, Boundaries, and Internal Structure , 1996 .

[16]  S. Lele,et al.  A Composite Likelihood Approach to Binary Spatial Data , 1998 .

[17]  David R. B. Stockwell,et al.  Induction of sets of rules from animal distribution data: a robust and informative method of data analysis , 1992 .

[18]  R. Jennrich,et al.  Measurement of non-circular home range. , 1969, Journal of theoretical biology.

[19]  J. Busby BIOCLIM - a bioclimate analysis and prediction system , 1991 .

[20]  V. Sánchez‐Cordero,et al.  Museum specimen data predict crop damage by tropical rodents. , 2000, Proceedings of the National Academy of Sciences of the United States of America.