Spatial hidden Markov models and species distributions

ABSTRACT A spatial hidden Markov model (SHMM) is introduced to analyse the distribution of a species on an atlas, taking into account that false observations and false non-detections of the species can occur during the survey, blurring the true map of presence and absence of the species. The reconstruction of the true map is tackled as the restoration of a degraded pixel image, where the true map is an autologistic model, hidden behind the observed map, whose normalizing constant is efficiently computed by simulating an auxiliary map. The distribution of the species is explained under the Bayesian paradigm and Markov chain Monte Carlo (MCMC) algorithms are developed. We are interested in the spatial distribution of the bird species Greywing Francolin in the south of Africa. Many climatic and land-use explanatory variables are also available: they are included in the SHMM and a subset of them is selected by the mutation operators within the MCMC algorithm.

[1]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[2]  J. Heikkinen,et al.  Fully Bayesian Approach to Image Restoration with an Application in Biogeography , 1994 .

[3]  David Bruce Wilson,et al.  Exact sampling with coupled Markov chains and applications to statistical mechanics , 1996, Random Struct. Algorithms.

[4]  Anthony N. Pettitt,et al.  Classification using distance nearest neighbours , 2010, Stat. Comput..

[5]  David A. Elston,et al.  Empirical models for the spatial distribution of wildlife , 1993 .

[6]  Recent Bayesian approaches for spatial analysis of 2-D images with application to environmental modelling , 2015, Environmental and Ecological Statistics.

[7]  L. Spezia,et al.  Mapping species distributions in one dimension by non-homogeneous hidden Markov models: the case of freshwater pearl mussels in the River Dee , 2014, Environmental and Ecological Statistics.

[8]  GLEN A. SARGEANT,et al.  MARKOV CHAIN MONTE CARLO ESTIMATION OF SPECIES DISTRIBUTIONS: A CASE STUDY OF THE SWIFT FOX IN WESTERN KANSAS , 2005 .

[9]  Kiva L. Oken,et al.  A Bayesian hierarchical occupancy model for track surveys conducted in a series of linear, spatially correlated, sites , 2011 .

[10]  R. Matsika Land-cover change: threats to the grassland biome of South Africa , 2008 .

[11]  Stephen M. Smith,et al.  Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.

[12]  S. T. Buckland,et al.  An autologistic model for the spatial distribution of wildlife , 1996 .

[13]  Mevin B. Hooten,et al.  Spatial occupancy models for large data sets , 2013 .

[14]  R. Dowsett,et al.  The atlas of southern African birds , 2001 .

[15]  D. M. Titterington,et al.  Computational Bayesian Analysis of Hidden Markov Models , 1998 .

[16]  H. Possingham,et al.  IMPROVING PRECISION AND REDUCING BIAS IN BIOLOGICAL SURVEYS: ESTIMATING FALSE‐NEGATIVE ERROR RATES , 2003 .

[17]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[18]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[19]  Adrian E. Raftery,et al.  Approximate Bayes Factors for Image Segmentation: The Pseudolikelihood Information Criterion (PLIC) , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Sophie Ancelet,et al.  Bayesian Clustering Using Hidden Markov Random Fields in Spatial Population Genetics , 2006, Genetics.

[21]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[22]  J Andrew Royle,et al.  Generalized site occupancy models allowing for false positive and false negative errors. , 2006, Ecology.

[23]  J. Ray,et al.  Modeling Wolverine Occurrence Using Aerial Surveys of Tracks in Snow , 2007 .

[24]  Jennifer A. Hoeting,et al.  An Improved Model for Spatially Correlated Binary Responses , 2000 .

[25]  Hulin Wu,et al.  Modelling the distribution of plant species using the autologistic regression model , 1997, Environmental and Ecological Statistics.

[26]  Xiao-Li Meng,et al.  Simulating Normalizing Constants: From Importance Sampling to Bridge Sampling to Path Sampling , 1998 .

[27]  Anthony N. Pettitt,et al.  Efficient recursions for general factorisable models , 2004 .

[28]  R. O’Hara,et al.  A review of Bayesian variable selection methods: what, how and which , 2009 .

[29]  Richard G. Everitt,et al.  Bayesian Parameter Estimation for Latent Markov Random Fields and Social Networks , 2012, ArXiv.

[30]  A. Gimona,et al.  Opening the climate envelope reveals no macroscale associations with climate in European birds , 2008, Proceedings of the National Academy of Sciences.

[31]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[32]  Jonathan M. Graham,et al.  AUTOLOGISTIC MODEL OF SPATIAL PATTERN OF PHYTOPHTHORA EPIDEMIC IN BELL PEPPER: EFFECTS OF SOIL VARIABLES ONDISEASE PRESENCE , 1994 .

[33]  J. Møller,et al.  An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants , 2006 .

[34]  Anthony N. Pettitt,et al.  Bayesian Inference in Hidden Markov Random Fields for Binary Data Defined on Large Lattices , 2009 .

[35]  Zoubin Ghahramani,et al.  MCMC for Doubly-intractable Distributions , 2006, UAI.

[36]  Anthony N. Pettitt,et al.  Variational Bayes for estimating the parameters of a hidden Potts model , 2009, Stat. Comput..

[37]  C. Dormann Effects of incorporating spatial autocorrelation into the analysis of species distribution data , 2007 .

[38]  Jay M. Ver Hoef,et al.  Coarse-Scale Distribution Surveys and Occurrence Probability Modeling for Wolverine in Interior Alaska , 2010 .

[39]  W. Qian,et al.  Estimation of parameters in hidden Markov models , 1991, Philosophical Transactions of the Royal Society of London. Series A: Physical and Engineering Sciences.

[40]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[41]  P. Green,et al.  Hidden Markov Models and Disease Mapping , 2002 .