Bayesian Spatial Binary Classification

In analyses of spatially-referenced data, researchers often have one of two goals: to quantify relationships between a response variable and covariates while accounting for residual spatial dependence or to predict the value of a response variable at unobserved locations. In this second case, when the response variable is categorical, prediction can be viewed as a classification problem. Many classification methods either ignore response-variable/covariate relationships and rely only on spatially proximate observations for classification, or they ignore spatial dependence and use only the covariates for classification. The Bayesian spatial generalized linear (mixed) model offers a tool to accommodate both spatial and covariate sources of information in classification problems. In this paper, we formally define spatial classification rules based on these models. We also take a close look at two of these models that have been proposed in the literature, namely the probit versions of the spatial generalized linear model (SGLM) and the Bayesian spatial generalized linear mixed model (SGLMM). We describe the implications of the seemingly slight differences between these models for spatial classification and explore the issue of robustness to model misspecification through a simulation study. We also provide an overview of alternatives to the SGLM/SGLMM-based classifiers and illustrate the various methods using satellite-derived land cover data from Southeast Asia.

[1]  Jennifer A. Hoeting,et al.  An Improved Model for Spatially Correlated Binary Responses , 2000 .

[2]  Murali Haran,et al.  Dimension reduction and alleviation of confounding for spatial generalized linear mixed models , 2010, 1011.6649.

[3]  C. Calder,et al.  The relationships between biomass burning, land-cover/-use change, and the distribution of carbonaceous aerosols in mainland Southeast Asia: a review and synthesis , 2008 .

[4]  Jennifer A. Hoeting,et al.  A clipped latent variable model for spatially correlated ordered categorical data , 2010, Comput. Stat. Data Anal..

[5]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[6]  K. Dučinskas,et al.  Linear Discriminant Analysis of Multivariate Spatial–Temporal Regressions , 2005 .

[7]  Christopher J. Paciorek,et al.  Computational techniques for spatial logistic regression with large data sets , 2007, Comput. Stat. Data Anal..

[8]  S. James Press,et al.  The Directional Neighborhoods Approach to Contextual Classification of Images from Noisy Data , 1996 .

[9]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[10]  Candace Berrett,et al.  Data augmentation strategies for the Bayesian spatial probit regression model , 2012, Comput. Stat. Data Anal..

[11]  V. D. Oliveira,et al.  Bayesian prediction of clipped Gaussian random fields , 2000 .

[12]  V. Zadnik,et al.  Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease‐Mapping Models , 2006, Biometrics.

[13]  P. Diggle,et al.  Model‐based geostatistics , 2007 .

[14]  Candace Berrett Bayesian Probit Regression Models for Spatially-Dependent Categorical Data , 2010 .

[15]  K. Zografos,et al.  Errors of misclassification in discrimination of dimensional coherent elliptic random field observations , 2011 .

[16]  Mevin B. Hooten,et al.  Restricted spatial regression in practice: geostatistical models, confounding, and robustness under model misspecification , 2015 .

[17]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[18]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[19]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .

[20]  Kanti V. Mardia,et al.  Spatial discrimination and classification maps , 1984 .

[21]  Jennifer A. Hoeting,et al.  Multilevel Latent Gaussian Process Model for Mixed Discrete and Continuous Multivariate Response Data , 2013 .

[22]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[23]  S. J. Press,et al.  Adaptive Bayesian Classification of Spatial Data , 1992 .

[24]  D. V. Dyk,et al.  A Bayesian analysis of the multinomial probit model using marginal data augmentation , 2005 .

[25]  A. Gelfand,et al.  Gaussian predictive process models for large spatial data sets , 2008, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[26]  Paul Switzer,et al.  Extensions of linear discriminant analysis for statistical classification of remotely sensed satellite imagery , 1980 .

[27]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .