Bayesian nonstationary Gaussian process modeling: the BayesNSGP package for R

In spite of the diverse literature on nonstationary Gaussian process modeling, the software for implementing convolution-based methods is extremely limited, particularly for fully Bayesian analysis. To address this gap, here we present the BayesNSGP software package for R that enables off-the-shelf functionality for fully Bayesian, nonstationary Gaussian process modeling. Our approach to nonstationary modeling involves a closed-form, convolution-based covariance function with spatially-varying parameters; these parameter processes can be specified either deterministically (using covariates or basis functions) or stochastically (using approximate Gaussian processes). Stationary Gaussian processes are a special case of our methodology, and we furthermore implement approximate Gaussian process inference to account for very large spatial data sets. Bayesian inference is carried out using Markov chain Monte Carlo methods via the nimble package, which enables sampling of the highly correlated parameter spaces common to Gaussian process models (e.g., using adaptive block Metropolis-Hastings sampling). Posterior prediction for the Gaussian process at unobserved locations is also implemented as a straightforward post-processing step. As a demonstration, we use the package to analyze three data sets: first, the relatively small precipitation data set from Colorado that is used as a test case for related nonstationary methodology; second, a larger precipitation data set from a set of in situ measurements over the contiguous United States (with N ~ 1500 weather stations); and finally, a very large data set of return values for Boreal winter mean temperature at the surface derived from a general circulation model (with N ~ 50,000 grid cells).

[1]  Catherine A. Calder,et al.  A dynamic process convolution approach to modeling ambient particulate matter concentrations , 2008 .

[2]  David Higdon,et al.  A process-convolution approach to modelling temperatures in the North Atlantic Ocean , 1998, Environmental and Ecological Statistics.

[3]  Catherine A. Calder,et al.  Regression‐based covariance functions for nonstationary spatial modeling , 2014, 1410.1494.

[4]  P. Guttorp,et al.  Bayesian estimation of semi‐parametric non‐stationary spatial covariance structures , 2001 .

[5]  Catherine A. Calder,et al.  Local likelihood estimation for covariance functions with spatially-varying parameters: the convoSPAT package for R , 2015, 1507.08613.

[6]  Joseph Guinness,et al.  Permutation and Grouping Methods for Sharpening Gaussian Process Approximations , 2016, Technometrics.

[7]  Duncan Temple Lang,et al.  Programming With Models: Writing Statistical Algorithms for General Model Structures With NIMBLE , 2015, 1505.05093.

[8]  David Higdon,et al.  Non-Stationary Spatial Modeling , 2022, 2212.08043.

[9]  M. Katzfuss,et al.  A General Framework for Vecchia Approximations of Gaussian Processes , 2017, 1708.06302.

[10]  Anthony O'Hagan,et al.  Considering covariates in the covariance structure of spatial processes , 2011 .

[11]  W. Collins,et al.  An independent assessment of anthropogenic attribution statements for recent extreme temperature and rainfall events , 2017 .

[12]  Andrew O. Finley,et al.  Efficient Algorithms for Bayesian Nearest Neighbor Gaussian Processes , 2017, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[13]  Murali Haran,et al.  Automated Factor Slice Sampling , 2014, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[14]  Mark D. Risser,et al.  Review: Nonstationary Spatial Modeling, with Emphasis on Process Convolution and Covariate-Driven Approaches , 2016, 1610.02447.

[15]  Lewis R. Blake,et al.  Pushing the Limit: A Hybrid Parallel Implementation of the Multi-resolution Approximation for Massive Data , 2019 .

[16]  A. V. Vecchia Estimation and model identification for continuous spatial processes , 1988 .

[17]  Kurt Hornik,et al.  The Comprehensive R Archive Network , 2012 .

[18]  Mark J. Schervish,et al.  Nonstationary Covariance Functions for Gaussian Process Regression , 2003, NIPS.

[19]  Sudipto Banerjee,et al.  Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets , 2014, Journal of the American Statistical Association.

[20]  C. Wikle,et al.  Bayesian Hierarchical Models With Conjugate Full-Conditional Distributions for Dependent Data From the Natural Exponential Family , 2017, 1701.07506.

[21]  F. Lindgren,et al.  Exploring a New Class of Non-stationary Spatial Gaussian Random Fields with Varying Local Anisotropy , 2013, 1304.6949.

[22]  F. Lindgren,et al.  Spatial models with explanatory variables in the dependence structure , 2014 .

[23]  Joaquim H. Vianna Neto,et al.  Accounting for spatially varying directional effects in spatial covariance structures , 2012, 1209.5977.

[24]  Robert B. Gramacy,et al.  tgp: An R Package for Bayesian Nonstationary, Semiparametric Nonlinear Regression and Design by Treed Gaussian Process Models , 2007 .

[25]  P. Guttorp,et al.  Nonparametric Estimation of Nonstationary Spatial Covariance Structure , 1992 .

[26]  Bradley P Carlin,et al.  spBayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models. , 2007, Journal of statistical software.

[27]  Ying Sun,et al.  Efficient Estimation of Non-stationary Spatial Covariance Functions with Application to High-resolution Climate Model Emulation , 2019, Statistica Sinica.

[28]  A. O'Hagan,et al.  Bayesian inference for non‐stationary spatial covariance structure via spatial deformations , 2003 .

[29]  Michael F. Wehner,et al.  A basis set for exploration of sensitivity to prescribed ocean conditions for estimating human contributions to extreme weather in CAM5.1-1degree , 2018 .

[30]  Finn Lindgren,et al.  Bayesian Spatial Modelling with R-INLA , 2015 .

[31]  A. Gelfand,et al.  Gaussian predictive process models for large spatial data sets , 2008, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[32]  Eric P. Smith,et al.  An Introduction to Statistical Modeling of Extreme Values , 2002, Technometrics.

[33]  Montserrat Fuentes,et al.  A high frequency kriging approach for non‐stationary environmental processes , 2001 .

[34]  Matthias Katzfuss,et al.  Vecchia Approximations of Gaussian-Process Predictions , 2018, Journal of Agricultural, Biological and Environmental Statistics.

[35]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields; The SPDE approach , 2010 .

[36]  Peter D. Hoff,et al.  A Covariance Regression Model , 2011, 1102.5721.

[37]  Dorit Hammerling,et al.  A Case Study Competition Among Methods for Analyzing Large Spatial Data , 2017, Journal of Agricultural, Biological and Environmental Statistics.