Identifying regions of inhomogeneities in spatial processes via an M‐RA and mixture priors

Soils have been heralded as a hidden resource that can be leveraged to mitigate and address some of the major global environmental challenges. Specifically, the organic carbon stored in soils, called Soil Organic Carbon (SOC), can, through proper soil management, help offset fuel emissions, increase food productivity, and improve water quality. As collecting data on SOC is costly and time consuming, not much data on SOC is available, although understanding the spatial variability in SOC is of fundamental importance for effective soil management. In this manuscript, we propose a modeling framework that can be used to gain a better understanding of the dependence structure of a spatial process by identifying regions within a spatial domain where the process displays the same spatial correlation range. To achieve this goal, we propose a generalization of the Multi-Resolution Approximation (M-RA) modeling framework of Katzfuss (2017) originally introduced as a strategy to reduce the computational burden encountered when analyzing massive spatial datasets. To allow for the possibility that the correlation of a spatial process might be characterized by a different range in different subregions of a spatial domain, we provide the M-RA basis functions weights with a two-component mixture prior with one of the mixture components a shrinking prior. We call our approach the mixture M-RA. Application of the mixture M-RA model to both stationary and non-stationary data shows that the mixture M-RA model can handle both types of data, can correctly establish the type of spatial dependence structure in the data (e.g. stationary vs not), and can identify regions of local stationarity. This article is protected by copyright. All rights reserved.

[1]  Adrian E. Raftery,et al.  [Practical Markov Chain Monte Carlo]: Comment: One Long Run with Diagnostics: Implementation Strategies for Markov Chain Monte Carlo , 1992 .

[2]  Paul D. Sampson,et al.  Constructions for Nonstationary Spatial Processes , 2010 .

[3]  Robert B. Gramacy,et al.  Ja n 20 08 Bayesian Treed Gaussian Process Models with an Application to Computer Modeling , 2009 .

[4]  Bo Li,et al.  Testing the covariance structure of multivariate random fields , 2008 .

[5]  Marc G. Genton,et al.  A test for stationarity of spatio-temporal random fields on planar and spherical domains , 2012 .

[6]  Matthias Katzfuss,et al.  A Multi-Resolution Approximation for Massive Spatial Datasets , 2015, 1507.04789.

[7]  Dean Phillips Foster,et al.  Calibration and Empirical Bayes Variable Selection , 1997 .

[8]  Candace Berrett,et al.  Nonstationary spatial prediction of soil organic carbon: Implications for stock assessment decision making , 2016, The Annals of Applied Statistics.

[9]  J. S. Rao,et al.  Spike and slab variable selection: Frequentist and Bayesian strategies , 2005, math/0505633.

[10]  Soutir Bandyopadhyay,et al.  A test for stationarity for irregularly spaced spatial data , 2017 .

[11]  F. Rekik,et al.  Soil Organic Carbon: the hidden potential , 2017 .

[12]  Jennifer A. Hoeting,et al.  A Review of Nonparametric Hypothesis Tests of Isotropy Properties in Spatial Data , 2015, 1508.05973.

[13]  Stuart Barber,et al.  Modelling spatial heterogeneity and discontinuities using Voronoi tessellations , 2018, 1802.05530.

[14]  T. J. Mitchell,et al.  Bayesian Variable Selection in Linear Regression , 1988 .

[15]  B. Mallick,et al.  Analyzing Nonstationary Spatial Data Using Piecewise Gaussian Processes , 2005 .

[16]  W. Dunsmuir,et al.  Estimation of nonstationary spatial covariance structure , 2002 .

[17]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[18]  V. Johnson,et al.  Bayesian Model Selection in High-Dimensional Settings , 2012, Journal of the American Statistical Association.

[19]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[20]  Catherine A. Calder,et al.  Local likelihood estimation for covariance functions with spatially-varying parameters: the convoSPAT package for R , 2015, 1507.08613.

[21]  E. George,et al.  APPROACHES FOR BAYESIAN VARIABLE SELECTION , 1997 .

[22]  Rattan Lal,et al.  Predicting Soil Organic Carbon Stock Using Profile Depth Distribution Functions and Ordinary Kriging , 2009 .

[23]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[24]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[25]  Montserrat Fuentes,et al.  A high frequency kriging approach for non‐stationary environmental processes , 2001 .

[26]  Bas van Wesemael,et al.  Regional assessment of soil organic carbon changes under agriculture in Southern Belgium (1955-2005) , 2007 .

[27]  Daniel Taylor-Rodriguez,et al.  Bayesian Variable Selection on Model Spaces Constrained by Heredity Conditions , 2013, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[28]  Sharon W. Waltman,et al.  Distribution of Soil Organic Carbon in the Conterminous United States , 2014 .

[29]  Yongtao Guan,et al.  A Nonparametric Test for Spatial Isotropy Using Subsampling , 2004 .

[30]  Bani K. Mallick,et al.  Adaptive Bayesian Nonstationary Modeling for Large Spatial Datasets Using Covariance Approximations , 2014 .

[31]  N. Narisetty,et al.  Bayesian variable selection with shrinking and diffusing priors , 2014, 1405.6545.

[32]  A. Gelfand,et al.  Gaussian predictive process models for large spatial data sets , 2008, Journal of the Royal Statistical Society. Series B, Statistical methodology.