Equivalent kriging

Most modern spatially indexed datasets are very large, with sizes commonly ranging from tens of thousands to millions of locations. Spatial analysis often focuses on spatial smoothing using the geostatistical technique known as kriging. Kriging requires covariance matrix computations whose complexity scales with the cube of the number of spatial locations, making analysis infeasible or impossible with large datasets. We introduce an approach to kriging in the presence of large datasets called equivalent kriging, which relies on approximating the krigingweight function using an equivalent kernel, requiring presence of a nontrivial nugget effect. Resulting kriging calculations are extremely fast and feasible in the presence of massive spatial datasets. We derive closed form kriging approximations formultiresolution classes of spatial processes, as well as under any stationary model, including popular choices such as theMatérn. The theoretical justification for equivalent kriging also leads to a correction term for irregularly spaced observations that also reduces edge effects near the domain boundary. For large sample sizes, equivalent kriging is shown to outperform covariance tapering in an example. Equivalent kriging is additionally illustrated on multiple simulated datasets, and a monthly average precipitation dataset whose size prohibits traditional geostatistical approaches. © 2015 Elsevier B.V. All rights reserved. ∗ Corresponding author. E-mail address:william.kleiber@colorado.edu (W. Kleiber). http://dx.doi.org/10.1016/j.spasta.2015.01.004 2211-6753/© 2015 Elsevier B.V. All rights reserved. 32 W. Kleiber, D.W. Nychka / Spatial Statistics 12 (2015) 31–49

[1]  M. Wand Fast Computation of Multivariate Kernel Estimators , 1994 .

[2]  J. Andrew Royle,et al.  Multiresolution models for nonstationary spatial covariance functions , 2002 .

[3]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[4]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[5]  M. Anitescu,et al.  STOCHASTIC APPROXIMATION OF SCORE FUNCTIONS FOR GAUSSIAN PROCESSES , 2013, 1312.2687.

[6]  Philip J. Bones,et al.  Algorithms to numerically evaluate the Hankel transform , 1993 .

[7]  M. Fuentes Approximate Likelihood for Large Irregularly Spaced Spatial Data , 2007, Journal of the American Statistical Association.

[8]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields; The SPDE approach , 2010 .

[9]  Jianhua Z. Huang,et al.  A full scale approximation of covariance functions for large spatial data sets , 2012 .

[10]  D. Cox Asymptotics for $M$-Type Smoothing Splines , 1983 .

[11]  W. H. Young On Multiple Integration by Parts and the Second Theorem of the Mean , 1917 .

[12]  Jerzy Zabczyk,et al.  Stochastic Partial Differential Equations with Lévy Noise: An Evolution Equation Approach , 2007 .

[13]  Douglas W. Nychka,et al.  Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets , 2008 .

[14]  D. Nychka,et al.  A Multiresolution Gaussian Process Model for the Analysis of Large Spatial Datasets , 2015 .

[15]  N. Cressie,et al.  Fixed rank kriging for very large spatial data sets , 2008 .

[16]  A. Siegman Quasi fast Hankel transform. , 1977, Optics letters.

[17]  A. Gelfand,et al.  Gaussian predictive process models for large spatial data sets , 2008, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[18]  B. Silverman,et al.  Spline Smoothing: The Equivalent Variable Kernel Method , 1984 .

[19]  Ying Sun,et al.  Geostatistics for Large Datasets , 2012 .

[20]  Michael L. Stein,et al.  A modeling approach for large spatial datasets , 2008 .

[21]  D. Nychka,et al.  Covariance Tapering for Interpolation of Large Spatial Datasets , 2006 .

[22]  Doug Nychka,et al.  A framework to understand the asymptotic properties of Kriging and splines , 2007 .

[23]  Michael L. Stein,et al.  Limitations on low rank approximations for covariance matrices of spatial data , 2014 .

[24]  Douglas W. Nychka,et al.  Splines as Local Smoothers , 1995 .

[25]  Grace Wahba,et al.  Spline Models for Observational Data , 1990 .

[26]  Kerry Key,et al.  Is the fast Hankel transform faster than quadrature , 2012 .

[27]  Biao Zhang,et al.  Kernel approximations for universal kriging predictors , 1993 .

[28]  Andrew O. Finley,et al.  Improving the performance of predictive process modeling for large datasets , 2009, Comput. Stat. Data Anal..

[29]  Craig J. Johns,et al.  Infilling Sparse Records of Spatial Fields , 2003 .

[30]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach , 2011 .

[31]  Marc G. Genton,et al.  Correlation Models for Temperature Fields , 2011 .

[32]  J. Chilès,et al.  Geostatistics: Modeling Spatial Uncertainty , 1999 .