Kriging and cross‐validation for massive spatial data

Spatial prediction such as kriging involves the inversion of a covariance matrix. When the number of locations is very large as in many studies, inversion of the covariance matrix may not be practical. Covariance tapering, predictive process models, and low rank kriging are some methods for overcoming the large matrix problem, all of which can be regarded as approximations to the underlying stationary process. Therefore, efficient cross‐validation is very helpful for spatial prediction with large data to assess how well an approximation works. This work studies the calculation of drop‐one prediction and various prediction scores. Estimators are constructed to minimize some prediction scores. One advantage of this approach is that it integrates estimation and cross‐validation and does not treat them as two separate procedures. Further simplification of calculation is studied that is based on the infill asymptotic theory. The methods are illustrated through the analysis of a US precipitation dataset. Copyright © 2009 John Wiley & Sons, Ltd.

[1]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[2]  Russell V. Lenth,et al.  Robust Measures of Location for Directional Data , 1981 .

[3]  Olivier Dubrule,et al.  Cross validation of kriging in a unique neighborhood , 1983 .

[4]  T. C. Haas,et al.  Lognormal and Moving Window Methods of Estimating Acid Deposition , 1990 .

[5]  Dale L. Zimmerman,et al.  A comparison of spatial semivariogram estimators and corresponding ordinary Kriging predictors , 1991 .

[6]  T. C. Haas,et al.  Local Prediction of a Spatio-Temporal Process with an Application to Wet Sulfate Deposition , 1995 .

[7]  D. Harville Matrix Algebra From a Statistician's Perspective , 1998 .

[8]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[9]  Michael L. Stein,et al.  The screening effect in Kriging , 2002 .

[10]  Hao Zhang Inconsistent Estimation and Asymptotically Equal Interpolations in Model-Based Geostatistics , 2004 .

[11]  Haotian Hang,et al.  Inconsistent Estimation and Asymptotically Equal Interpolations in Model-Based Geostatistics , 2004 .

[12]  David Higdon,et al.  A process-convolution approach to modelling temperatures in the North Atlantic Ocean , 1998, Environmental and Ecological Statistics.

[13]  D. Zimmerman,et al.  Towards reconciling two asymptotic frameworks in spatial statistics , 2005 .

[14]  D. Nychka,et al.  Covariance Tapering for Interpolation of Large Spatial Datasets , 2006 .

[15]  A. Raftery,et al.  Probabilistic forecasts, calibration and sharpness , 2007 .

[16]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[17]  D. Zimmerman,et al.  Hybrid Estimation of Semivariogram Parameters , 2007 .

[18]  M. Fuentes Approximate Likelihood for Large Irregularly Spaced Spatial Data , 2007, Journal of the American Statistical Association.

[19]  Hao Zhang,et al.  Covariance Tapering in Spatial Statistics , 2007 .

[20]  A. Gelfand,et al.  Gaussian predictive process models for large spatial data sets , 2008, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[21]  N. Cressie,et al.  Fixed rank kriging for very large spatial data sets , 2008 .

[22]  Michael L. Stein,et al.  A modeling approach for large spatial datasets , 2008 .

[23]  V. Mandrekar,et al.  Fixed-domain asymptotic properties of tapered maximum likelihood estimators , 2009, 0909.0359.