Geostatistical interpolation using copulas

In many applications of geostatistical methods, the dependence structure of the investigated parameter is described solely with the variogram or covariance functions, which are susceptible to measurement anomalies and implies the assumption of Gaussian dependence. Moreover the kriging variance respects only observation density, data geometry and the variogram model. To address these problems, we borrow the idea from copulas, to depict the dependence structure without the influence of the marginal distribution. The methodology and basic hypotheses for application of copulas as geostatistical methods are discussed and the Gaussian copula as well as a non‐Gaussian copula are used in this paper. Copula parameters are estimated using a division of the observations into multipoint subsets and a subsequent maximization of the corresponding likelihood function. The interpolation is carried out with two different copulas, where the expected and median values are calculated from the copulas conditioned with the nearby observations. The full conditional copulas provide the estimation distributions for the unobserved locations and can be used to define confidence intervals which depend on both the observation geometry and values. Observations of a large scale groundwater quality measurement network in Baden‐Württemberg are used to demonstrate the methodology. Five groundwater quality parameters: chloride, nitrate, pH, sulfate and dissolved oxygen are investigated. All five parameters show non‐Gaussian dependence. The copula‐based interpolation results of the five parameters are compared to the results of conventional ordinary and indicator kriging. Different statistical measures including mean squared error, relative differences and probability scores are used to compare cross validation and split sampling results of the interpolation methods. The non‐Gaussian copulas give better results than the geostatistical interpolations. Validation of the confidence intervals shows that they are more realistic than the estimation variances obtained by ordinary kriging.

[1]  Pierre Duchesne,et al.  Statistical Modeling and Analysis for Complex Data Problems , 2010 .

[2]  M. Ward,et al.  Prediction of seasonal rainfall in the north nordeste of Brazil using eigenvectors of sea‐surface temperature , 2007 .

[3]  Anne-Catherine Favre,et al.  Importance of Tail Dependence in Bivariate Frequency Analysis , 2007 .

[4]  Carlo De Michele,et al.  Extremes in Nature : an approach using Copulas , 2007 .

[5]  A. Journel,et al.  The Necessity of a Multiple-Point Prior Model , 2007 .

[6]  A. Bárdossy Copula‐based geostatistical models for groundwater quality parameters , 2006 .

[7]  Bruno Rémillard,et al.  Dependence Properties of Meta-Elliptical Distributions , 2005 .

[8]  B. Bobée,et al.  Multivariate hydrological frequency analysis using copulas , 2004 .

[9]  Bruno Rémillard,et al.  Nonparametric weighted symmetry tests , 2003 .

[10]  S. Kotz,et al.  The Meta-elliptical Distributions with Given Marginals , 2002 .

[11]  P. Embrechts,et al.  Correlation and Dependency in Risk Management , 2002 .

[12]  P. Embrechts,et al.  Risk Management: Correlation and Dependence in Risk Management: Properties and Pitfalls , 2002 .

[13]  Yannick Malevergne,et al.  Testing the Gaussian copula hypothesis for financial assets dependences , 2001, cond-mat/0111310.

[14]  H. Joe Multivariate models and dependence concepts , 1998 .

[15]  J. Gómez-Hernández,et al.  To be or not to be multi-Gaussian? A reflection on stochastic hydrogeology , 1998 .

[16]  András Bárdossy,et al.  Interpolation of groundwater quality parameters using additional information. , 1997 .

[17]  F. Alabert,et al.  Non-Gaussian data expansion in the Earth Sciences , 1989 .

[18]  M. Sklar Fonctions de repartition a n dimensions et leurs marges , 1959 .