Spatial data analytics provides new opportunities for automated detection of anomalous data for data quality control and subsurface segmentation to reduce uncertainty in spatial models. Solely data-driven anomaly detection methods do not fully integrate spatial concepts such as spatial continuity and data sparsity. Also, data-driven anomaly detection methods are challenged in integrating critical geoscience and engineering expertise knowledge. The proposed spatial anomaly detection method is based on the semivariogram spatial continuity model derived from sparsely sampled well data and geological interpretations. The method calculates the lag joint cumulative probability for each matched pair of spatial data, given their lag vector and the semivariogram under the assumption of bivariate Gaussian distribution. For each combination of paired spatial data, the associated head and tail Gaussian standardized values of a pair of spatial data are mapped to the joint probability density function informed from the lag vector and semivariogram. The paired data are classified as anomalous if the associated head and tail Gaussian standardized values fall within a low probability zone. The anomaly decision threshold can be decided based on a loss function quantifying the cost of overestimation or underestimation. The proposed spatial correlation anomaly detection method is able to integrate domain expertise knowledge through trend and correlogram models with sparse spatial data to identify anomalous samples, region, segmentation boundaries, or facies transition zones. This is a useful automation tool for identifying samples in big spatial data on which to focus professional attention.
[1]
VARUN CHANDOLA,et al.
Anomaly detection: A survey
,
2009,
CSUR.
[2]
Felix Naumann,et al.
Data fusion
,
2009,
CSUR.
[3]
Clayton V. Deutsch,et al.
Geostatistical Reservoir Modeling
,
2002
.
[4]
Shashi Shekhar,et al.
Detecting graph-based spatial outliers: algorithms and applications (a summary of results)
,
2001,
KDD '01.
[5]
A. Raftery,et al.
Nearest-Neighbor Clutter Removal for Estimating Features in Spatial Point Processes
,
1998
.
[6]
Clayton V. Deutsch,et al.
GSLIB: Geostatistical Software Library and User's Guide
,
1993
.
[7]
Michael Edward Hohn,et al.
An Introduction to Applied Geostatistics: by Edward H. Isaaks and R. Mohan Srivastava, 1989, Oxford University Press, New York, 561 p., ISBN 0-19-505012-6, ISBN 0-19-505013-4 (paperback), $55.00 cloth, $35.00 paper (US)
,
1991
.
[8]
Sanjay Chawla,et al.
SLOM: a new measure for local spatial outliers
,
2006,
Knowledge and Information Systems.
[9]
Donato Posa,et al.
Characteristic behavior and order relations for indicator variograms
,
1990
.
[10]
Sanjay Chawla,et al.
On local spatial outliers
,
2004,
Fourth IEEE International Conference on Data Mining (ICDM'04).