Combining spatial transition probabilities for stochastic simulation of categorical fields

Categorical spatial data, such as land use classes and socioeconomic statistics data, are important data sources in geographical information science (GIS). The investigation of spatial patterns implied in these data can benefit many aspects of GIS research, such as classification of spatial data, spatial data mining, and spatial uncertainty modeling. However, the discrete nature of categorical data limits the application of traditional kriging methods widely used in Gaussian random fields. In this article, we present a new probabilistic method for modeling the posterior probability of class occurrence at any target location in space-given known class labels at source data locations within a neighborhood around that prediction location. In the proposed method, transition probabilities rather than indicator covariances or variograms are used as measures of spatial structure and the conditional or posterior (multi-point) probability is approximated by a weighted combination of preposterior (two-point) transition probabilities, while accounting for spatial interdependencies often ignored by existing approaches. In addition, the connections of the proposed method with probabilistic graphical models (Bayesian networks) and weights of evidence method are also discussed. The advantages of this new proposed approach are analyzed and highlighted through a case study involving the generation of spatial patterns via sequential indicator simulation.

[1]  Clayton V. Deutsch,et al.  Correcting for negative weights in ordinary kriging , 1996 .

[2]  Mark E. Johnson,et al.  Multivariate Statistical Simulation , 1989, International Encyclopedia of Statistical Science.

[3]  Jef Caers,et al.  Sequential simulation with patterns , 2005 .

[4]  Jef Caers,et al.  Feature-Based Geostatistics: An Application to a Submarine Channel Reservoir , 2002 .

[5]  G. Bonham-Carter Geographic Information Systems for Geoscientists: Modelling with GIS , 1995 .

[6]  G. Fogg,et al.  Modeling Spatial Variability with One and Multidimensional Continuous-Lag Markov Chains , 1997 .

[7]  Mehran Sahami,et al.  Learning Limited Dependence Bayesian Classifiers , 1996, KDD.

[8]  A. Journel Combining Knowledge from Diverse Sources: An Alternative to Traditional Data Independence Hypotheses , 2002 .

[9]  X. Emery Properties and limitations of sequential indicator simulation , 2004 .

[10]  Michael F. Goodchild,et al.  Development and test of an error model for categorical data , 1992, Int. J. Geogr. Inf. Sci..

[11]  Donato Posa,et al.  Characteristic behavior and order relations for indicator variograms , 1990 .

[12]  Weidong Li,et al.  Markov Chain Random Fields for Estimation of Categorical Variables , 2007 .

[13]  Li Weidong Transiogram: A spatial relationship measure for categorical data , 2006 .

[14]  R. Bordley A Multiplicative Formula for Aggregating Probability Assessments , 1982 .

[15]  G. Fogg,et al.  Transition probability-based indicator geostatistics , 1996 .

[16]  Michael F. Goodchild,et al.  Statistical Perspectives on Geographic Information Science , 2008 .

[17]  Sebastien Strebelle,et al.  Conditional Simulation of Complex Geological Structures Using Multiple-Point Statistics , 2002 .

[18]  A. Raftery,et al.  The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[19]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[20]  S. Krishnan The Tau Model for Data Redundancy and Information Combination in Earth Sciences: Theory and Application , 2008 .

[21]  Eamonn J. Keogh,et al.  Learning augmented Bayesian classifiers: A comparison of distribution-based and classification-based approaches , 1999, AISTATS.

[22]  A. Raftery A model for high-order Markov chains , 1985 .

[23]  Alexandre Boucher,et al.  Super-resolution land cover mapping with indicator geostatistics , 2006 .

[24]  Jon Atli Benediktsson,et al.  Consensus theoretic classification methods , 1992, IEEE Trans. Syst. Man Cybern..

[25]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  R. M. Srivastava,et al.  Multivariate Geostatistics: Beyond Bivariate Moments , 1993 .

[27]  P. Bogaert Spatial prediction of categorical variables: the Bayesian maximum entropy approach , 2002 .

[28]  Andrew R. Solow,et al.  Mapping by simple indicator kriging , 1986 .

[29]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[30]  Mark E. Johnson Multivariate Statistical Simulation: Johnson/Multivariate , 1987 .

[31]  Clayton V. Deutsch,et al.  GSLIB: Geostatistical Software Library and User's Guide , 1993 .

[32]  Guofeng Cao,et al.  Combining Transition Probabilities in the Prediction and Simulation of Categorical Fields , 2008 .