An algorithm for spatially constrained classification of categorical and continuous soil properties

The creation of spatially contiguous clusters from multiple sources of information is an important objective in many scientific disciplines. We describe a supervised spatial classification algorithm in which the primary goal is to stratify a landscape or other spatial data into a set of discrete patches based on available categorical and continuous data. The proposed method consists of four major steps: (1) data pre-processing, (2) initial allocation through spatially constrained cluster analysis, (3) spatial aggregation and peripheral re-allocation of individuals, and (4) an optional hierarchical or non-hierarchical classification to summarize individual patches in few classes. Parameters needed are the number of spatial clusters, the minimum size of a cluster and the weight assigned to the categorical variable(s). The proposed method was sensitive to the spatial association between continuous and categorical variables as well as to the spatial correlation structure of the continuous variables. Both determine the optimum weight assigned to the categorical information and, therefore, the effectiveness of the classification. The proposed clustering method was robust in terms of finding natural clusters, allows using categorical and continuous variables, and offers a reasonable compromise between geographic contiguity and class homogeneity. Possibilities for further modifications are outlined.

[1]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[2]  Alan Wee-Chung Liew,et al.  Fuzzy image clustering incorporating spatial continuity , 2000 .

[3]  M. Fortin,et al.  Spatial pattern and ecological analysis , 1989, Vegetatio.

[4]  M. Roubens Fuzzy clustering algorithms and their cluster validity , 1982 .

[5]  R. Reese Geostatistics for Environmental Scientists , 2001 .

[6]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[7]  Denis Marcotte,et al.  Multivariable variogram and its application to the linear model of coregionalization , 1991 .

[8]  Paul M. Mather,et al.  Computer Processing of Remotely-Sensed Images: An Introduction , 1988 .

[9]  Achim Dobermann,et al.  Sampling optimization based on secondary information and its utilization in soil carbon mapping , 2006 .

[10]  Stan Openshaw,et al.  A geographical solution to scale and aggregation problems in region-building, partitioning and spatial modelling , 1977 .

[11]  Philip Lewis,et al.  Geostatistical classification for remote sensing: an introduction , 2000 .

[12]  P A Burrough,et al.  COMPUTER‐BASED SOIL MAPPING OF SMALL AREAS FROM SAMPLE DATA , 1972 .

[13]  P. Goovaerts,et al.  Delineation of estuarine management areas using multivariate geostatistics: the case of Sado Estuary. , 2003, Environmental science & technology.

[14]  R. Webster,et al.  Statistical Methods in Soil and Land Resource Survey. , 1990 .

[15]  G. Raines Description and comparison of geologic maps with FRAGSTATS: a spatial statistics program , 2002 .

[16]  K. McGarigal,et al.  FRAGSTATS: spatial pattern analysis program for quantifying landscape structure. , 1995 .

[17]  Achim Dobermann,et al.  Creating Spatially Contiguous Yield Classes for Site‐Specific Management , 2003 .

[18]  R. Webster,et al.  A geostatistical basis for spatial weighting in multivariate classification , 1989 .

[19]  Timothy C. Coburn,et al.  Geostatistics for Natural Resources Evaluation , 2000, Technometrics.

[20]  R. M. Lark,et al.  Forming Spatially Coherent Regions by Classification of Multi-Variate Data: An Example from the Analysis of Maps of Crop Yield , 1998, Int. J. Geogr. Inf. Sci..

[21]  Dzung L. Pham,et al.  Spatial Models for Fuzzy Clustering , 2001, Comput. Vis. Image Underst..

[22]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[23]  Denis Marcotte,et al.  The multivariate (co)variogram as a spatial weighting function in classification methods , 1992 .

[24]  A. Ferligoj,et al.  Clustering with relational constraint , 1982 .

[25]  Pierre Goovaerts,et al.  Fine-resolution mapping of soil organic carbon based on multivariate secondary data , 2006 .

[26]  J. Reynolds,et al.  A new contagion index to quantify spatial patterns of landscapes , 1993, Landscape Ecology.