Sensitivity of digital soil maps based on FCM to the fuzzy exponent and the number of clusters

Fuzzy c-means clustering (FCM) has been used frequently in digital soil mapping. One of the key issues in applying FCM is the determination of the appropriate classification parameters of the fuzzy exponent (m) and the number of clusters (c). To determine the optimal selection of appropriate m and c values, in this study, we first used two simulated datasets to demonstrate the sensitivity of three commonly used validity functions to m and c. These two simulated datasets contained overlapping clusters and hierarchical clusters, respectively. The three studied validity functions were fuzzy performance index (FPI), compactness and separation (S) and a derivative of the objective function with respect to the fuzzy exponent (-[(delta J(E)/delta m)c(0.5)]). Then, a case study mapping soil organic matter (SOM) based on memberships from FCM clustering terrain attributes was conducted to investigate the sensitivity of soil maps to m and c. The results of the study on the simulated datasets showed that the three validity functions were sensitive in differing degrees to the structures of the clustered datasets under a wide range of m, but the sensitivities and the range of m were different for different validity functions and depended on the clustered datasets. The results from the case study of the soil mapping showed that soil maps based on FCM clustering were sensitive to m and c, but only the spatial variations of SOM presented on the maps were significantly sensitive to c. Furthermore, mapping accuracy was slightly sensitive to m and c. It is concluded that there was a range of optimal m over which digital soil maps did not change very much, but this was not certain for c, given that the spatial variation presented on the maps changed significantly with c. (C) 2011 Elsevier B.V. All rights reserved.

[1]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Yuanfang Huang,et al.  Spatial prediction of soil organic matter in the presence of different external trends with REML-EBLUP , 2008 .

[3]  P. Lagacherie,et al.  Fuzzy k-means clustering of fields in an elementary catchment and extrapolation to a larger area , 1997 .

[4]  A. McBratney,et al.  Application of fuzzy sets in soil science: fuzzy logic, fuzzy measurements and fuzzy decisions , 1997 .

[5]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[6]  Alex B. McBratney,et al.  Soil pattern recognition with fuzzy-c-means : application to classification and soil-landform interrelationships , 1992 .

[7]  A. Ramachandra Rao,et al.  Regional flood frequency analysis by combining self-organizing feature map and fuzzy clustering , 2008 .

[8]  Jie Chen,et al.  Spatial Prediction of Heavy Metal Pollution for Soils in Peri-Urban Beijing, China Based on Fuzzy Set Theory , 2006 .

[9]  A. McBratney,et al.  A continuum approach to soil classification by modified fuzzy k‐means with extragrades , 1992 .

[10]  Nader Fathianpour,et al.  Continuous soil pollution mapping using fuzzy logic and spatial interpolation , 2005 .

[11]  Miklos Feher,et al.  Fuzzy Clustering as a Means of Selecting Representative Conformers and Molecular Alignments , 2003, J. Chem. Inf. Comput. Sci..

[12]  R. M. Lark,et al.  Soil–landform relationships at within-field scales: an investigation using continuous classification , 1999 .

[13]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[14]  Gan‐Lin Zhang,et al.  Application of a Digital Soil Mapping Method in Producing Soil Orders on Mountain Areas of Hong Kong Based on Legacy Soil Data , 2011 .

[15]  Alex B. McBratney,et al.  Application of fuzzy sets to climatic classification , 1985 .

[16]  Gerard B. M. Heuvelink,et al.  Sampling for validation of digital soil maps , 2011 .

[17]  Tao Pei,et al.  Construction of membership functions for predictive soil mapping under fuzzy logic , 2010 .

[18]  S. de Bruin,et al.  Soil-landscape modelling using fuzzy c-means clustering of attribute data derived from a Digital Elevation Model (DEM) , 1998 .

[19]  Chenghu Zhou,et al.  Quantification of spatial gradation of slope positions , 2009 .

[20]  John Triantafilis,et al.  Digital Soil-Class Mapping from Proximal and Remotely Sensed Data at the Field Level , 2009 .

[21]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[22]  R. Lark,et al.  On spatial prediction of soil properties in the presence of a spatial trend: the empirical best linear unbiased predictor (E‐BLUP) with REML , 2006 .