Incorporating taxonomic distance into spatial prediction and digital mapping of soil classes

Abstract Digital mapping of soil classes generally starts with soil profile descriptions in which a soil class has been observed at a given taxonomic level of a particular classification system. Each soil observation location has a set of co-located environmental variables, and the challenge is to relate the soil class to these variables. Current methodology treats soil classes as ‘labels’, and prediction considers only the minimisation of misclassification error. However, soil classes at any taxonomic level are taxonomically related to one another, and in some instances errors in predicting certain classes are more serious than others. So far, no statistical procedure has been used to account for these relationships. This paper shows that the taxonomic distance between soil classes can be incorporated into a supervised classification routine for digital mapping of soil classes. Using classification trees, we specify an algorithm that minimises taxonomic distance rather than misclassification error. Two examples are given for mapping soil orders in the Australian Soil Classification system. At a site in the Edgeroi area, the method that minimises taxonomic distance showed an advantage. At a site in the Hunter Valley, minimising misclassification error performed similarly to minimising taxonomic distance. The advantages and challenges of using soil taxonomic distance are discussed.
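
To make the contrast between the two criteria concrete, the following minimal sketch labels a single classification-tree leaf in two ways: by majority vote (minimising misclassification error) and by choosing the class with the smallest mean taxonomic distance to the observations in the leaf. It is an illustration under stated assumptions, not the paper's implementation: the class names, the distance matrix `DIST`, the leaf counts, and all function names are hypothetical, and the abstract does not say how distances enter the tree-growing step.

```python
# Hypothetical class names and taxonomic distances (illustrative values only,
# not taken from the paper). DIST[j][k] is the distance between class j and k.
CLASSES = ["order_A", "order_B", "order_C"]
DIST = [
    [0.0, 1.0, 1.0],
    [1.0, 0.0, 0.2],
    [1.0, 0.2, 0.0],
]


def misclassification_label(counts):
    """Return the index of the majority class in a leaf."""
    return max(range(len(counts)), key=lambda k: counts[k])


def expected_distance(counts, dist, k):
    """Mean taxonomic distance incurred if every observation is labelled k."""
    n = sum(counts)
    return sum(c * dist[j][k] for j, c in enumerate(counts)) / n


def taxonomic_label(counts, dist):
    """Return the index of the class with the smallest mean taxonomic distance."""
    return min(range(len(counts)),
               key=lambda k: expected_distance(counts, dist, k))


# Hypothetical leaf: 4 observations of order_A, 3 of order_B, 2 of order_C.
leaf_counts = [4, 3, 2]

majority = misclassification_label(leaf_counts)
closest = taxonomic_label(leaf_counts, DIST)

print("majority label:", CLASSES[majority])
print("minimum-distance label:", CLASSES[closest])
```

In this made-up leaf the two criteria disagree: order_A is the most frequent class, but order_B and order_C are taxonomically close to each other and together outnumber it, so labelling the leaf order_B gives a smaller mean taxonomic distance. When the distance matrix is 0 on the diagonal and 1 elsewhere, the two criteria coincide.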

[1]  Budiman Minasny,et al.  On digital soil mapping , 2003 .

[2]  Anne Gobin,et al.  Logistic Modeling to Spatially Predict the Probability of Soil Drainage Classes , 2002 .

[3]  F. Carré,et al.  Quantitative mapping of soil types based on regression kriging of taxonomic distances with landform and land cover attributes , 2002 .

[4]  Philippe Lagacherie,et al.  Addressing Geographical Data Errors in a Classification Tree for Soil Unit Prediction , 1997, Int. J. Geogr. Inf. Sci..

[5]  Chris Moran,et al.  A strategy to fill gaps in soil survey over large spatial extents: an example from the Murray-Darling basin of Australia , 2003 .

[6]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  Alex B. McBratney,et al.  Fuzzy‐c‐Means and Kriging for Mapping Soil as a Continuous System , 1992 .

[9]  Tomislav Hengl,et al.  Methods to interpolate soil categorical variables from profile observations: Lessons from Iran , 2007 .

[10]  Elisabeth N. Bui,et al.  Soil survey as a knowledge system , 2004 .

[11]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[12]  R. Isbell Australian Soil Classification , 1996 .

[13]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[14]  Budiman Minasny,et al.  Prediction and digital mapping of soil carbon storage in the Lower Namoi Valley , 2006 .

[15]  Budiman Minasny,et al.  On measuring pedodiversity , 2007 .

[16]  J. Beek,et al.  Developments in Soil Science , 2019, Global Change and Forest Soils.

[17]  James C. Bell,et al.  Calibration and Validation of a Soil-Landscape Model for Predicting Soil Drainage Class , 1992 .

[18]  G. Bragato Fuzzy continuous classification and spatial interpolation in conventional soil survey for soil mapping of the lower Piave plain , 2004 .

[19]  Francis D. Hole,et al.  An experiment in ordination of some soil profiles. , 1960 .

[20]  Michael J. Pazzani,et al.  Reducing Misclassification Costs , 1994, ICML.

[21]  R. Isbell,et al.  Concepts and rationale of the Australian soil classification. , 1997 .

[22]  A. Zhu Mapping soil landscape as spatial continua: The Neural Network Approach , 2000 .

[23]  Philippe Lagacherie,et al.  An algorithm for fuzzy pattern matching to allocate soil individuals to pre-existing soil classes , 2005 .

[24]  A. McBratney,et al.  A continuum approach to soil classification by modified fuzzy k‐means with extragrades , 1992 .

[25]  Elisabeth N. Bui,et al.  Spatial data mining for enhanced soil map modelling , 2002, Int. J. Geogr. Inf. Sci..

[26]  Alex B. McBratney,et al.  A fuzzy allocation scheme for the Australian Great Soil Groups Classification system , 1995 .