Transferability, accuracy, and uncertainty assessment of different knowledge-based approaches for soil types mapping

Abstract Soil legacy data are important sources of soil information, especially when dealing with limited resources. In countries with high geographical diversity and few financial resources, such as Brazil, they represent an economical alternative to obtaining soil spatial information in higher resolution. By retrieving the soil scientist's knowledge, it can be used as guidance for knowledge-based digital soil mapping approaches. In this sense, this work aimed to evaluate Rule-Based Reasoning and Case-Based Reasoning knowledge-based approaches to predict soil types up to the third categorical level (U.S Soil Taxonomy) in a non-sampled area, by retrieving and then extrapolating the information of a detailed soil legacy map, from a reference area. The study was carried out in Minas Gerais state, Southeastern Brazil. The methodology includes three main steps: i) knowledge acquisition; ii) soil inference; and iii) accuracy and uncertainty assessment. For the validation, 23 independent samples were chosen by means of the Regional Random method, and the accuracy was assessed by Kappa index, Overall Accuracy, Users', and Producers' Accuracy. The uncertainty was evaluated through entropy and exaggeration. A total of 24 inference models were obtained with the Case-Based Reasoning approach, in which the best model had an overall accuracy of 61% and a Kappa index of 0.52. The Rule-based reasoning approach performed better, with an overall accuracy of 82% and 0.75 for Kappa index. These approaches generated a higher accuracy soil map for an unmapped area that was 15 times larger than the reference area and at lower cost.

[1]  Larry Biehl,et al.  Use of combined digital elevation model and satellite radiometric data for regional soil mapping , 2000 .

[2]  A-Xing Zhu,et al.  Automated soil inference under fuzzy logic , 1996 .

[3]  Panos Panagos,et al.  European digital archive on soil maps (EuDASM): preserving important soil data for public free access , 2011, Int. J. Digit. Earth.

[4]  Feng Qi,et al.  Comparing three methods for modeling the uncertainty in knowledge discovery from area-class soil maps , 2011, Comput. Geosci..

[5]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[6]  R. J. Pike,et al.  Automated classifications of topography from DEMs by an unsupervised nested-means algorithm and a three-part geometric signature , 2007 .

[7]  Murray Woods,et al.  GIS-fuzzy logic based approach in modeling soil texture: Using parts of the Clay Belt and Hornepayne region in Ontario Canada as a case study , 2015 .

[8]  P. Owens,et al.  Retrieving pedologist's mental model from existing soil map and comparing data mining tools for refining a larger area map under similar environmental conditions in Southeastern Brazil , 2016 .

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  A-Xing Zhu,et al.  A case-based reasoning approach to fuzzy soil mapping , 2002 .

[11]  B. Hudson,et al.  The Soil Survey as Paradigm-based Science , 1992 .

[12]  A-Xing Zhu,et al.  Measuring Uncertainty in Class Assignment for Natural Resource Maps under Fuzzy Logic , 1997 .

[13]  Michele Duarte de Menezes,et al.  Mapping soils in two watersheds using legacy data and extrapolation for similar surrounding areas , 2016 .

[14]  Elisabeth N. Bui,et al.  Soil survey as a knowledge system , 2004 .

[15]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[16]  Koreen Millard,et al.  On the Importance of Training Data Sample Selection in Random Forest Image Classification: A Case Study in Peatland Ecosystem Mapping , 2015, Remote. Sens..

[17]  I. Nonaka,et al.  A firm as a knowledge-creating entity: a new perspective on the theory of the firm , 2000 .

[18]  A-Xing Zhu,et al.  A similarity model for representing soil spatial information , 1997 .

[19]  J.-C. Favrot,et al.  Une stratégie d'inventaire cartographique à grande échelle: la méthode des secteurs de référence , 1989 .

[20]  J. Gallant,et al.  A multiresolution index of valley bottom flatness for mapping depositional areas , 2003 .

[21]  Margaret G. Schmidt,et al.  Semi-automated classification of exposed bedrock cover in British Columbia's Southern Mountains using a Random Forest approach , 2017 .

[22]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[23]  Sandra Rothenberg,et al.  Tacit knowledge transfer and the knowledge disconnect , 2006, J. Knowl. Manag..

[24]  P. Gessler,et al.  Terrain Attribute Modeling of Volcanic Ash Distributions in Northern Idaho , 2012 .

[25]  Johan Bouma,et al.  Using Soil Survey Data for Quantitative Land Evaluation , 1989 .

[26]  N. Curi,et al.  Levantamento pedológico e sistema de informações geográficas naavaliação do uso das terras em sub-bacia hidrográfica de Minas Gerais , 2009 .

[27]  Xun Shi,et al.  Integrating different types of knowledge for digital soil mapping. , 2009 .

[28]  Tao Pei,et al.  Construction of membership functions for predictive soil mapping under fuzzy logic , 2010 .

[29]  A-Xing Zhu,et al.  Fuzzy soil mapping based on prototype category theory , 2006 .

[30]  Jack Chertok Conservação do solo , 2008 .

[31]  K. Beven,et al.  A physically based, variable contributing area model of basin hydrology , 1979 .

[32]  D. P. Franzmeier,et al.  Toposequence of Oxisols from the Central Plateau of Brazil1 , 1984 .

[33]  J. McKay,et al.  Evaluation of the Transferability of a Knowledge-Based Soil-Landscape Model , 2010 .

[34]  Gerard B. M. Heuvelink,et al.  Efficiency comparison of conventional and digital soil mapping for updating soil maps , 2012 .

[35]  Elvio Giasson,et al.  Instance selection in digital soil mapping: a study case in Rio Grande do Sul, Brazil , 2015 .

[36]  L. F. Ruiz,et al.  Digital soil mapping: strategy for data pre-processing , 2012 .

[37]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[38]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[39]  Margaret G. Schmidt,et al.  Comparing the use of training data derived from legacy soil pits and soil survey polygons for mapping soil classes , 2017 .

[40]  H. Jenny,et al.  Factors of Soil Formation , 1941 .

[41]  Anònim Anònim Keys to Soil Taxonomy , 2010 .

[42]  R. M. Neto AS SUPERFÍCIES GEOMORFOLÓGICAS E A EVOLUÇÃO DO RELEVO BRASILEIRO: transcurso das ideias e correspondências no sul de Minas Gerais, sudeste do Brasil , 2014 .

[43]  Michael Bock,et al.  System for Automated Geoscientific Analyses (SAGA) v. 2.1.4 , 2015 .

[44]  R. Rizzo,et al.  Pedological mapping through integration of digital terrain models spectral sensing and photopedology , 2015 .

[45]  A-Xing Zhu,et al.  Soil Mapping Using GIS, Expert Knowledge, and Fuzzy Logic , 2001 .

[46]  R. Morgan Soil Erosion and Conservation , 1988 .

[47]  Haohuan Fu,et al.  Exploring the potential role of feature selection in global land-cover mapping , 2016 .

[48]  Michele Duarte de Menezes,et al.  Digital soil mapping approach based on fuzzy logic and field expert knowledge , 2013 .

[49]  A. Zhu,et al.  An expert knowledge-based approach to landslide susceptibility mapping using GIS and fuzzy logic , 2014 .

[50]  Chris Moran,et al.  A strategy to fill gaps in soil survey over large spatial extents: an example from the Murray-Darling basin of Australia , 2003 .

[51]  A. Zhu,et al.  Derivation of Soil Properties Using a Soil Land Inference Model (SoLIM) , 1997 .

[52]  Estimativa do benefício econômico potencial de dois levantamentos de solos no Estado do Rio Grande do Sul , 2006 .

[53]  Andrew K. Skidmore,et al.  Use of an expert system to map forest soils from a geographical information system , 1991, Int. J. Geogr. Inf. Sci..

[54]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[55]  László Pásztor,et al.  Compilation of functional soil maps for the support of spatial planning and land management in Hungary , 2017 .