Comparing three methods for modeling the uncertainty in knowledge discovery from area-class soil maps

Knowledge discovery has been demonstrated as an effective approach to extracting knowledge from existing data sources for soil classification and mapping. Soils are spatial entities with fuzzy boundaries. Our study focuses on the uncertainty associated with class assignments when classifying such entities. We first present a framework of knowledge representation for categorizing spatial entities with fuzzy boundaries. Three knowledge discovery methods are discussed next for extracting knowledge from data sources. The methods were designed to maintain information for modeling the uncertainties associated with class assignments when using the extracted knowledge for classification. In a case study of knowledge discovery from an area-class soil map, all three methods were able to extract knowledge embedded in the map to classify soils at accuracies comparable to that of the original map. The methods were also able to capture membership gradations and helped to identify transitional zones and areas of potential problems on the source map when measures of uncertainties were mapped. Among the three methods compared, a fuzzy decision tree approach demonstrated the best performance in modeling the transitions between soil prototypes.

[1]  Feng Qi,et al.  Knowledge Discovery from Area‐Class Resource Maps: Data Preprocessing for Noise Reduction , 2004, Trans. GIS.

[2]  Ola Ahlqvist,et al.  Rough classification and accuracy assessment , 2000, Int. J. Geogr. Inf. Sci..

[3]  Linda C. van der Gaag,et al.  Visual exploration of uncertainty in remote-sensing classification , 1998 .

[4]  Hans-Jürgen Zimmermann,et al.  An application-oriented view of modeling uncertainty , 2000, Eur. J. Oper. Res..

[5]  Jane Yung-jen Hsu,et al.  Fuzzy classification trees for data analysis , 2002, Fuzzy Sets Syst..

[6]  G. Lakoff Women, fire, and dangerous things : what categories reveal about the mind , 1989 .

[7]  Peter Scull,et al.  Predictive soil mapping: a review , 2003 .

[8]  Michael F. Worboys,et al.  Nearness relations in environmental space , 2001, Int. J. Geogr. Inf. Sci..

[9]  A-Xing Zhu,et al.  Measuring Uncertainty in Class Assignment for Natural Resource Maps under Fuzzy Logic , 1997 .

[10]  Wenzhong Shi A Generic Statistical Approach for Modelling Error of Geometric Features in GIS , 1998, Int. J. Geogr. Inf. Sci..

[11]  D. A. Davidson,et al.  A Land Evaluation Project in Greece Using GIS and Based on Boolean and Fuzzy Set Methodologies , 1996, Int. J. Geogr. Inf. Sci..

[12]  James B. Campbell,et al.  The Missing Geographic Dimension to Soil Taxonomy , 1984 .

[13]  P. Burrough,et al.  Geographic Objects with Indeterminate Boundaries , 1996 .

[14]  Yee Leung,et al.  A Locational Error Model for Spatial Features , 1998, Int. J. Geogr. Inf. Sci..

[15]  A-Xing Zhu,et al.  A personal construct-based knowledge acquisition process for natural resource mapping , 1999, Int. J. Geogr. Inf. Sci..

[16]  Andy Evans,et al.  Uncertainty and Error , 2012 .

[17]  Peter A. Burrough,et al.  Fuzzy mathematical methods for soil survey and land evaluation , 1989 .

[18]  Paddington Hodza,et al.  Fuzzy logic and differences between interpretive soil maps , 2010 .

[19]  A-Xing Zhu,et al.  Fuzzy soil mapping based on prototype category theory , 2006 .

[20]  B. Tversky,et al.  Journal of Experimental Psychology : General VOL . 113 , No . 2 JUNE 1984 Objects , Parts , and Categories , 2005 .

[21]  J. D. Smith,et al.  Prototypes in category learning: the effects of category size, category structure, and stimulus complexity. , 2001, Journal of experimental psychology. Learning, memory, and cognition.

[22]  Elisabeth N. Bui,et al.  Spatial data mining for enhanced soil map modelling , 2002, Int. J. Geogr. Inf. Sci..

[23]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[24]  P. Fisher Visualizing Uncertainty in Soil Maps by Animation , 1993 .

[25]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[26]  M. Swaine,et al.  What is forest , 1981 .

[27]  Charles J. Fillmore,et al.  Frames and the semantics of understanding , 1985 .

[28]  A-Xing Zhu,et al.  A case-based reasoning approach to fuzzy soil mapping , 2002 .

[29]  Mogens Humlekrog Greve,et al.  Determining and representing width of soil boundaries using electrical conductivity and MultiGrid , 2004, Comput. Geosci..

[30]  Wolfgang Kainz,et al.  Progress in Spatial Data Handling: 12th International Symposium on Spatial Data Handling , 2006 .

[31]  C. Peter Keller,et al.  Modelling and visualizing multiple spatial uncertainties , 1997 .

[32]  Morton J. Canty,et al.  Boosting a fast neural network for supervised land cover classification , 2009, Comput. Geosci..

[33]  Brandon Plewe,et al.  The Nature of Uncertainty in Historical Geographic Information , 2002, Trans. GIS.

[34]  Peter D. Kemp,et al.  Modelling the productivity of naturalised pasture in the North Island, New Zealand: a decision tree approach , 2005 .

[35]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[36]  Peter Szolovits,et al.  What Is a Knowledge Representation? , 1993, AI Mag..

[37]  Peter A. Burrough,et al.  Natural Objects with Indeterminate Boundaries , 2020 .

[38]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[39]  E. Rosch,et al.  Family resemblances: Studies in the internal structure of categories , 1975, Cognitive Psychology.

[40]  Robert Jeansoulin,et al.  Fundamentals of Spatial Data Quality , 2006 .

[41]  Michael F. Goodchild,et al.  Visualizing spatial data uncertainty using animation , 1997 .

[42]  A-Xing Zhu,et al.  A similarity model for representing soil spatial information , 1997 .

[43]  G. Lakoff,et al.  Women, Fire, and Dangerous Things: What Categories Reveal about the Mind , 1988 .

[44]  Cort J. Willmott,et al.  Spatial statistics and models , 1984 .

[45]  Qi Feng,et al.  Knowledge discovery from area-class resource maps: Capturing prototype effects , 2008 .

[46]  Michael F. Goodchild,et al.  Development and test of an error model for categorical data , 1992, Int. J. Geogr. Inf. Sci..

[47]  A-Xing Zhu,et al.  Modeling Uncertainty in Knowledge Discovery for Classifying Geographic Entities with Fuzzy Boundaries , 2006 .

[48]  B. Bennett What is a Forest? On the Vagueness of Certain Geographic Concepts , 2001 .

[49]  Jiawei Han,et al.  Geographic data mining and knowledge discovery: An overview , 2009 .

[50]  David Unwin,et al.  Visualization In Geographical Information Systems , 1996 .

[51]  Eleanor Rosch,et al.  Principles of Categorization , 1978 .

[52]  E. Rosch,et al.  Cognition and Categorization , 1980 .

[53]  A. Comber,et al.  Approaches to Uncertainty in Spatial Data , 2010 .

[54]  Thomas G. Dietterich Machine-Learning Research Four Current Directions , 1997 .

[55]  Feng Qi,et al.  Knowledge discovery from soil maps using inductive learning , 2003, Int. J. Geogr. Inf. Sci..