Geo-parcel-based geographical thematic mapping using C5.0 decision tree: a case study of evaluating sugarcane planting suitability

Geographical thematic mapping based on spatial information can effectively support scientific decision-making in Geosciences. To obtain finer spatial decision information, this paper proposes a geo-parcel-based thematic mapping methodology for evaluating cash crop planting suitability using C5.0 decision tree (DT). In this study, geo-parcels are utilized as basic mapping units. Multi-source data are firstly employed to increase geo-parcel units’ attributes and a decision table then is constructed under a multi-attribute index system. Next, rules are mined using a C5.0 DT algorithm according to local geo-parcels in this decision table. Finally, rules are referred as thematic-distinguishing knowledge for inferential mapping in global geo-parcels. A case study of sugarcane planting suitability evaluation is conduct based on the proposed methodology. The experimental results showed that the cross-validation accuracy of the rules is 81.34% and the sum of the very suitable area and suitable area in the generated evaluation map is close to that of historical selected high-yield and high-sugar-content sugarcane bases, which indicated that the mapping result is in good agreement with the actual selection situation. These also demonstrate the effectiveness of our method and thus may be extended to other domains requiring fine geographical thematic mapping of cash crop planting suitability.

[1]  Josef Strobl,et al.  What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS , 2001 .

[2]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[3]  A-Xing Zhu,et al.  A personal construct-based knowledge acquisition process for natural resource mapping , 1999, Int. J. Geogr. Inf. Sci..

[4]  Yanqing Zhang,et al.  Granular support vector machines with association rules mining for protein homology prediction , 2005, Artif. Intell. Medicine.

[5]  Fakhri Karray,et al.  Multisensor data fusion: A review of the state-of-the-art , 2013, Inf. Fusion.

[6]  Paul M. Mather,et al.  An assessment of the effectiveness of decision tree methods for land cover classification , 2003 .

[7]  Stefan W. Maier,et al.  Comparing object-based and pixel-based classifications for mapping savannas , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[8]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[9]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[10]  Uncertainty quantification of interpolated maps derived from observations with different accuracy levels , 2016 .

[11]  Hong Jiang,et al.  Application of fuzzy measures in multi-criteria evaluation in GIS , 2000, Int. J. Geogr. Inf. Sci..

[12]  Rodger Tomlinson,et al.  Comparison of Fuzzy-AHP and AHP in a spatial multi-criteria decision making model for urban land-use planning , 2015, Comput. Environ. Urban Syst..

[13]  Piotr Jankowski,et al.  Integrating Geographical Information Systems and Multiple Criteria Decision-Making Methods , 1995, Int. J. Geogr. Inf. Sci..

[14]  Gwo-Hshiung Tzeng,et al.  A decision rule-based soft computing model for supporting financial performance improvement of the banking industry , 2014, Soft Computing.

[15]  Assessment of Irrigation Land Suitability and Development of Map for the Fogera Catchment Using GIS, South Gondar , 2013 .

[16]  Tao Pei,et al.  A citizen data-based approach to predictive mapping of spatial variation of natural phenomena , 2015, Int. J. Geogr. Inf. Sci..

[17]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[18]  A-Xing Zhu,et al.  Automated soil inference under fuzzy logic , 1996 .

[19]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[20]  Olaf Conrad,et al.  Climatologies at high resolution for the earth’s land surface areas , 2016, Scientific Data.

[21]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[22]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[23]  J. Zhou,et al.  Using Genetic Learning Neural Networks for Spatial Decision Making in GIs , 1996 .

[24]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[25]  Zhong Zheng,et al.  Rule acquisition and optimal scale selection in multi-scale formal decision contexts and their applications to smart city , 2017, Future Gener. Comput. Syst..

[26]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[27]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Decision-Tree Induction , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  Feng Qi,et al.  Knowledge discovery from soil maps using inductive learning , 2003, Int. J. Geogr. Inf. Sci..

[29]  Jacek Malczewski,et al.  GIS and Multicriteria Decision Analysis , 1999 .

[30]  Carlo Gatta,et al.  Unsupervised Deep Feature Extraction for Remote Sensing Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Jacek Malczewski,et al.  GIS‐based multicriteria decision analysis: a survey of the literature , 2006, Int. J. Geogr. Inf. Sci..

[32]  A. E. Baroudy Mapping and evaluating land suitability using a GIS-based model , 2016 .

[33]  Yiyu Yao,et al.  A Granular Computing Approach to Machine Learning , 2002, FSKD.

[34]  Wei Wu,et al.  Geo-Parcel Based Crop Identification by Integrating High Spatial-Temporal Resolution Imagery from Multi-Source Satellite Data , 2017, Remote. Sens..

[35]  J. Strobl,et al.  Object-Oriented Image Processing in an Integrated GIS/Remote Sensing Environment and Perspectives for Environmental Applications , 2000 .

[36]  A-Xing Zhu Rule‐Based Mapping , 2008 .

[37]  Salem Chakhar,et al.  Please Scroll down for Article International Journal of Geographical Information Science Gis-based Multicriteria Spatial Modeling Generic Framework Gis-based Multicriteria Spatial Modeling Generic Framework , 2022 .

[38]  Yee Leung,et al.  Theory and applications of granular labelled partitions in multi-scale decision tables , 2011, Inf. Sci..

[39]  A. Bagherzadeh,et al.  Parametric-based neural networks and TOPSIS modeling in land suitability evaluation for alfalfa production using GIS , 2017, Modeling Earth Systems and Environment.

[40]  Johan Jansson Decision Tree Classification od Products Using C5.0 and Prediction of Workload Using Time Series Analysis , 2016 .

[41]  B. Minasny,et al.  Digital Mapping of Soil Classes Using Decision Tree and Auxiliary Data in the Ardakan Region, Iran , 2014 .

[42]  Marvin N. Wright,et al.  SoilGrids250m: Global gridded soil information based on machine learning , 2017, PloS one.

[43]  Weldon A. Lodwick,et al.  Attribute error and sensitivity analysis of map operations in geographical informations systems: suitability analysis , 1990, Int. J. Geogr. Inf. Sci..

[44]  Peter F. Fisher,et al.  A Comparison of Fuzzy AHP and Ideal Point Methods for Evaluating Land Suitability , 2011, Trans. GIS.

[45]  Ruirun Chen,et al.  A hydraulic gradient similitude testing system for studying the responses of a laterally loaded pile and soil deformation , 2016, Environmental Earth Sciences.

[46]  Keith McCloy,et al.  Predictive mapping of soil organic carbon in wet cultivated lands using classification-tree based models: the case study of Denmark. , 2010, Journal of environmental management.

[47]  Lucien Duckstein,et al.  A Multiple Criteria Decision-Making Approach to GIS-Based Land Suitability Evaluation , 1993, Int. J. Geogr. Inf. Sci..

[48]  J. Ritchie,et al.  Modeling Plant and Soil Systems , 1991 .

[49]  Reza Banai,et al.  Fuzziness in Geographical Information Systems: Contributions from the Analytic Hierarchy Process , 1993, Int. J. Geogr. Inf. Sci..

[50]  Witold Pedrycz,et al.  Granular computing: an introduction , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[51]  R. Jana,et al.  A GIS-integrated fuzzy rule-based inference system for land suitability evaluation in agricultural watersheds , 2009 .

[52]  Samee Ullah Khan,et al.  Spatial sensitivity analysis of multi-criteria weights in GIS-based land suitability evaluation , 2010, Environ. Model. Softw..

[53]  Stephen J. Carver,et al.  Integrating multi-criteria evaluation with geographical information systems , 1991, Int. J. Geogr. Inf. Sci..

[54]  A. Bozdağ,et al.  AHP and GIS based land suitability analysis for Cihanbeyli (Turkey) County , 2016, Environmental Earth Sciences.

[55]  Land suitability evaluation for rainfed production of barley and wheat at Kabe Subwatershed, Northeastern Ethiopia , 2015 .

[56]  Zhe Jiang,et al.  Monitoring Land-Cover Changes: A Machine-Learning Perspective , 2016, IEEE Geoscience and Remote Sensing Magazine.

[57]  P. Burrough,et al.  Principles of geographical information systems , 1998 .

[58]  Jeremy Mennis,et al.  Spatial data mining and geographic knowledge discovery - An introduction , 2009, Comput. Environ. Urban Syst..

[59]  Giovanni Romano,et al.  Multi-criteria decision analysis for land suitability mapping in a rural area of Southern Italy , 2015 .

[60]  Yu Zheng,et al.  Methodologies for Cross-Domain Data Fusion: An Overview , 2015, IEEE Transactions on Big Data.

[61]  Salem Chakhar,et al.  Spatial multicriteria decision making , 2008 .

[62]  Feng Qi,et al.  Comparing three methods for modeling the uncertainty in knowledge discovery from area-class soil maps , 2011, Comput. Geosci..

[63]  S. A. Ogunyemi,et al.  Agricultural Land Suitability Assessment using Fuzzy Logic and Geographic Information System Techniques , 2015 .

[64]  T. Saaty,et al.  The Analytic Hierarchy Process , 1985 .