Spatial Modelling of Gully Erosion Using GIS and R Programing: A Comparison among Three Data Mining Algorithms

Gully erosion triggers land degradation and restricts land use. This study assesses the spatial relationship between gully erosion (GE) and geo-environmental variables (GEVs) using the Weights-of-Evidence (WoE) Bayesian approach, and then applies three data mining methods, Random Forest (RF), boosted regression tree (BRT), and multivariate adaptive regression spline (MARS), for gully erosion susceptibility mapping (GESM) in the Shahroud watershed, Iran. Gully locations were identified through extensive field surveys, and a total of 172 GE locations were mapped. Twelve gully-related GEVs were selected to model GE: elevation, slope degree, slope aspect, plan curvature, convergence index, topographic wetness index (TWI), lithology, land use/land cover (LU/LC), distance from rivers, distance from roads, drainage density, and the normalized difference vegetation index (NDVI). The variable importance results from the RF and BRT models indicated that distance from roads, elevation, and lithology had the greatest effect on GE occurrence. The area under the curve (AUC) and seed cell area index (SCAI) methods were used to validate the three GE maps. The AUC for the three models ranged from 0.911 to 0.927, with the RF model reaching the highest prediction accuracy (0.927) compared with the other models, as also indicated by the SCAI values. The findings can support planning and development in the study region.
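As a rough illustration of two quantities named in the abstract, the sketch below computes the standard WoE weights for one class of a factor and a rank-based AUC. It is not the authors' implementation (the study itself reports using GIS and R). The function names and all counts and scores are hypothetical, chosen only to show the arithmetic.

```python
import math


def woe_weights(n_class_gully, n_class_total, n_gully, n_total):
    """Return (W_plus, W_minus, contrast) for one class of a factor.

    Inputs are cell counts: gully cells inside the class, all cells inside
    the class, all gully cells, and all cells in the study area.
    """
    n_nongully = n_total - n_gully
    n_class_nongully = n_class_total - n_class_gully
    # Probability of being inside the class, given gully / given no gully.
    p_in_gully = n_class_gully / n_gully
    p_in_nongully = n_class_nongully / n_nongully
    if not (0 < p_in_gully < 1 and 0 < p_in_nongully < 1):
        # The logarithms below are undefined for empty or all-covering classes.
        raise ValueError("class must contain some, but not all, cells of each kind")
    w_plus = math.log(p_in_gully / p_in_nongully)
    w_minus = math.log((1 - p_in_gully) / (1 - p_in_nongully))
    return w_plus, w_minus, w_plus - w_minus


def auc(scores_gully, scores_nongully):
    """Rank-based AUC: share of (gully, non-gully) pairs in which the gully
    cell has the higher susceptibility score; ties count as half."""
    wins = sum(
        (g > n) + 0.5 * (g == n)
        for g in scores_gully
        for n in scores_nongully
    )
    return wins / (len(scores_gully) * len(scores_nongully))


# Hypothetical example: a class holding 30 of 100 gully cells
# but only 1000 of 10000 cells overall.
w_plus, w_minus, contrast = woe_weights(30, 1000, 100, 10000)
example_auc = auc([0.9, 0.8], [0.1, 0.85])
```

A positive contrast suggests that gullies are over-represented in the class, a value near zero suggests no spatial association, and a negative value suggests under-representation.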
