A comparative assessment of gully erosion spatial predictive modeling using statistical and machine learning models

Abstract In recent years, gully erosion has ceased many development activities and imposed a living threat to local communities residing in southern Iran. Hence, this study sets out to investigate the prediction performance of a machine learning model named the quick, unbiased, efficient statistical tree (QUEST) model for gully susceptibility mapping. Its results were compared to two conventional statistical models: frequency ratio (FR) and evidential belief function (EBF). The area under the receiver operating characteristic (AUROC) and the true skill statistic (TSS) metrics were adopted to assess models' goodness-of-fit and predictive performance in the corresponding training and validation stages. Results revealed that the QUEST model outperforms its counterparts by giving respective AUROC and TSS values of 88.5% and 0.77 in the training stage, followed by EBF (82.3% and 0.65) and FR (80.4% and 0.62). Similarly, the QUEST model showed the highest AUROC and TSS values in the validation stage (83.2% and 0.63, respectively), followed by the EBF (78.6% and 0.63, respectively) and FR (77.1% and 0.58, respectively). Further scrutinization attested that the QUEST model offers a more practical, compendious, and adaptable susceptibility map based on which about 32% of the study area was identified as the high susceptibility zone to gully erosion. Hence, highly gully susceptible areas require pragmatic mitigation plans. In addition, the application of machine learning models for gully erosion merits further studies.

[1]  C.A. Valera,et al.  The role of environmental land use conflicts in soil fertility: A study on the Uberaba River basin, Brazil. , 2016, The Science of the total environment.

[2]  R. O’Brien,et al.  A Caution Regarding Rules of Thumb for Variance Inflation Factors , 2007 .

[3]  G. Bonham-Carter,et al.  VHMS favourability mapping with GIS-based integration models, Chisel Lake-Anderson Lake area , 1996 .

[4]  H. Lim,et al.  Application of GIS-Based Evidential Belief Function Model to Regional Groundwater Recharge Potential Zones Mapping in Hardrock Geologic Terrain , 2016, Environmental Processes.

[5]  J. Poesen,et al.  Impact of road building on gully erosion risk: a case study from the Northern Ethiopian Highlands , 2002 .

[6]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[7]  Necdet Süt,et al.  Comparison of regression tree data mining methods for prediction of mortality in head injury , 2011, Expert Syst. Appl..

[8]  Jacquomo Monk,et al.  Comparison of automated classification techniques for predicting benthic biological communities using hydroacoustics and video observations , 2011 .

[9]  M. Karimi,et al.  Hydrocarbon resources potential mapping using the evidential belief functions and GIS, Ahvaz/Khuzestan Province, southwest Iran , 2015, Arabian Journal of Geosciences.

[10]  E. Rotigliano,et al.  Gully erosion susceptibility assessment by means of GIS-based logistic regression: A case of Sicily (Italy) , 2014 .

[11]  Saro Lee,et al.  Application of decision tree model for the ground subsidence hazard mapping near abandoned underground coal mines. , 2013, Journal of environmental management.

[12]  Saro Lee,et al.  Application of Decision-Tree Model to Groundwater Productivity-Potential Mapping , 2015 .

[13]  B. Pradhan,et al.  Application of GIS based data driven evidential belief function model to predict groundwater potential zonation , 2014 .

[14]  H. S. Lim,et al.  Regional prediction of groundwater potential mapping in a multifaceted geology terrain using GIS-based Dempster–Shafer model , 2015, Arabian Journal of Geosciences.

[15]  P. Reichenbach,et al.  GIS techniques and statistical models in evaluating landslide hazard , 1991 .

[16]  P. Kuhnert,et al.  Incorporating uncertainty in gully erosion calculations using the random forests modelling approach , 2009 .

[17]  E. Rotigliano,et al.  Using topographical attributes to evaluate gully erosion proneness (susceptibility) in two mediterranean basins: advantages and limitations , 2015, Natural Hazards.

[18]  B. Schröder,et al.  A functional entity approach to predict soil erosion processes in a small Plio-Pleistocene Mediterranean catchment in Northern Chianti, Italy , 2011 .

[19]  E. Carranza,et al.  Application of Data-Driven Evidential Belief Functions to Prospectivity Mapping for Aquamarine-Bearing Pegmatites, Lundazi District, Zambia , 2005 .

[20]  Saro Lee,et al.  Application of data-driven evidential belief functions to landslide susceptibility mapping in Jinbu, Korea , 2013 .

[21]  Leo Stroosnijder,et al.  Reducing Sediment Connectivity Through man‐Made and Natural Sediment Sinks in the Minizr Catchment, Northwest Ethiopia , 2017 .

[22]  Jung Hyun Lee,et al.  A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping , 2014 .

[23]  Jui-Sheng Chou,et al.  Comparison of multilabel classification models to forecast project dispute resolutions , 2012, Expert Syst. Appl..

[24]  Hamid Reza Pourghasemi,et al.  Evaluation of different machine learning models for predicting and mapping the susceptibility of gully erosion , 2017 .

[25]  Federica Lucà,et al.  Comparison of GIS-based gullying susceptibility mapping using bivariate and multivariate statistics: Northern Calabria, South Italy , 2011 .

[26]  Swades Pal,et al.  Assessing gully erosion susceptibility in Mayurakshi river basin of eastern India , 2018, Environment, Development and Sustainability.

[27]  M. H. Abokharima,et al.  Land subsidence susceptibility mapping at Kinta Valley (Malaysia) using the evidential belief function model in GIS , 2014, Natural Hazards.

[28]  R. Gil Pontius,et al.  Land-cover change model validation by an ROC method for the Ipswich watershed, Massachusetts, USA , 2001 .

[29]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[30]  Saro Lee,et al.  Modelling gully-erosion susceptibility in a semi-arid region, Iran: Investigation of applicability of certainty factor and maximum entropy models. , 2019, The Science of the total environment.

[31]  J. Einax,et al.  GEOSTATISTICAL AND MULTIVARIATE STATISTICAL METHODS FOR THE ASSESSMENT OF POLLUTED SOILS : MERITS AND LIMITATIONS , 1999 .

[32]  M. Maerker,et al.  Prediction of gully erosion susceptibilities using detailed terrain analysis and maximum entropy modeling: A case study in the Mazayejan plain, southwest Iran , 2014 .

[33]  F. Dramis,et al.  Geomorphological investigation on gully erosion in the Rift Valley and the northern highlands of Ethiopia , 2003 .

[34]  D. Ierodiaconou,et al.  Hydro-acoustic remote sensing of benthic biological communities on the shallow South East Australian continental shelf , 2009 .

[35]  G. Bonham-Carter Geographic Information Systems for Geoscientists: Modelling with GIS , 1995 .

[36]  D. Woodward,et al.  Method to predict cropland ephemeral gully erosion , 1999 .

[37]  A. Al-Abadi,et al.  Susceptibility mapping of gully erosion using GIS-based statistical bivariate models: a case study from Ali Al-Gharbi District, Maysan Governorate, southern Iraq , 2018, Environmental Earth Sciences.

[38]  Mustafa Neamah Jebur,et al.  Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia , 2014 .

[39]  T. Svoray,et al.  Predicting gully initiation: comparing data mining techniques, analytical hierarchy processes and the topographic threshold , 2012 .

[40]  G. Hall,et al.  EXTECH I: a multidisciplinary approach to massive sulphide research in the Rusty Lake-Snow Lake greenstone belts, Manitoba , 1996 .

[41]  A. Dempster Upper and lower probability inferences based on a sample from a finite univariate population. , 1967, Biometrika.

[42]  Akobundu Nwanosike Amadi,et al.  Processing Nigerian coal deposits for energy source , 2018, Environmental Earth Sciences.

[43]  Thomas Blaschke,et al.  Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India , 2020, Sensors.

[44]  B. Pradhan,et al.  Urban flood risk mapping using the GARP and QUEST models: A comparative study of machine learning techniques , 2019, Journal of Hydrology.

[45]  Mevlut Ture,et al.  Using Kaplan-Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recurrence-free survival of breast cancer patients , 2009, Expert Syst. Appl..