Novel Ensemble Approaches of Machine Learning Techniques in Modeling the Gully Erosion Susceptibility

Gully erosion has become a major environmental issue because of the severity of its impacts in many parts of the world. It affects agriculture and infrastructure development both directly and indirectly. The Golestan Dam basin, where soil erosion and degradation are very severe problems, was selected as the study area. This research maps gully erosion susceptibility (GES) by integrating four models: maximum entropy (MaxEnt), artificial neural network (ANN), support vector machine (SVM), and general linear model (GLM). Of 1042 gully locations, 729 (70%) were used for modeling and 313 (30%) for validation. Fourteen effective gully erosion conditioning factors (GECFs) were selected for spatial gully erosion modeling. Tolerance and variance inflation factors (VIFs) were used to examine collinearity among the GECFs. The random forest (RF) model was used to assess the factors’ effectiveness and significance in gully erosion modeling. An ensemble of techniques can provide more accurate results than single, standalone models. Therefore, we compared two-, three-, and four-model ensembles (ANN-SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, ANN-SVM-GLM, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM, MaxEnt-ANN-SVM, and GLM-ANN-SVM-MaxEnt) for GES modeling. The susceptibility zones of the resulting gully erosion susceptibility models (GESMs) were classified as very low, low, medium, high, and very high using Jenks’ natural break classification method (NBM). Subsequently, the receiver operating characteristic (ROC) curve and the seed cell area index (SCAI) were used to assess the reliability of the models. The success rate curve (SRC) and prediction rate curve (PRC) and their area under the curve (AUC) values were obtained from the GES maps. The results show that the two- and three-model ensembles that include ANN are more accurate than the other combinations, and that the ANN-SVM model had the highest accuracy. The rank of the others from best to worst accuracy is GLM, MaxEnt, SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, GLM-ANN-SVM-MaxEnt, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM, and MaxEnt-ANN-SVM. The resulting GESMs are efficient and powerful and could be used to improve soil and water conservation and management.
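
The collinearity check mentioned above can be illustrated with a minimal sketch. It uses the standard definitions, tolerance = 1 - R² and VIF = 1 / tolerance, where R² comes from regressing one conditioning factor on all the others. The function name `tolerance_and_vif`, the synthetic factors, and the cut-off mentioned afterwards are assumptions for illustration; the abstract does not give the authors' implementation or thresholds.

```python
import numpy as np


def tolerance_and_vif(X):
    """Return (tolerance, vif) arrays with one entry per column of X.

    X is an (n_samples, n_factors) array of conditioning-factor values.
    """
    X = np.asarray(X, dtype=float)
    n_samples, n_factors = X.shape
    tolerance = np.empty(n_factors)
    for j in range(n_factors):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        # Regress factor j on the remaining factors, with an intercept.
        A = np.column_stack([np.ones(n_samples), others])
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        residual = y - A @ coef
        ss_res = residual @ residual
        ss_tot = ((y - y.mean()) ** 2).sum()
        # tolerance = 1 - R^2 = SS_res / SS_tot
        tolerance[j] = ss_res / ss_tot
    vif = 1.0 / tolerance
    return tolerance, vif


# Synthetic stand-ins for three conditioning factors (not the study's data).
rng = np.random.default_rng(0)
slope = rng.normal(size=500)
elevation = rng.normal(size=500)
# Hypothetical factor that is almost a copy of `slope`, to force collinearity.
redundant = slope + 0.05 * rng.normal(size=500)

factors = np.column_stack([slope, elevation, redundant])
tolerance, vif = tolerance_and_vif(factors)
print("tolerance:", tolerance)
print("VIF:", vif)
```

A commonly quoted rule of thumb treats VIF above about 10 (tolerance below about 0.1) as a sign of problematic collinearity; in this toy example the first and third factors would be flagged and the second would not.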

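The ensemble comparison and AUC-based validation described in the abstract can be sketched as follows. The abstract does not say how the individual model outputs are combined, so this example assumes a simple unweighted mean of min-max rescaled susceptibility scores; `min_max`, `ensemble_mean`, and `auc` are hypothetical helper names, and the labels and scores are invented toy values. The AUC is computed with the rank (Mann-Whitney) formulation, which equals the area under the ROC curve.

```python
import numpy as np


def min_max(scores):
    """Rescale scores to [0, 1] so models on different scales are comparable."""
    scores = np.asarray(scores, dtype=float)
    span = scores.max() - scores.min()
    if span == 0:
        return np.zeros_like(scores)
    return (scores - scores.min()) / span


def ensemble_mean(*model_scores):
    """Assumed combination rule: unweighted mean of rescaled model scores."""
    return np.mean([min_max(s) for s in model_scores], axis=0)


def auc(labels, scores):
    """Area under the ROC curve via pairwise ranking of positives vs negatives."""
    labels = np.asarray(labels).astype(bool)
    scores = np.asarray(scores, dtype=float)
    pos = scores[labels]
    neg = scores[~labels]
    # Count (gully, non-gully) pairs ranked correctly; ties count as half.
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (pos.size * neg.size)


# Toy validation cells: 1 = gully present, 0 = gully absent (invented values).
labels = np.array([1, 1, 1, 0, 0, 0])
model_a = np.array([0.9, 0.8, 0.3, 0.4, 0.2, 0.1])  # stand-in for one model
model_b = np.array([0.6, 0.2, 0.7, 0.1, 0.5, 0.3])  # stand-in for another

combined = ensemble_mean(model_a, model_b)
auc_a = auc(labels, model_a)
auc_b = auc(labels, model_b)
auc_combined = auc(labels, combined)
print(auc_a, auc_b, auc_combined)
```

On this made-up data the averaged scores rank every gully cell above every non-gully cell, which neither single model does. That is only an illustration of why an ensemble can outperform its members, not a reproduction of the reported results. Applying `auc` to the training locations would correspond to the success rate, and applying it to the held-out 30% to the prediction rate.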