Bagging Based Machine Learning Algorithms for Landslide Susceptibility Modeling

Landslide hazards have attracted increasing public attention over the past decades due to a series of catastrophic consequences of landslide occurrence. Thus, the mitigation and prevention of landslide hazards have been the topical issues. Thereinto, numerous research achievements on landslide susceptibility assessment have been springing up in recent years. In this paper, four benchmark models including best-first decision tree (BFTree), functional tree (FT), support vector machine (SVM) and classification regression tree (CART) and were integrated with bagging strategy. Then these bagging-based models were applied to map regional landslide susceptibility in Jiange County, Sichuan Province, China. Fifteen conditioning factors were employed in establishing landslide susceptibility models, respectively, slope aspect, slope angle, elevation, plan curvature, profile curvature, TWI, SPI, STI, lithology, soil, land use, NDVI, distance to rivers, distance to roads and distance to lineaments. Then utilize correlation attribute evaluation (CAE) method to weigh the contribution of each factor. Finally, the comprehensive performance of various bagging-based models and corresponding benchmark models was evaluated and systematically compared applying receiver operating characteristic curve (ROC) and area under curve (AUC) values. Results demonstrated that bagging-based ensemble models significantly outperformed their corresponding benchmark models with validation dataset. among them the Bag-CART model has the highest AUC value of 0.874, however the AUC value of CART model is only 0.766, which reflected satisfying predictive capacity of integrated models in some degree. The achievements obtained in this study have some reference values for landslides prevention and land resource planning in Jiange County.

[1]  João Gama,et al.  Functional Trees , 2001, Machine Learning.

[2]  M. S. Mesgari,et al.  A novel spatial index using spatial analyses and hierarchical fuzzy expert system for obtaining green TOD: a case study in Tehran city , 2019 .

[3]  Ali P. Yunus,et al.  Evaluating scale effects of topographic variables in landslide susceptibility models using GIS-based machine learning techniques , 2019, Scientific Reports.

[4]  Y. Matsushi,et al.  Lithological controls on hydrological processes that trigger shallow landslides: Observations from granite and hornfels hillslopes in Hiroshima, Japan , 2019, CATENA.

[5]  Jürgen Pilz,et al.  The human cost of global warming: Deadly landslides and their triggers (1995-2014). , 2019, The Science of the total environment.

[6]  Dieu Tien Bui,et al.  Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study , 2017, Geotechnical and Geological Engineering.

[7]  Ali P. Yunus,et al.  Torrential rainfall-triggered shallow landslide characteristics and susceptibility assessment using ensemble data-driven models in the Dongjiang Reservoir Watershed, China , 2019, Natural Hazards.

[8]  Sara M. Abuzied,et al.  Spatial prediction of landslide-susceptible zones in El-Qaá area, Egypt, using an integrated approach based on GIS statistical analysis , 2019, Bulletin of Engineering Geology and the Environment.

[9]  D. Bui,et al.  Uncertainties of prediction accuracy in shallow landslide modeling: Sample size and raster resolution , 2019, CATENA.

[10]  V. K. Bansal,et al.  Landslide hazard zonation using analytical hierarchy process along National Highway-3 in mid Himalayas of Himachal Pradesh, India , 2018, Environmental Earth Sciences.

[11]  Chin-Tung Cheng,et al.  Landslide Susceptibility Map , 2013 .

[12]  C. W. Tong,et al.  Expression of Concern: Influence of introducing various meteorological parameters to the Angström–Prescott model for estimation of global solar radiation , 2018, Environmental Earth Sciences.

[13]  Jiang-bo Gao,et al.  Spatial association between landslides and environmental factors over Guizhou Karst Plateau, China , 2018, Journal of Mountain Science.

[14]  Virender Kumar,et al.  Landslide hazard mapping along national highway-154A in Himachal Pradesh, India using information value and frequency ratio , 2017, Arabian Journal of Geosciences.

[15]  Biswajeet Pradhan,et al.  An ensemble model for landslide susceptibility mapping in a forested area , 2020, Geocarto International.

[16]  P. Bierman,et al.  Human and natural controls on erosion in the Lower Jinsha River, China , 2019, Journal of Asian Earth Sciences.

[17]  Zhihao Xu,et al.  Novel Entropy and Rotation Forest-Based Credal Decision Tree Classifier for Landslide Susceptibility Modeling , 2019, Entropy.

[18]  B. Muys,et al.  Comparison and ranking of different modelling techniques for prediction of site index in Mediterranean mountain forests , 2010 .

[19]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[20]  Mukhiddin Juliev,et al.  Comparative analysis of statistical methods for landslide susceptibility mapping in the Bostanlik District, Uzbekistan. , 2019, The Science of the total environment.

[21]  Biswajeet Pradhan,et al.  Data Mining and Statistical Approaches in Debris-Flow Susceptibility Modelling Using Airborne LiDAR Data , 2019, Sensors.

[22]  Wei Chen,et al.  GIS-Based Machine Learning Algorithms for Gully Erosion Susceptibility Mapping in a Semi-Arid Region of Iran , 2020, Remote. Sens..

[23]  Wei Chen,et al.  Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping , 2016 .

[24]  W. Shuster,et al.  Realizing the opportunities of black carbon in urban soils: Implications for water quality management with green infrastructure. , 2018, The Science of the total environment.

[25]  Keh-Jian Shou,et al.  Evaluation of the extreme rainfall predictions and their impact on landslide susceptibility in a sub-catchment scale , 2020 .

[26]  Binh Thai Pham,et al.  A novel hybrid model of Bagging-based Naïve Bayes Trees for landslide susceptibility assessment , 2019, Bulletin of Engineering Geology and the Environment.

[27]  Wei Chen,et al.  Deep learning and boosting framework for piping erosion susceptibility modeling: spatial evaluation of agricultural areas in the semi-arid region , 2021, Geocarto International.

[28]  B. Pradhan,et al.  Comparison of four kernel functions used in support vector machines for landslide susceptibility mapping: a case study at Suichuan area (China) , 2017 .

[29]  Daniel W. McKenney,et al.  Spatial models of site index based on climate and soil properties for two boreal tree species in Ontario, Canada , 2003 .

[30]  Biswajeet Pradhan,et al.  Analysis and evaluation of landslide susceptibility: a review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016) , 2018, Arabian Journal of Geosciences.

[31]  G. P. Obi Reddy,et al.  Evaluation of Best First Decision Tree on Categorical Soil Survey Data for Land Capability Classification , 2013 .

[32]  Hyuck Jin Park,et al.  Assessment of rainfall-induced landslide susceptibility at the regional scale using a physically based model and fuzzy-based Monte Carlo simulation , 2019, Landslides.

[33]  J. Adamowski,et al.  An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. , 2019, The Science of the total environment.

[34]  D. Bui,et al.  A novel ensemble classifier of rotation forest and Naïve Bayer for landslide susceptibility assessment at the Luc Yen district, Yen Bai Province (Viet Nam) using GIS , 2017 .

[35]  Saro Lee,et al.  Combining landslide susceptibility maps obtained from frequency ratio, logistic regression, and artificial neural network models using ASTER images and GIS , 2012 .

[36]  Maryam Mokhtari,et al.  Spatial prediction of landslide susceptibility in Taleghan basin, Iran , 2019, Stochastic Environmental Research and Risk Assessment.

[37]  D. Bui,et al.  Debris flows modeling using geo-environmental factors: developing hybridized deep-learning algorithms , 2021, Geocarto International.

[38]  Xiping Wang,et al.  Use of Time-of-Flight Ultrasound to Measure Wave Speed in Poplar Seedlings , 2019, Forests.

[39]  Tri Dev Acharya,et al.  Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China) , 2018 .

[40]  Binh Thai Pham,et al.  Performance Evaluation of GIS-Based Artificial Intelligence Approaches for Landslide Susceptibility Modeling and Spatial Patterns Analysis , 2020, ISPRS Int. J. Geo Inf..

[41]  Biswajeet Pradhan,et al.  Modification of landslide susceptibility mapping using optimized PSO-ANN technique , 2018, Engineering with Computers.

[42]  H. Pourghasemi,et al.  Prediction of the landslide susceptibility: Which algorithm, which precision? , 2018 .

[43]  Gongzhuang Peng,et al.  Landslide Susceptibility Assessment Using Integrated Deep Learning Algorithm along the China-Nepal Highway , 2018, Sensors.

[44]  Kun Song,et al.  Use of multi-source remote sensing images to describe the sudden Shanshucao landslide in the Three Gorges Reservoir, China , 2019, Bulletin of Engineering Geology and the Environment.

[45]  Bahareh Kalantar,et al.  Performance Evaluation and Sensitivity Analysis of Expert-Based, Statistical, Machine Learning, and Hybrid Models for Producing Landslide Susceptibility Maps , 2017 .

[46]  Qingyun Du,et al.  Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China , 2019, International journal of environmental research and public health.

[47]  Runqiu Huang,et al.  Formation, distribution and risk control of landslides in China , 2011 .

[48]  Renwei Li,et al.  Landslide Susceptibility Mapping for the Muchuan County (China): A Comparison Between Bivariate Statistical Models (WoE, EBF, and IoE) and Their Ensembles with Logistic Regression , 2019, Symmetry.

[49]  B. Pradhan,et al.  Application of weights-of-evidence and certainty factor models and their comparison in landslide susceptibility mapping at Haraz watershed, Iran , 2013, Arabian Journal of Geosciences.

[50]  F. H. van Eemeren,et al.  Analysis and evaluation , 1995 .

[51]  Alessandra Conversi,et al.  Comparative Analysis , 2009, Encyclopedia of Database Systems.

[52]  Wei Chen,et al.  GIS-based landslide susceptibility assessment using optimized hybrid machine learning methods , 2021, CATENA.

[53]  Min-Yuan Cheng,et al.  Typhoon-induced slope collapse assessment using a novel bee colony optimized support vector classifier , 2015, Natural Hazards.

[54]  Ismail Colkesen,et al.  Performance analysis of advanced decision tree-based ensemble learning algorithms for landslide susceptibility mapping , 2019, Geocarto International.

[55]  A. Akgun,et al.  Landslide susceptibility mapping by geographical information system-based multivariate statistical and deterministic models: in an artificial reservoir area at Northern Turkey , 2016, Arabian Journal of Geosciences.

[56]  B. Pham,et al.  Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods , 2017, Theoretical and Applied Climatology.

[57]  Iuliana Armaş,et al.  Weights of evidence method for landslide susceptibility mapping. Prahova Subcarpathians, Romania , 2012, Natural Hazards.

[58]  Lahsen Ait Brahim,et al.  Weights of evidence method for landslide susceptibility mapping in Tangier, Morocco , 2018 .

[59]  Yu Huang,et al.  Review on landslide susceptibility mapping using support vector machines , 2018, CATENA.

[60]  S. Mandal,et al.  Analytical hierarchy process (AHP) based landslide susceptibility mapping of Lish river basin of eastern Darjeeling Himalaya, India , 2018, Advances in Space Research.

[61]  Wei Chen,et al.  Urban flood modeling using deep-learning approaches in Seoul, South Korea , 2021 .

[62]  F. Wei,et al.  A new slope unit extraction method for regional landslide analysis based on morphological image analysis , 2018, Bulletin of Engineering Geology and the Environment.

[63]  P. Reichenbach,et al.  A review of statistically-based landslide susceptibility models , 2018 .

[64]  Xiaojing Wang,et al.  Landslide Susceptibility Modeling Based on GIS and Novel Bagging-Based Kernel Logistic Regression , 2018, Applied Sciences.

[65]  Jie Dou,et al.  Evaluating GIS-Based Multiple Statistical Models and Data Mining for Earthquake and Rainfall-Induced Landslide Susceptibility Using the LiDAR DEM , 2019, Remote. Sens..

[66]  Qiqing Wang,et al.  Landslide susceptibility assessment using frequency ratio, statistical index and certainty factor models for the Gangu County, China , 2016, Arabian Journal of Geosciences.

[67]  H. Pourghasemi,et al.  Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances , 2013, Natural Hazards.

[68]  H. Pourghasemi,et al.  Land subsidence susceptibility assessment using random forest machine learning algorithm , 2019, Environmental Earth Sciences.

[69]  Biswajeet Pradhan,et al.  Shallow Landslide Prediction Using a Novel Hybrid Functional Machine Learning Algorithm , 2019, Remote. Sens..

[70]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[71]  João Gama,et al.  Functional trees for classification , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[72]  Dieu Tien Bui,et al.  Development of a Novel Hybrid Intelligence Approach for Landslide Spatial Prediction , 2019, Applied Sciences.

[73]  M. Kasai,et al.  Topographic effects on frequency-size distribution of landslides triggered by the Hokkaido Eastern Iburi Earthquake in 2018 , 2019, Earth, Planets and Space.

[74]  Y. Zhao,et al.  A comparison of the effects of natural vegetation regrowth with a plantation scheme on soil structure in a geological hazard‐prone region , 2019, European Journal of Soil Science.

[75]  M. Rossi,et al.  Landslide susceptibility modeling in a landslide prone area in Mazandarn Province, north of Iran: a comparison between GLM, GAM, MARS, and M-AHP methods , 2017, Theoretical and Applied Climatology.

[76]  Saro Lee,et al.  Application of Ensemble-Based Machine Learning Models to Landslide Susceptibility Mapping , 2018, Remote. Sens..

[77]  Wei Chen,et al.  Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling. , 2018, The Science of the total environment.

[78]  Ravinesh C. Deo,et al.  Land subsidence modelling using tree-based machine learning algorithms. , 2019, The Science of the total environment.

[79]  H. Pourghasemi,et al.  Assessment of the importance of gully erosion effective factors using Boruta algorithm and its spatial modeling and mapping using three machine learning algorithms , 2019, Geoderma.

[80]  Saro Lee,et al.  Enhancing Prediction Performance of Landslide Susceptibility Model Using Hybrid Machine Learning Approach of Bagging Ensemble and Logistic Model Tree , 2018, Applied Sciences.

[81]  E. McDonald,et al.  Hillslope response under variable microclimate , 2019, Earth Surface Processes and Landforms.

[82]  Fabrizio Palmisano,et al.  Methodology for Landslide Damage Assessment , 2016 .

[83]  Taro Uchimura,et al.  Clarifying the hydrological mechanisms and thresholds for rainfall-induced landslide: in situ monitoring of big data to unsaturated slope stability analysis , 2019, Bulletin of Engineering Geology and the Environment.

[84]  Wei Chen,et al.  Evaluation of different boosting ensemble machine learning models and novel deep learning and boosting framework for head-cut gully erosion susceptibility. , 2021, Journal of environmental management.

[85]  S. Mandal,et al.  Bivariate statistical index for landslide susceptibility mapping in the Rorachu river basin of eastern Sikkim Himalaya, India , 2018, Spatial Information Research.

[86]  Shu-Yeong Chi,et al.  Regional landslide susceptibility assessment using multi-stage remote sensing data along the coastal range highway in northeastern Taiwan , 2018 .

[87]  Spatio-temporal dynamics of sediment transfer systems in landslide-prone Alpine catchments , 2019, Solid Earth.

[88]  Adrienn Dineva,et al.  Evaluation efficiency of hybrid deep learning algorithms with neural network decision tree and boosting methods for predicting groundwater potential , 2021, Geocarto International.

[89]  Shijie Wang,et al.  Assessing spatial-temporal evolution processes and driving forces of karst rocky desertification , 2019, Geocarto International.

[90]  Biswajeet Pradhan,et al.  Modeling landslide susceptibility in data-scarce environments using optimized data mining and statistical methods , 2018 .

[91]  Biswajeet Pradhan,et al.  GIS-based landslide susceptibility mapping using numerical risk factor bivariate model and its ensemble with linear multivariate regression and boosted regression tree algorithms , 2019, Journal of Mountain Science.

[92]  Yanli Wu,et al.  A comparative study on the landslide susceptibility mapping using logistic regression and statistical index models , 2017, Arabian Journal of Geosciences.

[93]  M. S. Guettouche,et al.  GIS-based landslide susceptibility zonation using bivariate statistical and expert approaches in the city of Constantine (Northeast Algeria) , 2015, Bulletin of Engineering Geology and the Environment.

[94]  A. Zhu,et al.  Landslide susceptibility evaluating using artificial intelligence method in the Youfang district (China) , 2019, Environmental Earth Sciences.

[95]  Hossein A. Taiebat,et al.  Finite element simulation of an excavation-triggered landslide using large deformation theory , 2016 .

[96]  Tian-feng Gu,et al.  Mechanisms of wetting-induced loess slope failures , 2019, Landslides.

[97]  B. Pradhan,et al.  GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks , 2016, Environmental Earth Sciences.

[98]  Tetsuya Kubota,et al.  Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia , 2018, Geomorphology.

[99]  Dongyan Sun,et al.  Landslide susceptibility mapping in the region of eastern Himalayan syntaxis, Tibetan Plateau, China: a comparison between analytical hierarchy process information value and logistic regression-information value methods , 2018, Bulletin of Engineering Geology and the Environment.

[100]  F. Luino,et al.  New insights in the relation between climate and slope failures at high-elevation sites , 2018, Theoretical and Applied Climatology.

[101]  Jianbing Peng,et al.  Three-dimensional geological structures and sliding factors and modes of loess landslides , 2018, Environmental Earth Sciences.

[102]  Derya Dilara Kose,et al.  GIS-based fully automatic landslide susceptibility analysis by weight-of-evidence and frequency ratio methods , 2018, Physical Geography.

[103]  Dieu Tien Bui,et al.  A novel artificial intelligence approach based on Multi-layer Perceptron Neural Network and Biogeography-based Optimization for predicting coefficient of consolidation of soil , 2019, CATENA.

[104]  Jhareswar Maiti,et al.  An optimization-based decision tree approach for predicting slip-trip-fall accidents at work , 2019, Safety Science.

[105]  I. Moore,et al.  Digital terrain modelling: A review of hydrological, geomorphological, and biological applications , 1991 .

[106]  Prima Riza Kadavi,et al.  Landslide-susceptibility mapping in Gangwon-do, South Korea, using logistic regression and decision tree models , 2019, Environmental Earth Sciences.

[107]  Seyed Amir Naghibi,et al.  Groundwater Augmentation through the Site Selection of Floodwater Spreading Using a Data Mining Approach (Case study: Mashhad Plain, Iran) , 2018, Water.

[108]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[109]  P. Martin Mai,et al.  Presenting logistic regression-based landslide susceptibility results , 2018, Engineering Geology.

[110]  P. Leese Comparison of the effects , 1992 .

[111]  D. Bui,et al.  Hybrid Machine Learning Approaches for Landslide Susceptibility Modeling , 2019, Forests.

[112]  H. Pourghasemi,et al.  GIS-based frequency ratio and index of entropy models for landslide susceptibility assessment in the Caspian forest, northern Iran , 2014, International Journal of Environmental Science and Technology.

[113]  Wei Chen,et al.  Gully headcut susceptibility modeling using functional trees, naïve Bayes tree, and random forest models , 2019, Geoderma.

[114]  F. Pergalani,et al.  Slope Instability Zonation: a Comparison Between Certainty Factor and Fuzzy Dempster–Shafer Approaches , 1998 .