Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping

Abstract Landslides are a common type of natural disaster that brings great threats to the human lives and economic development around the world, especially in the Chinese Loess Plateau. Longxian County (Shaanxi Province, China), a landslide-prone area located in the southwest part of the Loess Plateau, was selected as the study area. The main purpose of this paper is to map landslide susceptibility using Alternating decision tree (ADTree) as well as GIS-based new ensemble techniques involving ADTree with bootstrap aggregation (Bagging) and ADTree with adaptive boosting (AdaBoost). Initially, a landslide inventory map was prepared with 171 determined historical landslides events in the study area, 120 landslides (70%) were randomly selected for training dataset and the remaining 51 landslides (30%) were used for validation dataset. Subsequently, eleven landslide conditioning factors were considered in the landslide susceptibility mapping. Then, an optimization operation on selection of landslide conditioning factors was performed using correlation attribute evaluation method and Spearman’s rank correlation coefficient. Afterwards, landslide susceptibility maps were generated with the three models. Finally, receiver operating characteristic (ROC) curve, area under the ROC curve (AUC) and statistical measures were applied to evaluate and validate the performance of the models. The results show success rates of the ADTree model, the ADTree with Bagging (ADTree-Bagging) model and the ADTree with AdaBoost (ADTree-AdaBoost) model were 0.872, 0.917, and 0.984, respectively, while prediction rates of the three models were 0.696, 0.752 and 0.787, respectively. In sum, the two ensemble models proposed prohibited better performance than the ADTree model did, and the ADTree-AdaBoost model was selected as the best model in the study. Hence, ensemble techniques can provide new and promising methods for spatial prediction and zonation of landslide susceptibility.

[1]  P. Reichenbach,et al.  Optimal landslide susceptibility zonation based on multiple forecasts , 2010 .

[2]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[3]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[4]  Isik Yilmaz,et al.  Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat - Turkey) , 2009, Comput. Geosci..

[5]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[6]  C. Thorne,et al.  Quantitative analysis of land surface topography , 1987 .

[7]  Biswajeet Pradhan,et al.  A novel ensemble decision tree-based CHi-squared Automatic Interaction Detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping , 2014, Landslides.

[8]  B. Pradhan,et al.  GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks , 2016, Environmental Earth Sciences.

[9]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility assessment in Lianhua County (China); a comparison between a random forest data mining technique and bivariate and multivariate statistical models , 2016 .

[10]  C. Palacín,et al.  Influence of spatial heterogeneity and temporal variability in habitat selection: A case study on a great bustard metapopulation , 2012 .

[11]  A. Trigila,et al.  Comparison of Logistic Regression and Random Forests techniques for shallow landslide susceptibility assessment in Giampilieri (NE Sicily, Italy) , 2015 .

[12]  Wei Chen,et al.  GIS-based landslide susceptibility modelling: a comparative assessment of kernel logistic regression, Naïve-Bayes tree, and alternating decision tree models , 2017 .

[13]  Saro Lee,et al.  Ensemble-based landslide susceptibility maps in Jinbu area, Korea , 2012, Environmental Earth Sciences.

[14]  J. Malet,et al.  Landslide susceptibility assessment by bivariate methods at large scales: Application to a complex mountainous environment , 2007 .

[15]  P. Reichenbach,et al.  A review of statistically-based landslide susceptibility models , 2018 .

[16]  C. F. Lee,et al.  Assessment of landslide susceptibility on the natural terrain of Lantau Island, Hong Kong , 2001 .

[17]  Wei Chen,et al.  GIS-based assessment of landslide susceptibility using certainty factor and index of entropy models for the Qianyang County of Baoji city, China , 2015, Journal of Earth System Science.

[18]  A. Yalçın GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): Comparisons of results and confirmations , 2008 .

[19]  Jianbing Peng,et al.  Distribution and characteristics of landslide in Loess Plateau: A case study in Shaanxi province , 2017 .

[20]  Wenwu Chen,et al.  Heavy rainfall triggered loess–mudstone landslide and subsequent debris flow in Tianshui, China , 2015 .

[21]  Wei Chen,et al.  Landslide spatial modeling: Introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques , 2017 .

[22]  Huichan Chai,et al.  Application of frequency ratio, statistical index, and index of entropy models and their comparison in landslide susceptibility mapping for the Baozhong Region of Baoji, China , 2015, Arabian Journal of Geosciences.

[23]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[24]  P. Martin Mai,et al.  Presenting logistic regression-based landslide susceptibility results , 2018, Engineering Geology.

[25]  Biswajeet Pradhan,et al.  Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree , 2016, Landslides.

[26]  Qiqing Wang,et al.  Landslide susceptibility assessment using frequency ratio, statistical index and certainty factor models for the Gangu County, China , 2016, Arabian Journal of Geosciences.

[27]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[28]  Jun Chen,et al.  A rapid loess flowslide triggered by irrigation in China , 2009 .

[29]  Xiaoqin Li,et al.  GIS-based landslide susceptibility mapping using analytical hierarchy process (AHP) and certainty factor (CF) models for the Baozhong region of Baoji City, China , 2015, Environmental Earth Sciences.

[30]  Jie Liu,et al.  Controlling factors of loess landslides in western China , 2010 .

[31]  Mauro Rossi,et al.  LAND-SE: a software for statistically based landslide susceptibilityzonation, version 1.0 , 2016 .

[32]  H. Pourghasemi,et al.  Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances , 2013, Natural Hazards.

[33]  Jianbing Peng,et al.  Distribution and genetic types of loess landslides in China , 2019, Journal of Asian Earth Sciences.

[34]  B. Pradhan,et al.  Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models , 2007 .

[35]  Gonghui Wang,et al.  Study of the 1920 Haiyuan earthquake-induced landslides in loess (China) , 2007 .

[36]  A. Ziegler,et al.  Erosion processes in steep terrain—Truths, myths, and uncertainties related to forest management in Southeast Asia , 2006 .

[37]  Biswajeet Pradhan,et al.  A novel integrated model for assessing landslide susceptibility mapping using CHAID and AHP pair-wise comparison , 2016 .

[38]  Yue Liang,et al.  A loess landslide induced by excavation and rainfall , 2014, Landslides.

[39]  G. Jenks The Data Model Concept in Statistical Mapping , 1967 .

[40]  H. Shahabi,et al.  Landslide susceptibility mapping at central Zab basin, Iran: a comparison between analytical hierarchy process, frequency ratio and logistic regression models , 2014 .

[41]  B. Pradhan,et al.  Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy , 2016 .

[42]  Shouyun Liang,et al.  Landslide susceptibility assessment using evidential belief function, certainty factor and frequency ratio model at Baxie River basin, NW China , 2019 .

[43]  Dieu Tien Bui,et al.  Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study , 2017, Geotechnical and Geological Engineering.

[44]  A. Akobeng,et al.  Understanding diagnostic tests 3: receiver operating characteristic curves , 2007, Acta paediatrica.

[45]  Fuchu Dai,et al.  Landslide risk assessment and management: an overview , 2002 .

[46]  E. Rotigliano,et al.  Assessment of susceptibility to earth-flow landslide using logistic regression and multivariate adaptive regression splines: A case of the Belice River basin (western Sicily, Italy) , 2015 .

[47]  D. Bui,et al.  A novel ensemble classifier of rotation forest and Naïve Bayer for landslide susceptibility assessment at the Luc Yen district, Yen Bai Province (Viet Nam) using GIS , 2017 .

[48]  Biswajeet Pradhan,et al.  Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling , 2010, Environ. Model. Softw..

[49]  B. Pradhan,et al.  Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling–Narayanghat road section in Nepal Himalaya , 2012, Natural Hazards.

[50]  Dongsheng Liu,et al.  Loess and the environment , 1985 .

[51]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[52]  Hamid Reza Pourghasemi,et al.  Erratum to: Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2016, Landslides.

[53]  Geoff Holmes,et al.  Optimizing the Induction of Alternating Decision Trees , 2001, PAKDD.

[54]  B. Pradhan,et al.  Landslide Susceptibility Mapping Along the National Road 32 of Vietnam Using GIS-Based J48 Decision Tree Classifier and Its Ensembles , 2014 .

[55]  Binh Thai Pham,et al.  A novel hybrid model of Bagging-based Naïve Bayes Trees for landslide susceptibility assessment , 2019, Bulletin of Engineering Geology and the Environment.

[56]  A. Akgun,et al.  Landslide susceptibility mapping for a landslide-prone area (Findikli, NE of Turkey) by likelihood-frequency ratio and weighted linear combination models , 2008 .

[57]  R. Guthrie,et al.  An examination of controls on debris flow mobility: Evidence from coastal British Columbia , 2010 .

[58]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2015, Landslides.

[59]  Peiyue Li,et al.  Environment: Accelerate research on land creation , 2014, Nature.

[60]  Mustafa Neamah Jebur,et al.  Landslide susceptibility mapping using ensemble bivariate and multivariate statistical models in Fayfa area, Saudi Arabia , 2015, Environmental Earth Sciences.

[61]  Young-Kwang Yeon,et al.  Landslide susceptibility mapping in Injae, Korea, using a decision tree , 2010 .

[62]  Ling Xu,et al.  Landslides in a loess platform, North-West China , 2014, Landslides.

[63]  Q. Záruba,et al.  Landslides and their control , 1969 .

[64]  Minoru Yamanaka,et al.  Predictive modelling of rainfall-induced landslide hazard in the Lesser Himalaya of Nepal based on weights-of-evidence , 2008 .

[65]  L. Ayalew,et al.  Landslide susceptibility mapping using GIS-based weighted linear combination, the case in Tsugawa area of Agano River, Niigata Prefecture, Japan , 2004 .

[66]  Xuanmei Fan,et al.  The landslide story , 2013 .

[67]  Sandeep Kumar,et al.  PMT: New analytical framework for automated evaluation of geo-environmental modelling approaches. , 2019, The Science of the total environment.

[68]  J. Caprini,et al.  Validation of Venous Clinical Severity Score (VCSS) with other venous severity assessment tools from the American Venous Forum, National Venous Screening Program. , 2011, Journal of vascular surgery.

[69]  Yanli Wu,et al.  Landslide susceptibility zonation using GIS and evidential belief function model , 2016, Arabian Journal of Geosciences.

[70]  Wenping Li,et al.  Landslide susceptibility mapping based on GIS and information value model for the Chencang District of Baoji, China , 2014, Arabian Journal of Geosciences.

[71]  P. Reichenbach,et al.  Estimating the quality of landslide susceptibility models , 2006 .

[72]  Yu Huang,et al.  Review on landslide susceptibility mapping using support vector machines , 2018, CATENA.

[73]  Huichan Chai,et al.  Landslide susceptibility mapping based on GIS and support vector machine models for the Qianyang County, China , 2016, Environmental Earth Sciences.

[74]  B. Pradhan,et al.  Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines , 2015 .

[75]  Paraskevas Tsangaratos,et al.  Estimating landslide susceptibility through a artificial neural network classifier , 2014, Natural Hazards.

[76]  B. Pham,et al.  A Comparative Study of Least Square Support Vector Machines and Multiclass Alternating Decision Trees for Spatial Prediction of Rainfall-Induced Landslides in a Tropical Cyclones Area , 2016, Geotechnical and Geological Engineering.

[77]  Hamid Reza Pourghasemi,et al.  Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution (DE), and particle swarm optimization (PSO) for landslide spatial modelling , 2017 .

[78]  L. Highland,et al.  The Landslide Handbook - A Guide to Understanding Landslides , 2008 .

[79]  Peng Jianbin THE CRITICAL ISSUES AND CREATIVE CONCEPTS IN MITIGATION RESEARCH OF LOESS GEOLOGICAL HAZARDS , 2014 .

[80]  André Stumpf,et al.  Object-oriented mapping of landslides using Random Forests , 2011 .

[81]  Dieu Tien Bui,et al.  Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS , 2017 .

[82]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[83]  A. Brenning,et al.  Assessing the quality of landslide susceptibility maps – case study Lower Austria , 2014 .

[84]  A. Zhu,et al.  GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method , 2018 .

[85]  Paraskevas Tsangaratos,et al.  Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size , 2016 .

[86]  P. Reichenbach,et al.  GIS techniques and statistical models in evaluating landslide hazard , 1991 .

[87]  P. Shi,et al.  Landslide and Debris Flow Disasters in China , 2016 .

[88]  Ruiqing Niu,et al.  The assessment of landslide susceptibility mapping using random forest and decision tree methods in the Three Gorges Reservoir area, China , 2017, Environmental Earth Sciences.

[89]  B. Pradhan,et al.  Landslide susceptibility mapping using index of entropy and conditional probability models in GIS: Safarood Basin, Iran , 2012 .

[90]  S. Hensley,et al.  Fault-zone controls on the spatial distribution of slow-moving landslides , 2013 .

[91]  E. Derbyshire,et al.  Geological hazards in loess terrain, with particular reference to the loess regions of China , 2001 .

[92]  Maria Petrou,et al.  Landslide-Hazard Mapping Using an Expert System and a GIS , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[93]  Yanli Wu,et al.  A comparative study on the landslide susceptibility mapping using logistic regression and statistical index models , 2017, Arabian Journal of Geosciences.