A comparative study of support vector machine and logistic model tree classifiers for shallow landslide susceptibility modeling

The main aim of this study was to evaluate and compare the results of two data-mining algorithms including support vector machine (SVM) and logistic model tree (LMT) for shallow landslide modelling in Kamyaran county where located in Kurdistan Province, Iran. A total of 60 landslide locations were identified using different sources and randomly divided into a ratio of 70/30 for landslide modeling and validation process. After that, 21 conditioning factors, with a raster resolution of 20 m, based on the information gain ratio (IGR) technique were selected. Performance of the models was evaluated using area under the receiver-operating characteristic curve (AUROC), and also several statistical-based indexes. Results depicted that only eight factors including distance to river, river density, stream power index (SPI), rainfall, valley depth, topographic wetness index (TWI), solar radiation, and plan curvature were known more effective for landslide modeling using training data set. The results also revealed that the SVM model (AUROC = 0.882) outperformed and outclassed the LMT model (AUROC = 0.737). Therefore, analysis and comparison of the results showed that the SVM model by RBF function performed well for landslide spatial prediction in the study area. Eventually, the findings of this study can be useful for land-use planning, reducing the risk of landslide, and decision-making in areas prone to landslide.

[1]  Lipika Dey,et al.  A feature selection technique for classificatory analysis , 2005, Pattern Recognit. Lett..

[2]  Wei Chen,et al.  A novel ensemble approach of bivariate statistical-based logistic model tree classifier for landslide susceptibility assessment , 2018 .

[3]  Xueling Wu,et al.  Landslide susceptibility mapping based on rough set theory and support vector machines: A case of the Three Gorges area, China , 2014 .

[4]  C. Irigaray,et al.  Factors selection in landslide susceptibility modelling on large scale following the gis matrix method: application to the river Beiro basin (Spain) , 2012 .

[5]  M. Abedini,et al.  Landslide susceptibility mapping in Bijar city, Kurdistan Province, Iran: a comparative study by logistic regression and AHP models , 2017, Environmental Earth Sciences.

[6]  Y. Zhao,et al.  Comparison of decision tree methods for finding active objects , 2007, 0708.4274.

[7]  Biswajeet Pradhan,et al.  A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS , 2013, Comput. Geosci..

[8]  B. Pradhan,et al.  Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy , 2016 .

[9]  Lee Saro,et al.  A GIS-based logistic regression model in rock-fall susceptibility mapping along a mountainous road: Salavat Abad case study, Kurdistan, Iran , 2012, Natural Hazards.

[10]  H. Shahabi,et al.  Novel forecasting approaches using combination of machine learning and statistical models for flood susceptibility mapping. , 2018, Journal of environmental management.

[11]  D. Cruden A simple definition of a landslide , 1991 .

[12]  Dieu Tien Bui,et al.  A comparative study between popular statistical and machine learning methods for simulating volume of landslides , 2017 .

[13]  B. Pham,et al.  A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. , 2018, The Science of the total environment.

[14]  Wei Chen,et al.  A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naïve Bayes tree classifiers for a landslide susceptibility assessment in Langao County, China , 2017 .

[15]  B. Pradhan,et al.  Application of frequency ratio, statistical index, and weights-of-evidence models and their comparison in landslide susceptibility mapping in Central Nepal Himalaya , 2014, Arabian Journal of Geosciences.

[16]  Pradeep Kurup,et al.  Decision tree approach for classification and dimensionality reduction of electronic nose data , 2011 .

[17]  B. Pham,et al.  Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods , 2017, Theoretical and Applied Climatology.

[18]  Yu Huang,et al.  Review on landslide susceptibility mapping using support vector machines , 2018, CATENA.

[19]  D. Bui,et al.  Shallow landslide susceptibility assessment using a novel hybrid intelligence approach , 2017, Environmental Earth Sciences.

[20]  Jens Forster,et al.  Logistic Model Trees with AUC Split Criterion for the KDD Cup 2009 Small Challenge , 2009, KDD Cup.

[21]  A. Zhu,et al.  Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling , 2018, Bulletin of Engineering Geology and the Environment.

[22]  Guifang Zhang,et al.  Integration of the Statistical Index Method and the Analytic Hierarchy Process technique for the assessment of landslide susceptibility in Huizhou, China , 2016 .

[23]  Wei Chen,et al.  Landslide Detection and Susceptibility Mapping by AIRSAR Data Using Support Vector Machine and Index of Entropy Models in Cameron Highlands, Malaysia , 2018, Remote. Sens..

[24]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[25]  Biswajeet Pradhan,et al.  A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India) , 2016, Environ. Model. Softw..

[26]  B. Pradhan,et al.  Landslide Susceptibility Assessment in Vietnam Using Support Vector Machines, Decision Tree, and Naïve Bayes Models , 2012 .

[27]  Fuchu Dai,et al.  Landslide risk assessment and management: an overview , 2002 .

[28]  K. Solaimani,et al.  Rock fall susceptibility assessment along a mountainous road: an evaluation of bivariate statistic, analytical hierarchy process and frequency ratio , 2017, Environmental Earth Sciences.

[29]  L. Tham,et al.  Landslide susceptibility mapping based on Support Vector Machine: A case study on natural slopes of Hong Kong, China , 2008 .

[30]  B. Pradhan,et al.  Landslide susceptibility mapping using support vector machine and GIS at the Golestan Province, Iran , 2013, Journal of Earth System Science.

[31]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility assessment in Lianhua County (China); a comparison between a random forest data mining technique and bivariate and multivariate statistical models , 2016 .

[32]  Chong Xu,et al.  The 2010 Yushu earthquake triggered landslide hazard mapping using GIS and weight of evidence modeling , 2012, Environmental Earth Sciences.

[33]  E. Yesilnacar,et al.  Landslide susceptibility mapping : A comparison of logistic regression and neural networks methods in a medium scale study, Hendek Region (Turkey) , 2005 .

[34]  B. Pradhan,et al.  Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines , 2015 .

[35]  Dieu Tien Bui,et al.  Landslide Susceptibility Assessment at a Part of Uttarakhand Himalaya, India using GIS – based Statistical Approach of Frequency Ratio Method , 2015 .

[36]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[37]  Yang Liu,et al.  An introduction to decision tree modeling , 2004 .

[38]  A-Xing Zhu,et al.  Flood susceptibility assessment in Hengfeng area coupling adaptive neuro-fuzzy inference system with genetic algorithm and differential evolution. , 2018, The Science of the total environment.

[39]  N S NITHYA,et al.  Gain ratio based fuzzy weighted association rule mining classifier for medical diagnostic interface , 2014 .

[40]  Alfred Stein,et al.  Landslide susceptibility mapping along road corridors in the Indian Himalayas using Bayesian logistic regression models , 2012 .

[41]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[42]  C. Irigaray,et al.  Landslide-susceptibility mapping in a semi-arid mountain environment: an example from the southern slopes of Sierra Nevada (Granada, Spain) , 2011 .

[43]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[44]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[45]  Nhat-Duc Hoang,et al.  A Novel Integrated Approach of Relevance Vector Machine Optimized by Imperialist Competitive Algorithm for Spatial Modeling of Shallow Landslides , 2018, Remote. Sens..

[46]  S. Z. Mousavi,et al.  GIS-based spatial prediction of landslide susceptibility using logistic regression model , 2011 .

[47]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[48]  I. Ilia,et al.  Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi Perfection, Greece , 2016, Landslides.

[49]  Dieu Tien Bui,et al.  A novel hybrid artificial intelligence approach for flood susceptibility assessment , 2017, Environ. Model. Softw..

[50]  Joseph H. A. Guillaume,et al.  Characterising performance of environmental models , 2013, Environ. Model. Softw..

[51]  Biswajeet Pradhan,et al.  Novel GIS Based Machine Learning Algorithms for Shallow Landslide Susceptibility Mapping , 2018, Sensors.

[52]  A. Zhu,et al.  A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China) , 2017, Environmental Earth Sciences.

[53]  Tri Dev Acharya,et al.  Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China) , 2018 .

[54]  D. Bui,et al.  A hybrid machine learning ensemble approach based on a Radial Basis Function neural network and Rotation Forest for landslide susceptibility modeling: A case study in the Himalayan area, India , 2017, International Journal of Sediment Research.

[55]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[56]  A. Nonomura,et al.  GIS-based weights-of-evidence modelling of rainfall-induced landslides in small catchments for landslide susceptibility mapping , 2008 .

[57]  Wei Chen,et al.  Spatial prediction of landslide susceptibility by combining evidential belief function, logistic regression and logistic model tree , 2019, Geocarto International.

[58]  D. Bui,et al.  Landslide susceptibility modeling using Reduced Error Pruning Trees and different ensemble techniques: Hybrid machine learning approaches , 2019, CATENA.

[59]  Biswajeet Pradhan,et al.  Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree , 2016, Landslides.

[60]  Wei Chen,et al.  Land Subsidence Susceptibility Mapping in South Korea Using Machine Learning Algorithms , 2018, Sensors.

[61]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[62]  V. Singh,et al.  New Hybrids of ANFIS with Several Optimization Algorithms for Flood Susceptibility Modeling , 2018, Water.

[63]  Himan Shahabi,et al.  A novel hybrid approach of Bayesian Logistic Regression and its ensembles for landslide susceptibility assessment , 2018, Geocarto International.

[64]  Friedrich Recknagel,et al.  Handbook of Ecological Modelling and Informatics , 2009 .

[65]  Xiaojing Wang,et al.  Landslide Susceptibility Modeling Based on GIS and Novel Bagging-Based Kernel Logistic Regression , 2018, Applied Sciences.

[66]  Wei Chen,et al.  A hybrid fuzzy weight of evidence method in landslide susceptibility analysis on the Wuyuan area, China , 2017 .

[67]  Cristiano Ballabio,et al.  Support Vector Machines for Landslide Susceptibility Mapping: The Staffora River Basin Case Study, Italy , 2012, Mathematical Geosciences.

[68]  V. Singh,et al.  Mapping Groundwater Potential Using a Novel Hybrid Intelligence Approach , 2018, Water Resources Management.

[69]  D. Bui,et al.  Uncertainties of prediction accuracy in shallow landslide modeling: Sample size and raster resolution , 2019, CATENA.

[70]  V. Singh,et al.  Novel Hybrid Evolutionary Algorithms for Spatial Prediction of Floods , 2018, Scientific Reports.

[71]  T. Kavzoglu,et al.  An assessment of multivariate and bivariate approaches in landslide susceptibility mapping: a case study of Duzkoy district , 2015, Natural Hazards.

[72]  D. Bui,et al.  Hybrid Machine Learning Approaches for Landslide Susceptibility Modeling , 2019, Forests.

[73]  Dieu Tien Bui,et al.  A novel hybrid approach of landslide susceptibility modelling using rotation forest ensemble and different base classifiers , 2019, Geocarto International.

[74]  Biswajeet Pradhan,et al.  Use of GIS-based fuzzy logic relations and its cross application to produce landslide susceptibility maps in three test areas in Malaysia , 2011 .

[75]  Himan Shahabi,et al.  Landslide spatial modelling using novel bivariate statistical based Naïve Bayes, RBF Classifier, and RBF Network machine learning algorithms. , 2019, The Science of the total environment.

[76]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[77]  Himan Shahabi,et al.  Flood susceptibility assessment using integration of adaptive network-based fuzzy inference system (ANFIS) and biogeography-based optimization (BBO) and BAT algorithms (BA) , 2019 .

[78]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.