Shallow Landslide Prediction Using a Novel Hybrid Functional Machine Learning Algorithm

We used a novel hybrid functional machine learning algorithm to predict the spatial distribution of landslides in the Sarkhoon watershed, Iran. We developed a new ensemble model which is a combination of a functional algorithm, stochastic gradient descent (SGD) and an AdaBoost (AB) Meta classifier namely ABSGD model to predict the landslides. The model incorporates 20 landslide conditioning factors, which we ranked using the least-square support vector machine (LSSVM) technique. For the modeling, we considered 98 landslide locations, of which 70% (79) were Remote Sens. 2019, 11, 931; doi:10.3390/rs11080931 www.mdpi.com/journal/remotesensing Remote Sens. 2019, 11, 931 2 of 22 used for training and 30% (19) for validation processes. Model validation was performed using sensitivity, specificity, accuracy, the root mean square error (RMSE) and the area under the receiver operatic characteristic (AUC) curve. We also used soft computing benchmark models, including SGD, logistic regression (LR), logistic model tree (LMT) and functional tree (FT) algorithms for model validation and comparison. The selected conditioning factors were significant in landslide occurrence but distance to road was found to be the most important factor. The ABSGD model (AUC= 0.860) outperformed the LR (0.797), SGD (0.776), LMT (0.740) and FT (0.734) models. Our results confirm that the combined use of a functional algorithm and a Meta classifier prevents over-fitting, reduces noise and enhances the power prediction of the individual SGD algorithm for the spatial prediction of landslides.

[1]  Bahareh Kalantar,et al.  Performance Evaluation and Sensitivity Analysis of Expert-Based, Statistical, Machine Learning, and Hybrid Models for Producing Landslide Susceptibility Maps , 2017 .

[2]  Wei Chen,et al.  Landslide susceptibility assessment at the Wuning area, China: a comparison between multi-criteria decision making, bivariate statistical and machine learning methods , 2018, Natural Hazards.

[3]  P. Aleotti,et al.  Landslide hazard assessment: summary review and new perspectives , 1999 .

[4]  D. Bui,et al.  Hybrid Machine Learning Approaches for Landslide Susceptibility Modeling , 2019, Forests.

[5]  David West,et al.  Neural network ensemble strategies for financial decision applications , 2005, Comput. Oper. Res..

[6]  Biswajeet Pradhan,et al.  Novel Hybrid Integration Approach of Bagging-Based Fisher’s Linear Discriminant Function for Groundwater Potential Analysis , 2019, Natural Resources Research.

[7]  Saro Lee,et al.  Modelling gully-erosion susceptibility in a semi-arid region, Iran: Investigation of applicability of certainty factor and maximum entropy models. , 2019, The Science of the total environment.

[8]  Himan Shahabi,et al.  Hybrid artificial intelligence models based on a neuro-fuzzy system and metaheuristic optimization algorithms for spatial prediction of wildfire probability , 2019, Agricultural and Forest Meteorology.

[9]  Biswajeet Pradhan,et al.  Groundwater spring potential modelling: Comprising the capability and robustness of three different modeling approaches , 2018, Journal of Hydrology.

[10]  B. Pradhan,et al.  GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks , 2016, Environmental Earth Sciences.

[11]  B. Pradhan Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia , 2010 .

[12]  Shiuan Wan,et al.  Discrete rough set analysis of two different soil-behavior-induced landslides in National Shei-Pa Park, Taiwan , 2015 .

[13]  Majid Shadman Roodposhti,et al.  Fuzzy Shannon Entropy: A Hybrid GIS-Based Landslide Susceptibility Mapping Method , 2016, Entropy.

[14]  Wei Chen,et al.  Hybrid Integration Approach of Entropy with Logistic Regression and Support Vector Machine for Landslide Susceptibility Modeling , 2018, Entropy.

[15]  Iman Nasiri Aghdam,et al.  A new hybrid model using Step-wise Weight Assessment Ratio Analysis (SWARA) technique and Adaptive Neuro-fuzzy Inference System (ANFIS) for regional landslide hazard assessment in Iran , 2015 .

[16]  B. Pham,et al.  A comparative study of sequential minimal optimization-based support vector machines, vote feature intervals, and logistic regression in landslide susceptibility assessment using GIS , 2017, Environmental Earth Sciences.

[17]  B. Pradhan,et al.  Application of frequency ratio, statistical index, and weights-of-evidence models and their comparison in landslide susceptibility mapping in Central Nepal Himalaya , 2014, Arabian Journal of Geosciences.

[18]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[19]  Lei Wang,et al.  AdaBoost with SVM-based component classifiers , 2008, Eng. Appl. Artif. Intell..

[20]  W. Gardner Learning characteristics of stochastic-gradient-descent algorithms: A general study, analysis, and critique , 1984 .

[21]  Wei Chen,et al.  Spatial prediction of landslide susceptibility using an adaptive neuro-fuzzy inference system combined with frequency ratio, generalized additive model, and support vector machine techniques , 2017, Geomorphology.

[22]  Jung Hyun Lee,et al.  A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping , 2014 .

[23]  Himan Shahabi,et al.  A novel hybrid approach of Bayesian Logistic Regression and its ensembles for landslide susceptibility assessment , 2018, Geocarto International.

[24]  Binh Thai Pham,et al.  Evaluation of predictive ability of support vector machines and naive Bayes trees methods for spatial prediction of landslides in Uttarakhand state (India) using GIS , 2016 .

[25]  Lin Ma,et al.  Empirical analysis of support vector machine ensemble classifiers , 2009, Expert Syst. Appl..

[26]  Johan A. K. Suykens,et al.  Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.

[27]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.

[28]  Birgit Kleinschmit,et al.  Evaluation of Remote-Sensing-Based Landslide Inventories for Hazard Assessment in Southern Kyrgyzstan , 2017, Remote. Sens..

[29]  D. Bui,et al.  Uncertainties of prediction accuracy in shallow landslide modeling: Sample size and raster resolution , 2019, CATENA.

[30]  Fuan Tsai,et al.  Analysis of topographic and vegetative factors with data mining for landslide verification , 2013 .

[31]  J. Malet,et al.  Recommendations for the quantitative analysis of landslide risk , 2013, Bulletin of Engineering Geology and the Environment.

[32]  B. Pradhan,et al.  A novel hybrid evidential belief function-based fuzzy logic model in spatial prediction of rainfall-induced shallow landslides in the Lang Son city area (Vietnam) , 2015 .

[33]  Tri Dev Acharya,et al.  Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China) , 2018 .

[34]  P. Santi,et al.  Debris flows and their toll on human life: a global analysis of debris-flow fatalities from 1950 to 2011 , 2014, Natural Hazards.

[35]  M. Aniya,et al.  Landslide hazard mapping and its evaluation using GIS: an investigation of sampling schemes for a grid-cell based quantitative method. , 2000 .

[36]  P. Reichenbach,et al.  Estimating the quality of landslide susceptibility models , 2006 .

[37]  Abbas Alimohammadi,et al.  A GIS-based neuro-fuzzy procedure for integrating knowledge and data in landslide susceptibility mapping , 2010, Comput. Geosci..

[38]  B. Pradhan,et al.  Landslide Susceptibility Mapping Along the National Road 32 of Vietnam Using GIS-Based J48 Decision Tree Classifier and Its Ensembles , 2014 .

[39]  F. Smedt,et al.  Landslide susceptibility mapping using the weight of evidence method in the Tinau watershed, Nepal , 2012, Natural Hazards.

[40]  D. Bui,et al.  Spatial prediction of landslides using a hybrid machine learning approach based on Random Subspace and Classification and Regression Trees , 2018 .

[41]  D. Bui,et al.  Shallow landslide susceptibility assessment using a novel hybrid intelligence approach , 2017, Environmental Earth Sciences.

[42]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[43]  Francisco Gutiérrez,et al.  Sinkhole susceptibility mapping: A comparison between Bayes‐based machine learning algorithms , 2019, Land Degradation & Development.

[44]  Wei Chen,et al.  A novel ensemble approach of bivariate statistical-based logistic model tree classifier for landslide susceptibility assessment , 2018 .

[45]  Biswajeet Pradhan,et al.  A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS , 2013, Comput. Geosci..

[46]  Wei Chen,et al.  Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility , 2019, CATENA.

[47]  K. Moffett,et al.  Remote Sens , 2015 .

[48]  B. Roe,et al.  Boosted decision trees as an alternative to artificial neural networks for particle identification , 2004, physics/0408124.

[49]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[50]  Kyoji Sasa,et al.  Landslides : disaster risk reduction , 2009 .

[51]  Himan Shahabi,et al.  Landslide spatial modelling using novel bivariate statistical based Naïve Bayes, RBF Classifier, and RBF Network machine learning algorithms. , 2019, The Science of the total environment.

[52]  Biswajeet Pradhan,et al.  Spatial prediction of landslide hazards in Hoa Binh province (Vietnam): a comparative assessment of , 2012 .

[53]  P. Reichenbach,et al.  A review of statistically-based landslide susceptibility models , 2018 .

[54]  Seyed Amir Naghibi,et al.  Prioritization of landslide conditioning factors and its spatial modeling in Shangnan County, China using GIS-based data mining algorithms , 2018, Bulletin of Engineering Geology and the Environment.

[55]  A. Zhu,et al.  GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method , 2018 .

[56]  P. Peduzzi,et al.  Global landslide and avalanche hotspots , 2006 .

[57]  Veronica Tofani,et al.  Landslide susceptibility estimation by random forests technique: sensitivity and scaling issues , 2013 .

[58]  A-Xing Zhu,et al.  Landslide susceptibility modelling using GIS-based machine learning techniques for Chongren County, Jiangxi Province, China. , 2018, The Science of the total environment.

[59]  William J. Elliot,et al.  Spatial Prediction of Landslide Hazard Using Logistic Regression and ROC Analysis , 2006, Trans. GIS.

[60]  Taskin Kavzoglu,et al.  Investigation of automatic feature weighting methods (Fisher, Chi-square and Relief-F) for landslide susceptibility mapping , 2017 .

[61]  Biswajeet Pradhan,et al.  Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree , 2016, Landslides.

[62]  Dieu Tien Bui,et al.  Meta optimization of an adaptive neuro-fuzzy inference system with grey wolf optimizer and biogeography-based optimization algorithms for spatial prediction of landslide susceptibility , 2019, CATENA.

[63]  Soyoung Park,et al.  Landslide susceptibility mapping using frequency ratio, analytic hierarchy process, logistic regression, and artificial neural network methods at the Inje area, Korea , 2013, Environmental Earth Sciences.

[64]  Zhong Lu,et al.  Remote Sensing of Landslides - A Review , 2018, Remote. Sens..

[65]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[66]  H. Shahabi,et al.  Evaluation and comparison of bivariate and multivariate statistical methods for landslide susceptibility mapping (case study: Zab basin) , 2013, Arabian Journal of Geosciences.

[67]  Yoshua Bengio,et al.  Boosting Neural Networks , 2000, Neural Computation.

[68]  H. Shahabi,et al.  Novel forecasting approaches using combination of machine learning and statistical models for flood susceptibility mapping. , 2018, Journal of environmental management.

[69]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[70]  Wei Chen,et al.  Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling. , 2018, The Science of the total environment.

[71]  B. Pham,et al.  A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. , 2018, The Science of the total environment.

[72]  A. Zhu,et al.  A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China) , 2017, Environmental Earth Sciences.

[73]  M. Berberian,et al.  Towards a paleogeography and tectonic evolution of Iran: Reply , 1981 .

[74]  Nhat-Duc Hoang,et al.  A Novel Integrated Approach of Relevance Vector Machine Optimized by Imperialist Competitive Algorithm for Spatial Modeling of Shallow Landslides , 2018, Remote. Sens..

[75]  Santiago Beguería,et al.  Validation and Evaluation of Predictive Models in Hazard Assessment and Risk Management , 2006 .

[76]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[77]  L. Ermini,et al.  Artificial Neural Networks applied to landslide susceptibility assessment , 2005 .

[78]  B. Pradhan,et al.  Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models , 2007 .

[79]  D. Bui,et al.  Landslide susceptibility modeling using Reduced Error Pruning Trees and different ensemble techniques: Hybrid machine learning approaches , 2019, CATENA.

[80]  A. Zhu,et al.  Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling , 2018, Bulletin of Engineering Geology and the Environment.

[81]  George L. W. Perry,et al.  Identifying the controls on coastal cliff landslides using machine-learning approaches , 2016, Environ. Model. Softw..

[82]  A. Brenning,et al.  Integrating physical and empirical landslide susceptibility models using generalized additive models , 2011 .

[83]  D. Bui,et al.  A Novel Hybrid Model of Rotation Forest Based Functional Trees for Landslide Susceptibility Mapping: A Case Study at Kon Tum Province, Vietnam , 2017 .

[84]  C. Gokceoğlu,et al.  Use of fuzzy relations to produce landslide susceptibility map of a landslide prone area (West Black Sea Region, Turkey) , 2004 .

[85]  Wei Chen,et al.  Land Subsidence Susceptibility Mapping in South Korea Using Machine Learning Algorithms , 2018, Sensors.

[86]  Yang Hong,et al.  Landslides Susceptibility Mapping in Oklahoma State Using GIS-Based Weighted Linear Combination Method , 2014 .

[87]  A. Yalçın GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): Comparisons of results and confirmations , 2008 .

[88]  Sailesh Samanta,et al.  Landslide vulnerability mapping (LVM) using weighted linear combination (WLC) model through remote sensing and GIS techniques , 2016, Modeling Earth Systems and Environment.

[89]  Iman Nasiri Aghdam,et al.  Landslide susceptibility mapping using an ensemble statistical index (Wi) and adaptive neuro-fuzzy inference system (ANFIS) model at Alborz Mountains (Iran) , 2016, Environmental Earth Sciences.

[90]  Haijun Wang,et al.  Application of kernel-based Fisher discriminant analysis to map landslide susceptibility in the Qinggan River delta, Three Gorges, China , 2012 .

[91]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[92]  Biswajeet Pradhan,et al.  Multi-Criteria Decision Making (MCDM) Model for Seismic Vulnerability Assessment (SVA) of Urban Residential Buildings , 2018, ISPRS Int. J. Geo Inf..

[93]  Saro Lee,et al.  Combining landslide susceptibility maps obtained from frequency ratio, logistic regression, and artificial neural network models using ASTER images and GIS , 2012 .

[94]  Veronica Tofani,et al.  Landslide Susceptibility Mapping at National Scale: The Italian Case Study , 2013 .

[95]  Hossein Shafizadeh-Moghadam,et al.  Big data in Geohazard; pattern mining and large scale analysis of landslides in Iran , 2018, Earth Science Informatics.

[96]  Bernhard Schölkopf,et al.  The connection between regularization operators and support vector kernels , 1998, Neural Networks.

[97]  Xiaojing Wang,et al.  Landslide Susceptibility Modeling Based on GIS and Novel Bagging-Based Kernel Logistic Regression , 2018, Applied Sciences.

[98]  Esmaeil Alizadeh,et al.  THE EFFECTS OF RANGE MANAGEMENT PLANS OF SOIL PROPERTIES AND RANGELANDS VEGETATION (CASE STUDY: ESHTEHARD RANGELANDS) , 2012 .

[99]  Mazlan Hashim,et al.  Landslide susceptibility mapping using GIS-based statistical models and Remote sensing data in tropical environment , 2015, Scientific Reports.

[100]  S. Bai,et al.  GIS-based logistic regression for landslide susceptibility mapping of the Zhongxian segment in the Three Gorges area, China , 2010 .

[101]  E. Rotigliano,et al.  Assessment of susceptibility to earth-flow landslide using logistic regression and multivariate adaptive regression splines: A case of the Belice River basin (western Sicily, Italy) , 2015 .

[102]  B. Pham,et al.  Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods , 2017, Theoretical and Applied Climatology.

[103]  Hui Li,et al.  AdaBoost ensemble for financial distress prediction: An empirical comparison with data from Chinese listed companies , 2011, Expert Syst. Appl..

[104]  Wei Chen,et al.  Landslide Detection and Susceptibility Mapping by AIRSAR Data Using Support Vector Machine and Index of Entropy Models in Cameron Highlands, Malaysia , 2018, Remote. Sens..

[105]  D. Petley Global patterns of loss of life from landslides , 2012 .

[106]  B. Neuhäuser,et al.  GIS-based assessment of landslide susceptibility on the base of the Weights-of-Evidence model , 2012, Landslides.

[107]  Biswajeet Pradhan,et al.  A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India) , 2016, Environ. Model. Softw..

[108]  Wei Chen,et al.  Applying Information Theory and GIS-based quantitative methods to produce landslide susceptibility maps in Nancheng County, China , 2017, Landslides.

[109]  K. Solaimani,et al.  Rock fall susceptibility assessment along a mountainous road: an evaluation of bivariate statistic, analytical hierarchy process and frequency ratio , 2017, Environmental Earth Sciences.

[110]  Bin Li,et al.  Landslide Identification and Monitoring along the Jinsha River Catchment (Wudongde Reservoir Area), China, Using the InSAR Method , 2018, Remote. Sens..

[111]  A. Akgun A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir, Turkey , 2012, Landslides.