Using statistical learning algorithms in regional landslide susceptibility zonation with limited landslide field data

Regional Landslide Susceptibility Zonation (LSZ) is often constrained by the amount of available field data, especially in southwestern China, where large mountainous areas coincide with limited field information. Statistical learning algorithms are believed to be superior to traditional statistical algorithms because of their data adaptability. The aim of this paper is to evaluate how statistical learning algorithms perform on regional LSZ with limited field data. The focus is on three statistical learning algorithms: Logistic Regression (LR), Artificial Neural Networks (ANN) and Support Vector Machine (SVM). Hanzhong city, a landslide-prone area in southwestern China, is taken as the study case. Nine environmental factors are selected as inputs. The accuracy of the resulting LSZ maps is evaluated through landslide density analysis (LDA), receiver operating characteristic (ROC) curves and Kappa index statistics. The dependence of each algorithm on the number of field samples is examined by varying the size of the training set. SVM proved to be the most accurate and the most stable algorithm at small training set sizes and across all known landslide sizes. The accuracy of SVM increases steadily and reaches a high level at a small training set size, whereas the accuracies of LR and ANN fluctuate distinctly. Geomorphological interpretation confirms the strength of SVM across all landslide sizes. Our results show that the generalization capability and model robustness of SVM make it an appropriate and efficient tool for regional LSZ with limited landslide field samples.
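As an illustration of two of the accuracy measures named above, the following minimal sketch computes the area under the ROC curve and the Kappa index for a set of mapped cells. It is not the authors' implementation: the function names `roc_auc` and `cohen_kappa`, the toy labels and scores, and the 0.5 threshold are all assumptions made for the example, and the AUC is computed through the rank-based (Mann-Whitney) formulation rather than by tracing the curve.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via the Mann-Whitney formulation.

    Equals the probability that a randomly chosen landslide cell (label 1)
    receives a higher susceptibility score than a randomly chosen stable
    cell (label 0); ties count as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def cohen_kappa(labels: Sequence[int], predicted: Sequence[int]) -> float:
    """Cohen's Kappa: agreement between two labelings, corrected for chance."""
    labels, predicted = list(labels), list(predicted)
    n = len(labels)
    observed = sum(1 for y, p in zip(labels, predicted) if y == p) / n
    classes = set(labels) | set(predicted)
    # Chance agreement from the marginal class frequencies of both labelings.
    expected = sum((labels.count(c) / n) * (predicted.count(c) / n) for c in classes)
    if expected == 1.0:
        raise ValueError("Kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: 1 = mapped landslide cell, 0 = stable cell.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]  # assumed model susceptibility scores
predicted = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

auc = roc_auc(labels, scores)
kappa = cohen_kappa(labels, predicted)
print(f"AUC = {auc:.3f}, Kappa = {kappa:.3f}")
```

To mimic the training-set-size experiment described in the abstract, one could recompute both measures on a fixed validation set after fitting each model on progressively larger training subsets and then compare how smoothly the values rise.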
