Spatial landslide susceptibility assessment using machine learning techniques assisted by additional data created with generative adversarial networks

Abstract In recent years, landslide susceptibility mapping has improved substantially with advances in machine learning. However, challenges remain because only limited inventory data are available. This paper presents a novel method that improves the performance of machine learning techniques by creating synthetic inventory data with Generative Adversarial Networks (GANs). An inventory of 156 landslide locations in Cameron Highlands, Malaysia, was compiled from previous projects the authors worked on. Following suggestions from previous work on Cameron Highlands, the study considers these geo-environmental factors: elevation, slope, aspect, plan curvature, profile curvature, total curvature, lithology, land use and land cover (LULC), distance to the road, distance to the river, stream power index (SPI), sediment transport index (STI), terrain roughness index (TRI), topographic wetness index (TWI) and vegetation density. To show the capability of GANs in improving landslide prediction models, the study tests the proposed GAN model with benchmark models, namely Artificial Neural Network (ANN), Support Vector Machine (SVM), Decision Trees (DT), Random Forest (RF) and Bagging ensemble models with ANN and SVM models. These models were validated using the area under the receiver operating characteristic curve (AUROC). Without additional samples, DT, RF, SVM, ANN and the Bagging ensemble achieved AUROC values of 0.90, 0.94, 0.86, 0.69 and 0.82 on the training set and 0.76, 0.81, 0.85, 0.72 and 0.75 on the test set, respectively. With the additional samples, the same models achieved AUROC values of 0.92, 0.94, 0.88, 0.75 and 0.84 on the training set and 0.78, 0.82, 0.82, 0.78 and 0.80 on the test set, respectively. The additional samples improved the test AUROC of every model except SVM.
These results suggest that, in data-scarce environments, using GANs to generate supplementary samples is promising because it can improve the predictive capability of common landslide prediction models.
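The reported test-set comparison can be checked with a few lines of arithmetic. The sketch below uses only the test AUROC values stated in the abstract. The variable and function names are illustrative assumptions and are not taken from the authors' code.

```python
# Test-set AUROC values as reported in the abstract (order: DT, RF, SVM, ANN, Bagging).
models = ["DT", "RF", "SVM", "ANN", "Bagging"]
test_baseline = [0.76, 0.81, 0.85, 0.72, 0.75]   # original inventory only
test_with_gan = [0.78, 0.82, 0.82, 0.78, 0.80]   # with GAN-generated samples


def auroc_changes(names, before, after):
    """Return the change in AUROC per model, rounded to two decimals."""
    return {n: round(a - b, 2) for n, b, a in zip(names, before, after)}


changes = auroc_changes(models, test_baseline, test_with_gan)
improved = [name for name, delta in changes.items() if delta > 0]

for name, delta in changes.items():
    print(f"{name}: {delta:+.2f}")
print("Improved:", ", ".join(improved))
```

On these figures the largest test gains are for ANN (+0.06) and the Bagging ensemble (+0.05), while SVM drops by 0.03.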
