A Meta-Learning Approach of Optimisation for Spatial Prediction of Landslides

Optimisation plays a key role in the application of machine learning in the spatial prediction of landslides. The common practice in optimising landslide prediction models is to search for optimal/suboptimal hyperparameter values in a number of predetermined hyperparameter configurations based on an objective function, i.e., k-fold cross-validation accuracy. However, the overhead of hyperparameter optimisation can be prohibitive, especially for computationally expensive algorithms. This paper introduces an optimisation approach based on meta-learning for the spatial prediction of landslides. The proposed approach is tested in a dense tropical forested area of Cameron Highlands, Malaysia. Instead of optimising prediction models with a large number of hyperparameter configurations, the proposed approach begins with promising configurations based on several basic and statistical meta-features. The proposed meta-learning approach was tested based on Bayesian optimisation as a hyperparameter tuning algorithm and random forest (RF) as a prediction model. The spatial database was established with a total of 63 historical landslides and 15 conditioning factors. Three RF models were constructed based on (1) default parameters as suggested by the sklearn library, (2) parameters suggested by the Bayesian optimisation (BO), and (3) parameters suggested by the proposed meta-learning approach (BO-ML). Based on five-fold cross-validation accuracy, the Bayesian method achieved the best performance for both the training (0.810) and test (0.802) datasets. The meta-learning approach achieved slightly lower accuracies than the Bayesian method for the training (0.769) and test (0.800) datasets. Similarly, based on F1-score and area under the receiving operating characteristic curves (AUROC), the models with optimised parameters either by the Bayesian or meta-learning methods produced more accurate landslide susceptibility assessment than the model with the default parameters. In the present approach, instead of learning from scratch, the meta-learning would begin with hyperparameter configurations optimal for the most similar previous datasets, which can be considerably helpful and time-saving for landslide modelings.

[1]  M. Ramírez-Herrera,et al.  The importance of input data on landslide susceptibility mapping , 2021, Scientific Reports.

[2]  Yu Huang,et al.  Review on landslide susceptibility mapping using support vector machines , 2018, CATENA.

[3]  Helmi Zulhaidi Mohd Shafri,et al.  Optimized Neural Architecture for Automatic Landslide Detection from High‐Resolution Airborne Laser Scanning Data , 2017 .

[4]  Alexander Brenning,et al.  Performance evaluation and hyperparameter tuning of statistical and machine-learning models using spatial data , 2018, Ecological Modelling.

[5]  Nadhir Al-Ansari,et al.  Landslide Detection and Susceptibility Modeling on Cameron Highlands (Malaysia): A Comparison between Random Forest, Logistic Regression and Logistic Model Tree Algorithms , 2020, Forests.

[6]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[7]  J. Harris,et al.  Comparison of the Data-Driven Random Forests Model and a Knowledge-Driven Method for Mineral Prospectivity Mapping: A Case Study for Gold Deposits Around the Huritz Group and Nueltin Suite, Nunavut, Canada , 2016, Natural Resources Research.

[8]  Gheorghe Tecuci,et al.  Landslide susceptibility analyses using Random Forest, C4.5, and C5.0 with balanced and unbalanced datasets , 2021 .

[9]  Biswajeet Pradhan,et al.  Spatial landslide susceptibility assessment using machine learning techniques assisted by additional data created with generative adversarial networks , 2021, Geoscience Frontiers.

[10]  Hamid Reza Pourghasemi,et al.  Landslide susceptibility modeling applying machine learning methods: A case study from Longju in the Three Gorges Reservoir area, China , 2018, Comput. Geosci..

[11]  P. Reichenbach,et al.  A review of statistically-based landslide susceptibility models , 2018 .

[12]  Y. Bashan,et al.  Microbial populations and activities in the rhizoplane of rock-weathering desert plants. I. Root colonization and weathering of igneous rocks. , 2004, Plant biology.

[13]  Zhong Lu,et al.  Remote Sensing of Landslides - A Review , 2018, Remote. Sens..

[14]  Biswajeet Pradhan,et al.  Application of convolutional neural networks featuring Bayesian optimization for landslide susceptibility assessment , 2020, CATENA.

[15]  C. Tucker Red and photographic infrared linear combinations for monitoring vegetation , 1979 .

[16]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2015, Landslides.

[17]  T. Kavzoglu,et al.  Selecting optimal conditioning factors in shallow translational landslide susceptibility mapping using genetic algorithm , 2015 .

[18]  Zhou Zhao,et al.  A Comparative Study of Landslide Susceptibility Mapping Using SVM and PSO-SVM Models Based on Grid and Slope Units , 2021, Mathematical Problems in Engineering.

[19]  Kounghoon Nam,et al.  An extreme rainfall-induced landslide susceptibility assessment using autoencoder combined with random forest in Shimane Prefecture, Japan , 2020 .

[20]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[21]  P. Behnia,et al.  Landslide susceptibility modelling using the quantitative random forest method along the northern portion of the Yukon Alaska Highway Corridor, Canada , 2018, Natural Hazards.

[22]  Ghassan Beydoun,et al.  A New Integrated Approach for Landslide Data Balancing and Spatial Prediction Based on Generative Adversarial Networks (GAN) , 2021, Remote. Sens..

[23]  Deliang Sun,et al.  A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm , 2020, Geomorphology.

[24]  Ismail Colkesen,et al.  Developing comprehensive geocomputation tools for landslide susceptibility mapping: LSM tool pack , 2020, Comput. Geosci..

[25]  D. Petley,et al.  Global fatal landslide occurrence from 2004 to 2016 , 2018, Natural Hazards and Earth System Sciences.

[26]  G. Yuliyanto,et al.  VULNERABILITY ASSESSMENT IN LANDSLIDE RISK ANALYSIS , 2020 .

[27]  Edesio Alcobaça,et al.  MFE: Towards reproducible meta-feature extraction , 2020, J. Mach. Learn. Res..

[28]  Lukáš Karell,et al.  Applicability of Support Vector Machines in Landslide Susceptibility Mapping , 2017 .

[29]  Yanli Wu,et al.  Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping , 2020 .

[30]  Thomas Lengauer,et al.  Classification with correlated features: unreliability of feature ranking and solutions , 2011, Bioinform..

[31]  Yu Liu,et al.  Application of Optimized Parameters SVM in Deformation Prediction of Creep Landslide Tunnel , 2014 .

[32]  Lars Kotthoff,et al.  Auto-WEKA 2.0: Automatic model selection and hyperparameter optimization in WEKA , 2017, J. Mach. Learn. Res..

[33]  Tara G Martin,et al.  A guide to eliciting and using expert knowledge in Bayesian ecological models. , 2010, Ecology letters.

[34]  Min-Hao Wu,et al.  Evaluating triggering and causative factors of landslides in Lawnon River Basin, Taiwan , 2011 .

[35]  A. Prasad,et al.  Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction , 2006, Ecosystems.

[36]  Ameet Talwalkar,et al.  Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization , 2016, J. Mach. Learn. Res..

[37]  Luis F. Robledo,et al.  Landslide hazard assessment based on Bayesian optimization–support vector machine in Nanping City, China , 2021, Natural Hazards.

[38]  Saro Lee,et al.  Application of Ensemble-Based Machine Learning Models to Landslide Susceptibility Mapping , 2018, Remote. Sens..

[39]  Giovanni B. Crosta,et al.  Techniques for evaluating the performance of landslide susceptibility models , 2010 .

[40]  Mustafa Neamah Jebur,et al.  Optimization of landslide conditioning factors using very high-resolution airborne laser scanning (LiDAR) data at catchment scale , 2014 .

[41]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[42]  Eduardo C. Garrido-Merchán,et al.  Dealing with Categorical and Integer-valued Variables in Bayesian Optimization with Gaussian Processes , 2017, Neurocomputing.

[43]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[44]  Bahareh Kalantar,et al.  Landslide Susceptibility Modeling: An Integrated Novel Method Based on Machine Learning Feature Transformation , 2021, Remote. Sens..

[45]  M. Wiesmeier,et al.  Digital mapping of soil organic matter stocks using Random Forest modeling in a semi-arid steppe ecosystem , 2011, Plant and Soil.

[46]  Bjoern H. Menze,et al.  A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data , 2009, BMC Bioinformatics.

[47]  A. K. Lysdahl,et al.  Modelling of shallow landslides with machine learning algorithms , 2021 .

[48]  Nayyer Saleem,et al.  Parameters Derived from and/or Used with Digital Elevation Models (DEMs) for Landslide Susceptibility Mapping and Landslide Risk Assessment: A Review , 2019, ISPRS Int. J. Geo Inf..

[49]  Cheng-Chien Liu,et al.  A New Approach Using AHP to Generate Landslide Susceptibility Maps in the Chen-Yu-Lan Watershed, Taiwan , 2019, Sensors.

[50]  A. Zhu,et al.  Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping , 2019, CATENA.

[51]  M. Hutchinson,et al.  Digital terrain analysis. , 2008 .

[52]  Yichen Zhang,et al.  Rainfall Induced Landslide Susceptibility Mapping Based on Bayesian Optimized Random Forest and Gradient Boosting Decision Tree Models—A Case Study of Shuicheng County, China , 2020, Water.

[53]  Deliang Sun,et al.  Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: A comparison between logistic regression and random forest , 2020, Engineering Geology.

[54]  Joaquin Vanschoren,et al.  Meta-Learning: A Survey , 2018, Automated Machine Learning.

[55]  Mikhail Kanevski,et al.  Machine Learning Feature Selection Methods for Landslide Susceptibility Mapping , 2013, Mathematical Geosciences.

[56]  Matteo Matteucci,et al.  Evaluation of prediction capability, robustness, and sensitivity in non-linear landslide susceptibility models, Guantánamo, Cuba , 2011, Comput. Geosci..

[57]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[58]  Manoj K. Arora,et al.  A comparative study of conventional, ANN black box, fuzzy and combined neural and fuzzy weighting procedures for landslide susceptibility zonation in Darjeeling Himalayas , 2006 .

[59]  Bolin Huang,et al.  A successful case of emergency landslide response - the Sept. 2, 2014, Shanshucao landslide, Three Gorges Reservoir, China , 2015, Geoenvironmental Disasters.

[60]  A. Sevtap Selcuk-Kestel,et al.  Analysis of training sample selection strategies for regression-based quantitative landslide susceptibility mapping methods , 2017, Comput. Geosci..

[61]  Ali P. Yunus,et al.  Evaluating scale effects of topographic variables in landslide susceptibility models using GIS-based machine learning techniques , 2019, Scientific Reports.

[62]  Qing Zhu,et al.  Deep Fusion of Local and Non-Local Features for Precision Landslide Recognition , 2020, ArXiv.

[63]  Yoshua Bengio,et al.  Algorithms for Hyper-Parameter Optimization , 2011, NIPS.

[64]  Ali P. Yunus,et al.  Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance , 2020 .

[65]  Nhat-Duc Hoang,et al.  A Novel GIS-Based Random Forest Machine Algorithm for the Spatial Prediction of Shallow Landslide Susceptibility , 2020 .

[66]  Taskin Kavzoglu,et al.  Machine Learning Techniques in Landslide Susceptibility Mapping: A Survey and a Case Study , 2018, Landslides: Theory, Practice and Modelling.

[67]  D. Siev,et al.  Methods comparison , 2014, Journal of veterinary diagnostic investigation : official publication of the American Association of Veterinary Laboratory Diagnosticians, Inc.

[68]  M. Ercanoglu under a Creative Commons License. Natural Hazards and Earth System Sciences Landslide susceptibility assessment of SE Bartin (West Black Sea , 2022 .

[69]  Mengliang Yu,et al.  Mine landslide susceptibility assessment using IVM, ANN and SVM models considering the contribution of affecting factors , 2019, PloS one.

[70]  E. Rotigliano,et al.  Exploring the effect of absence selection on landslide susceptibility models: A case study in Sicily, Italy , 2016 .