Application of the group method of data handling (GMDH) approach for landslide susceptibility zonation using readily available spatial covariates

Abstract Landslide susceptibility (LS) mapping is an essential tool for landslide risk assessment. This study aimed to provide a new approach with better performance for landslide mapping and adopting readily available variables. In addition, it investigates the capability of a state-of-the-art model developed using the group method of data handling (GMDH) to spatially model LS. Furthermore, hybridized models of GMDH were developed using different metaheuristic algorithms. The study area was the Bonghwa region of South Korea, for which an accurate landslide inventory dataset is available. We considered a total of 13 spatial covariates (altitude, slope, aspect, topographic wetness index, valley depth, plan curvature, profile curvature, distance from fault, distance from river, distance from road, land use, density of forest, and lithology were chosen as independent variables). Two benchmark models—random forest and boosted regression trees—were used to compare their results with the standalone GMDH and hybridized models. We compared model accuracy using the two most robust evaluation metrics, root mean square error (RMSE) and area under the receiver operating characteristic curve (AUROC). The validation results showed that hybridized models outperformed the standalone GMDH model. Moreover, the hybridized GMDH-PSO (AUC = 0.83, RMSE = 0.108), GMDH-IWO (AUC = 0.81, RMSE = 0.111), GMDH-BBO (AUC = 0.8; RMSE = 0.12), and GMDH-ICA (AUC = 0.8; RMSE = 0.117) had a better predictive performance than both RF and BRT. Therefore, the proposed approach could successfully produce landslide susceptibility maps using relatively few readily available variables and can be repeated in data-scarce regions.

[1]  Reza Shirmohammadi,et al.  Optimization of mixed refrigerant systems in low temperature applications by means of group method of data handling (GMDH) , 2015 .

[2]  T. Kavzoglu,et al.  Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression , 2014, Landslides.

[3]  Alessandro Beghi,et al.  A PSO-based algorithm for optimal multiple chiller systems operation , 2012 .

[4]  P. Reichenbach,et al.  A review of statistically-based landslide susceptibility models , 2018 .

[5]  Saro Lee,et al.  Landslide susceptibility mapping using random forest and boosted tree models in Pyeong-Chang, Korea , 2018 .

[6]  I. Marchesini,et al.  Effective surveyed area and its role in statistical landslide susceptibility assessments , 2018, Natural Hazards and Earth System Sciences.

[7]  E. Rotigliano,et al.  Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: application to the 2009 storm event in Messina (Sicily, southern Italy) , 2015, Natural Hazards.

[8]  P. Reichenbach,et al.  The Influence of Land Use Change on Landslide Susceptibility Zonation: The Briga Catchment Test Site (Messina, Italy) , 2013, Environmental Management.

[9]  A. K. Lysdahl,et al.  Modelling of shallow landslides with machine learning algorithms , 2021 .

[10]  Saro Lee,et al.  Application of data-driven evidential belief functions to landslide susceptibility mapping in Jinbu, Korea , 2013 .

[11]  Biswajeet Pradhan,et al.  Application of an evidential belief function model in landslide susceptibility mapping , 2012, Comput. Geosci..

[12]  Miaojing Shi,et al.  A Selective Biogeography-Based Optimizer Considering Resource Allocation for Large-Scale Global Optimization , 2019, Comput. Intell. Neurosci..

[13]  P. Martin Mai,et al.  Presenting logistic regression-based landslide susceptibility results , 2018, Engineering Geology.

[14]  Yung-Che Chen,et al.  Incorporation of dynamic segmented neutrophil-to-monocyte ratio with leukocyte count for sepsis risk stratification , 2019, Scientific Reports.

[15]  G. Pazuki,et al.  Modeling the Thermal Conductivity of Ionic Liquids and Ionanofluids Based on a Group Method of Data Handling and Modified Maxwell Model , 2015 .

[16]  R. Soeters,et al.  Use of Geomorphological Information in Indirect Landslide Susceptibility Assessment , 2003 .

[17]  Taskin Kavzoglu,et al.  Machine Learning Techniques in Landslide Susceptibility Mapping: A Survey and a Case Study , 2018, Landslides: Theory, Practice and Modelling.

[18]  Sandeep Kumar,et al.  PMT: New analytical framework for automated evaluation of geo-environmental modelling approaches. , 2019, The Science of the total environment.

[19]  B. Pradhan,et al.  Landslide susceptibility mapping at Al-Hasher area, Jizan (Saudi Arabia) using GIS-based frequency ratio and index of entropy models , 2015, Geosciences Journal.

[20]  Dieu Tien Bui,et al.  Meta optimization of an adaptive neuro-fuzzy inference system with grey wolf optimizer and biogeography-based optimization algorithms for spatial prediction of landslide susceptibility , 2019, CATENA.

[21]  G. N. Pillai,et al.  Prediction of landslide displacement with controlling factors using extreme learning adaptive neuro-fuzzy inference system (ELANFIS) , 2017, Appl. Soft Comput..

[22]  S. Mickovski,et al.  Hydrological effect of vegetation against rainfall-induced landslides , 2017 .

[23]  Slobodan B. Mickovski,et al.  Plant-soil reinforcement response under different soil hydrological regimes , 2017 .

[24]  R. Venkata Rao,et al.  Teaching-learning-based optimization: A novel method for constrained mechanical design optimization problems , 2011, Comput. Aided Des..

[25]  Fuqing Zhao,et al.  A two-stage differential biogeography-based optimization algorithm and its performance analysis , 2019, Expert Syst. Appl..

[26]  I. Ilia,et al.  Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map , 2016, Landslides.

[27]  Dimitrios Myronidis,et al.  Landslide susceptibility mapping based on landslide history and analytic hierarchy process (AHP) , 2016, Natural Hazards.

[28]  Yang Junfeng,et al.  A New Method on Software Reliability Prediction , 2013 .

[29]  K. Yin,et al.  Landslide susceptibility mapping based on self-organizing-map network and extreme learning machine , 2017 .

[30]  P. Reichenbach,et al.  Estimating the quality of landslide susceptibility models , 2006 .

[31]  A. Brenning,et al.  Integrating physical and empirical landslide susceptibility models using generalized additive models , 2011 .

[32]  Zhile Yang,et al.  Biogeography-Based Optimization: A 10-Year Review , 2017, IEEE Transactions on Emerging Topics in Computational Intelligence.

[33]  Chun Lu,et al.  An improved GA and a novel PSO-GA-based hybrid algorithm , 2005, Inf. Process. Lett..

[34]  Viktor P. Astakhov,et al.  Tool life testing in gundrilling: an application of the group method of data handling (GMDH) , 2005 .

[35]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[36]  Biswajeet Pradhan,et al.  Modeling landslide susceptibility in data-scarce environments using optimized data mining and statistical methods , 2018 .

[37]  Dan Simon,et al.  Biogeography-Based Optimization , 2022 .

[38]  Ping-Lang Yen,et al.  Engineering Applications of Intelligent Monitoring and Control 2014 , 2013 .

[39]  Majdi I. Radaideh,et al.  Analyzing nuclear reactor simulation data and uncertainty with the group method of data handling , 2020 .

[40]  Qiuhua Liang,et al.  A new depth-averaged model for flow-like landslides over complex terrains with curvatures and steep slopes , 2018 .

[41]  Mohanad S. Al-Musaylh,et al.  Two-phase particle swarm optimized-support vector regression hybrid model integrated with improved empirical mode decomposition with adaptive noise for multiple-horizon electricity demand forecasting , 2018 .

[42]  R. Venkata Rao,et al.  Review of applications of TLBO algorithm and a tutorial for beginners to solve the unconstrained and constrained optimization problems , 2016 .

[43]  Wei Chen,et al.  GIS-based evaluation of landslide susceptibility using hybrid computational intelligence models , 2020, CATENA.

[44]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[45]  Caro Lucas,et al.  Imperialist competitive algorithm: An algorithm for optimization inspired by imperialistic competition , 2007, 2007 IEEE Congress on Evolutionary Computation.

[46]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[47]  Saro Lee,et al.  Landslide susceptibility mapping using GIS and the weight-of-evidence model , 2004, Int. J. Geogr. Inf. Sci..

[48]  Yongquan Zhou,et al.  A Novel Differential Evolution Invasive Weed Optimization Algorithm for Solving Nonlinear Equations Systems , 2013, J. Appl. Math..

[49]  N. Park Using maximum entropy modeling for landslide susceptibility mapping with multiple geoenvironmental data sets , 2015, Environmental Earth Sciences.

[50]  A. Gandomi,et al.  Imperialist competitive algorithm combined with chaos for global optimization , 2012 .

[51]  Gholamreza Pazuki,et al.  Estimation of the viscosity of nine nanofluids using a hybrid GMDH-type neural network system , 2014 .

[52]  Qi Gao,et al.  A Novel Imperialist Competitive Algorithm for Scheme Configuration Rules Extraction of Product Service System , 2019 .

[53]  Christian Conoscenti,et al.  GIS analysis to assess landslide susceptibility in a fluvial basin of NW Sicily (Italy) , 2008 .

[54]  S. Keesstra,et al.  How can statistical and artificial intelligence approaches predict piping erosion susceptibility? , 2019, The Science of the total environment.

[55]  Moung-Jin Lee,et al.  Landslide Susceptibility Assessment Using an Optimized Group Method of Data Handling Model , 2020, ISPRS Int. J. Geo Inf..

[56]  Abdelaziz I. Hammouri A modified biogeography-based optimization algorithm with guided bed selection mechanism for patient admission scheduling problems , 2020, J. King Saud Univ. Comput. Inf. Sci..

[57]  Caro Lucas,et al.  A novel numerical optimization algorithm inspired from weed colonization , 2006, Ecol. Informatics.

[58]  A. Kornejady,et al.  Landslide susceptibility assessment using maximum entropy model with two different data sampling methods , 2017 .

[59]  M. S. Guettouche,et al.  GIS-based landslide susceptibility zonation using bivariate statistical and expert approaches in the city of Constantine (Northeast Algeria) , 2015, Bulletin of Engineering Geology and the Environment.

[60]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2015, Landslides.

[61]  Yudong Zhang,et al.  A Comprehensive Survey on Particle Swarm Optimization Algorithm and Its Applications , 2015 .

[62]  Jeffrey S. Kargel,et al.  The cost of rapid and haphazard urbanization: lessons learned from the Freetown landslide disaster , 2019, Landslides.

[63]  T. Kavzoglu,et al.  Selecting optimal conditioning factors in shallow translational landslide susceptibility mapping using genetic algorithm , 2015 .

[64]  Jin Xiao,et al.  Use of group method of data handling for transport energy demand modeling , 2017 .

[65]  Hamid Reza Pourghasemi,et al.  Erratum to: Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2016, Landslides.

[66]  E. Rotigliano,et al.  Prediction of debris-avalanches and -flows triggered by a tropical storm by using a stochastic approach: An application to the events occurred in Mocoa (Colombia) on 1 April 2017 , 2019, Geomorphology.

[67]  A. Whittaker,et al.  Lithological controls on hillslope sediment supply: insights from landslide activity and grain size distributions , 2018 .

[68]  Hamid Reza Pourghasemi,et al.  Investigating the effects of different landslide positioning techniques, landslide partitioning approaches, and presence-absence balances on landslide susceptibility mapping , 2020, CATENA.

[69]  Dieu Tien Bui,et al.  Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study , 2017 .

[70]  A. Trigila,et al.  Comparison of Logistic Regression and Random Forests techniques for shallow landslide susceptibility assessment in Giampilieri (NE Sicily, Italy) , 2015 .

[71]  J. Godt,et al.  Hillslope Hydrology and Stability , 2013 .

[72]  Á. Felicísimo,et al.  Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study , 2013, Landslides.

[73]  I. Ilia,et al.  Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi Perfection, Greece , 2016, Landslides.

[74]  B. Pradhan,et al.  Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling–Narayanghat road section in Nepal Himalaya , 2012, Natural Hazards.

[75]  Mahdi Yaghoobi,et al.  Improved invasive weed optimization algorithm (IWO) based on chaos theory for optimal design of PID controller , 2019, J. Comput. Des. Eng..

[76]  M. Najafzadeh,et al.  Group method of data handling to predict scour depth around vertical piles under regular waves , 2013 .

[77]  Hamed Mojallali,et al.  Solving nonlinear equations systems with a new approach based on invasive weed optimization algorithm and clustering , 2012, Swarm Evol. Comput..

[78]  Mehdi Mousavi,et al.  Imperialistic Competitive Algorithm: A metaheuristic algorithm for locating the critical slip surface in 2-Dimensional soil slopes , 2016 .

[79]  S. Kahla,et al.  Fuzzy-PSO controller design for maximum power point tracking in photovoltaic system , 2017 .

[80]  Xin Chen,et al.  A New Stochastic PSO Technique for Neural Network Training , 2006, ISNN.

[81]  H. Pourghasemi,et al.  Assessing and mapping multi-hazard risk susceptibility using a machine learning technique , 2020, Scientific Reports.

[82]  André Stumpf,et al.  Object-oriented mapping of landslides using Random Forests , 2011 .

[83]  Xin-She Yang,et al.  Optimization in Civil Engineering and Metaheuristic Algorithms: A Review of State-of-the-Art Developments , 2018, Computational Intelligence, Optimization and Inverse Problems with Applications in Engineering.

[84]  Anh Dinh,et al.  Blind, Cuff-less, Calibration-Free and Continuous Blood Pressure Estimation using Optimized Inductive Group Method of Data Handling , 2020, Biomed. Signal Process. Control..

[85]  Biswajeet Pradhan,et al.  Analysis and evaluation of landslide susceptibility: a review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016) , 2018, Arabian Journal of Geosciences.

[86]  R. Venkata Rao,et al.  Teaching–Learning-based Optimization Algorithm , 2016 .

[88]  Xiuping Zhang The Application of Imperialist Competitive Algorithm based on Chaos Theory in Perceptron Neural Network , 2012 .

[89]  P. Reichenbach,et al.  Probabilistic landslide hazard assessment at the basin scale , 2005 .

[90]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[91]  Luisa Liucci,et al.  The role of topography in the scaling distribution of landslide areas: A cellular automata modeling approach , 2017 .

[92]  Ali P. Yunus,et al.  Evaluating scale effects of topographic variables in landslide susceptibility models using GIS-based machine learning techniques , 2019, Scientific Reports.

[93]  Maghsud Solimanpur,et al.  Solving multi-objective portfolio optimization problem using invasive weed optimization , 2016, Swarm Evol. Comput..

[94]  Yuzhu Zhang,et al.  The effects of slope length and slope gradient on the size distributions of loess slides: Field observations and simulations , 2018 .

[95]  Claudia Meisina,et al.  The role of land use changes in the distribution of shallow landslides. , 2017, The Science of the total environment.

[96]  D. Bui,et al.  Uncertainties of prediction accuracy in shallow landslide modeling: Sample size and raster resolution , 2019, CATENA.

[97]  Xiang Feng,et al.  Improved Rao-Blackwellized Particle Filter by Particle Swarm Optimization , 2013, J. Appl. Math..

[98]  J. Seibert,et al.  Effects of DEM resolution on the calculation of topographical indices: TWI and its components , 2007 .

[99]  Jean-Philippe Malet,et al.  Automatic classification of endogenous landslide seismicity using the Random Forest supervised classifier , 2017 .

[100]  Rubini Mahalingam,et al.  Evaluation of landslide susceptibility mapping techniques using lidar-derived conditioning factors (Oregon case study) , 2016 .

[101]  K Salmalian,et al.  Multi-objective evolutionary optimization of polynomial neural networks for fatigue life modelling and prediction of unidirectional carbon-fibre-reinforced plastics composites , 2010 .

[102]  Haijun Wang,et al.  Delineating early warning zones in rapidly growing metropolitan areas by integrating a multiscale urban growth model with biogeography-based optimization , 2020 .

[103]  Zhanya Xu,et al.  Coupling logistic model tree and random subspace to predict the landslide susceptibility areas with considering the uncertainty of environmental features , 2019, Scientific Reports.

[104]  E. Rotigliano,et al.  Assessment of susceptibility to earth-flow landslide using logistic regression and multivariate adaptive regression splines: A case of the Belice River basin (western Sicily, Italy) , 2015 .

[105]  Hui Zhang,et al.  Integration of dynamic rainfall data with environmental factors to forecast debris flow using an improved GMDH model , 2013, Comput. Geosci..

[106]  C. Meynard,et al.  Predicting species distributions: a critical comparison of the most common statistical models using artificial species , 2007 .

[107]  Boris Schröder,et al.  How can statistical models help to determine driving factors of landslides , 2012 .

[108]  Hamid Reza Pourghasemi,et al.  Landslide Susceptibility Mapping Using GIS-Based Data Mining Algorithms , 2019, Water.

[109]  F. Smedt,et al.  Landslide susceptibility mapping using the weight of evidence method in the Tinau watershed, Nepal , 2012, Natural Hazards.

[110]  T. Rabczuk,et al.  Modeling heat capacity of ionic liquids using group method of data handling: A hybrid and structure-based approach , 2019, International Journal of Heat and Mass Transfer.

[111]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[112]  R. V. Rao,et al.  Teaching–learning-based optimization algorithm for unconstrained and constrained real-parameter optimization problems , 2012 .

[113]  Giovanni B. Crosta,et al.  Techniques for evaluating the performance of landslide susceptibility models , 2010 .

[114]  Mustafa Neamah Jebur,et al.  Optimization of landslide conditioning factors using very high-resolution airborne laser scanning (LiDAR) data at catchment scale , 2014 .

[115]  Alexander Brenning,et al.  Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling , 2015, Comput. Geosci..

[116]  M. Klose,et al.  Landslide cost modeling for transportation infrastructures: a methodological approach , 2015, Landslides.

[117]  Francesco Bucci,et al.  Landslide distribution and size in response to Quaternary fault activity: the Peloritani Range, NE Sicily, Italy , 2016 .

[118]  V. Moosavi,et al.  Development of hybrid wavelet packet-statistical models (WP-SM) for landslide susceptibility mapping , 2016, Landslides.

[119]  Jianzhou Wang,et al.  Short-term effects of air pollution on lower respiratory diseases and forecasting by the group method of data handling , 2012 .

[120]  Iman Nasiri Aghdam,et al.  A new hybrid model using Step-wise Weight Assessment Ratio Analysis (SWARA) technique and Adaptive Neuro-fuzzy Inference System (ANFIS) for regional landslide hazard assessment in Iran , 2015 .

[121]  Bahram Nasernejad,et al.  Prediction of partition coefficients of alkaloids in ionic liquids based aqueous biphasic systems using hybrid group method of data handling (GMDH) neural network , 2014 .

[122]  J. Zêzere,et al.  The role of the lithological setting on the landslide pattern and distribution , 2015 .

[123]  Deliang Sun,et al.  A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm , 2020, Geomorphology.

[124]  Seyed Amir Naghibi,et al.  Prioritization of landslide conditioning factors and its spatial modeling in Shangnan County, China using GIS-based data mining algorithms , 2018, Bulletin of Engineering Geology and the Environment.

[125]  Harish Garg,et al.  An efficient biogeography based optimization algorithm for solving reliability optimization problems , 2015, Swarm Evol. Comput..

[126]  Frank Graf,et al.  How does forest structure affect root reinforcement and susceptibility to shallow landslides? , 2016 .

[127]  S. Im,et al.  Modeling the contribution of trees to shallow landslide development in a steep, forested watershed , 2013 .

[128]  Huaglory Tianfield,et al.  Biogeography-based optimization with covariance matrix based migration , 2016, Appl. Soft Comput..

[129]  Thomas Glade,et al.  The influence of forest cover on landslide occurrence explored with spatio-temporal information , 2017 .

[130]  Slobodan B. Mickovski,et al.  Plant-Best: A novel plant selection tool for slope protection , 2017 .

[131]  Ayaz Isazadeh,et al.  Imperialist competitive algorithm for solving systems of nonlinear equations , 2013, Comput. Math. Appl..

[132]  I. Rodríguez‐Iturbe,et al.  Ecohydrology of Water-Controlled Ecosystems: Soil Moisture and Plant Dynamics , 2005 .

[133]  Magnus Johansson,et al.  Landslide risk management — A brief overview and example from Sweden of current situation and climate change , 2013 .

[134]  Xianyu Yu,et al.  A Combination of Geographically Weighted Regression, Particle Swarm Optimization and Support Vector Machine for Landslide Susceptibility Mapping: A Case Study at Wanzhou in the Three Gorges Area, China , 2016, International journal of environmental research and public health.