Landslide Susceptibility Mapping Based on Weighted Gradient Boosting Decision Tree in Wanzhou Section of the Three Gorges Reservoir Area (China)

The main goal of this study is to produce a landslide susceptibility map in the Wanzhou section of the Three Gorges reservoir area (China) with a weighted gradient boosting decision tree (weighted GBDT) model. According to the current research on landslide susceptibility mapping (LSM), the GBDT method is rarely used in LSM. Furthermore, previous studies have rarely considered the imbalance of landslide samples and simply regarded the LSM problem as a binary classification problem. In this paper, we considered LSM as an imbalanced learning problem and obtained a better predictive model using the weighted GBDT method. The innovations of the article mainly include the following two points: introducing the GBDT model into the evaluation of landslide susceptibility; using the weighted GBDT method to deal with the problem of landslide sample imbalance. The logistic regression (LR) model and gradient boosting decision tree (GBDT) model were also used in the study to compare with the weighted GBDT model. Five kinds of data from different data source were used in the study: geology, topography, hydrology, land cover, and triggered factors (rainfall, earthquake, land use, etc.). Twenty nine environmental parameters and 233 landslides were used as input data. The receiver operating characteristic (ROC) curve, the area under the ROC curve (AUC) value, and the recall value were used to estimate the quality of the weighted GBDT model, the GBDT model, and the LR model. The results showed that the GBDT model and the weighted GBDT model had a higher AUC value (0.977, 0.976) than the LR model (0.845); the weighted GBDT model had a little higher AUC value (0.977) than the GBDT model (0.976); and the weighted GBDT model had a higher recall value (0.823) than the GBDT model (0.426) and the LR model (0.004). The weighted GBDT method could be considered to have the best performance considering the AUC value and the recall value in landslide susceptibility mapping dealing with imbalanced landslide data.

[1]  Thomas Stanley,et al.  A heuristic approach to global landslide susceptibility mapping , 2017, Natural Hazards.

[2]  Yang Yang,et al.  New method for landslide susceptibility mapping supported by spatial logistic regression and GeoDetector: A case study of Duwen Highway Basin, Sichuan Province, China , 2019, Geomorphology.

[3]  H. Pourghasemi,et al.  Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances , 2013, Natural Hazards.

[4]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[5]  A. Zhu,et al.  An expert knowledge-based approach to landslide susceptibility mapping using GIS and fuzzy logic , 2014 .

[6]  Veronica Tofani,et al.  GIS techniques for regional-scale landslide susceptibility assessment: the Sicily (Italy) case study , 2013, Int. J. Geogr. Inf. Sci..

[7]  B. Pradhan,et al.  GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks , 2016, Environmental Earth Sciences.

[8]  Wang Jiaji LANDSLIDE SUSCEPTIBILITY ASSESSMENT BASED ON GIS AND WEIGHTED INFORMATION VALUE:A CASE STUDY OF WANZHOU DISTRICT,THREE GORGES RESERVOIR , 2014 .

[9]  A. Ozdemir,et al.  A comparative study of frequency ratio, weights of evidence and logistic regression methods for landslide susceptibility mapping: Sultan Mountains, SW Turkey , 2013 .

[10]  Robert Gilmore Pontius,et al.  Recommendations for using the relative operating characteristic (ROC) , 2013, Landscape Ecology.

[11]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[12]  C. Gokceoğlu,et al.  Assessment of landslide susceptibility for a landslide-prone area (north of Yenice, NW Turkey) by fuzzy approach , 2002 .

[13]  A. Akgun A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir, Turkey , 2012, Landslides.

[14]  H. Shahabi,et al.  Remote sensing and GIS-based landslide susceptibility mapping using frequency ratio, logistic regression, and fuzzy logic methods at the central Zab basin, Iran , 2015, Environmental Earth Sciences.

[15]  Biswajeet Pradhan,et al.  Use of GIS-based fuzzy logic relations and its cross application to produce landslide susceptibility maps in three test areas in Malaysia , 2011 .

[16]  Francisco Herrera,et al.  Preprocessing noisy imbalanced datasets using SMOTE enhanced with fuzzy rough prototype selection , 2014, Appl. Soft Comput..

[17]  T. Kavzoglu,et al.  Assessment of shallow landslide susceptibility using artificial neural networks in Jabonosa River Basin, Venezuela , 2005 .

[18]  Karim Solaimani,et al.  Landslide susceptibility mapping based on frequency ratio and logistic regression models , 2013, Arabian Journal of Geosciences.

[19]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[20]  M. Arora,et al.  Landslide susceptibility zonation of the Chamoli region, Garhwal Himalayas, using logistic regression model , 2010 .

[21]  K. Yin,et al.  Landslide susceptibility mapping based on self-organizing-map network and extreme learning machine , 2017 .

[22]  L. Ermini,et al.  Artificial Neural Networks applied to landslide susceptibility assessment , 2005 .

[23]  Inge Revhaug,et al.  Optimization of Causative Factors for Landslide Susceptibility Evaluation Using Remote Sensing and GIS Data in Parts of Niigata, Japan , 2015, PloS one.

[24]  D. Petley Global patterns of loss of life from landslides , 2012 .

[25]  Paraskevas Tsangaratos,et al.  Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size , 2016 .

[26]  Wei Fangqiang LANDSLIDE HAZARD EVALUATION OF WANZHOU BASED ON GIS INFORMATION VALUE METHOD IN THE THREE GORGES RESERVOIR , 2006 .

[27]  Biswajeet Pradhan,et al.  Analysis and evaluation of landslide susceptibility: a review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016) , 2018, Arabian Journal of Geosciences.

[28]  Albert Y. Zomaya,et al.  Ensemble-Based Wrapper Methods for Feature Selection and Class Imbalance Learning , 2013, PAKDD.

[29]  Soyoung Park,et al.  Landslide susceptibility mapping using frequency ratio, analytic hierarchy process, logistic regression, and artificial neural network methods at the Inje area, Korea , 2013, Environmental Earth Sciences.

[30]  B. Pradhan Landslide susceptibility mapping of a catchment area using frequency ratio, fuzzy logic and multivariate logistic regression approaches , 2010 .

[31]  Ying He,et al.  MSMOTE: Improving Classification Performance When Training Data is Imbalanced , 2009, 2009 Second International Workshop on Computer Science and Engineering.

[32]  Shibiao Bai,et al.  GIS-based rare events logistic regression for landslide-susceptibility mapping of Lianyungang, China , 2011 .

[33]  Xiuping Jia,et al.  A comparison of information value and logistic regression models in landslide susceptibility mapping by using GIS , 2016, Environmental Earth Sciences.

[34]  J. Friedman Stochastic gradient boosting , 2002 .

[35]  P. Kayastha,et al.  Application of the analytical hierarchy process (AHP) for landslide susceptibility mapping: A case study from the Tinau watershed, west Nepal , 2013, Comput. Geosci..

[36]  Tri Dev Acharya,et al.  Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China) , 2018 .

[37]  Xianyu Yu,et al.  A Combination of Geographically Weighted Regression, Particle Swarm Optimization and Support Vector Machine for Landslide Susceptibility Mapping: A Case Study at Wanzhou in the Three Gorges Area, China , 2016, International journal of environmental research and public health.

[38]  Á. Felicísimo,et al.  Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study , 2013, Landslides.

[39]  P. Kayastha Application of fuzzy logic approach for landslide susceptibility mapping in Garuwa sub-basin, East Nepal , 2012, Frontiers of Earth Science.

[40]  Saro Lee,et al.  Application of logistic regression model and its validation for landslide susceptibility mapping using GIS and remote sensing data , 2005 .

[41]  Biswajeet Pradhan,et al.  Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree , 2016, Landslides.

[42]  Veronica Tofani,et al.  Landslide Susceptibility Mapping at National Scale: The Italian Case Study , 2013 .

[43]  X Yao,et al.  Application of two-class SVM applied in landslide susceptibility mapping , 2013 .

[44]  A. Stein,et al.  Landslide susceptibility assessment using logistic regression and its comparison with a rock mass classification system, along a road section in the northern Himalayas (India) , 2010 .

[45]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[46]  Biswajeet Pradhan,et al.  Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling , 2010, Environ. Model. Softw..

[47]  Chenghu Zhou,et al.  Mapping landslide susceptibility in the Three Gorges area, China using GIS, expert knowledge and fuzzy logic , 2004 .

[48]  A. Erener,et al.  Improvement of statistical landslide susceptibility mapping by using spatial and global regression methods in the case of More and Romsdal (Norway) , 2010 .

[49]  R. Gil Pontius,et al.  Land-cover change model validation by an ROC method for the Ipswich watershed, Massachusetts, USA , 2001 .

[50]  Xueling Wu,et al.  Landslide susceptibility mapping based on rough set theory and support vector machines: A case of the Three Gorges area, China , 2014 .