Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: application to the 2009 storm event in Messina (Sicily, southern Italy)

Abstract This study aims to compare binary logistic regression (BLR) and stochastic gradient treeboost (SGT) methods in assessing landslide susceptibility within the Mediterranean region for multiple-occurrence regional landslide events. A test area was selected in the north-eastern sector of Sicily (southern Italy) where thousands of debris flows and debris avalanches triggered on the first October 2009 due to an extreme storm. Exploiting the same set of predictors and the 2009 event landslide archive, BLR- and SGT-based susceptibility models have been obtained for the two catchments separately, adopting a random partition (RP) technique for validation. In addition, the models trained in one catchment have been tested in predicting the landslide distribution in the second, adopting a spatial partition (SP)-based validation. The models produced high predictive performances with a general consistency between BLR and SGT in the susceptibility maps, predictor importance and role. In particular, SGT models reached a higher prediction performance with respect to BLR models for RP-modelling, while for the SP-based models, the difference in predictive skills dropped, converging to equally excellent performances. However, analysing the precision of the probability estimates, BLR produced more robust models around the mean value for each pixel, indicating possible overfitting effects, which affect decision trees to a greater extent. The assessment of the predictor roles allowed identifying the activation mechanisms which are primarily controlled by steep south-facing open slopes located near the coastal area. These slopes are characterised by low/middle altitude downhill from mountain tops, having a medium-grade metamorphic bedrock, under grassland and cultivated (terraced) uses.

[1]  K. Beven,et al.  A physically based, variable contributing area model of basin hydrology , 1979 .

[2]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[3]  David G. Tarboton,et al.  On the extraction of channel networks from digital elevation data , 1991 .

[4]  A. Ciampi Generalized regression trees , 1991 .

[5]  I. Moore,et al.  Digital terrain modelling: A review of hydrological, geomorphological, and biological applications , 1991 .

[6]  P. Burrough,et al.  Principles of geographical information systems , 1998 .

[7]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  John P. Wilson,et al.  Terrain analysis : principles and applications , 2000 .

[10]  G. De’ath,et al.  CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS , 2000 .

[11]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[12]  J. N. Hutchinson,et al.  A review of the classification of landslides of the flow type , 2001 .

[13]  Dan Steinberg,et al.  Stochastic Gradient Boosting: An Introduction to TreeNet™ , 2002, AusDM.

[14]  Andrea G. Fabbri,et al.  Validation of Spatial Prediction Models for Landslide Hazard Mapping , 2003 .

[15]  Peloritani continental crust composition (southern Italy); geological and petrochemical evidence , 2004 .

[16]  V. Doyuran,et al.  A comparison of the GIS based landslide susceptibility assessment methods: multivariate versus bivariate , 2004 .

[17]  P. Reichenbach,et al.  Probabilistic landslide hazard assessment at the basin scale , 2005 .

[18]  M. Jakob,et al.  Debris-flow Hazards and Related Phenomena , 2005 .

[19]  A. Prasad,et al.  Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction , 2006, Ecosystems.

[20]  M. Crozier Multiple-occurrence regional landslide events in New Zealand: Hazard management issues , 2005 .

[21]  E. Yesilnacar,et al.  Landslide susceptibility mapping : A comparison of logistic regression and neural networks methods in a medium scale study, Hendek Region (Turkey) , 2005 .

[22]  Ricco Rakotomalala,et al.  TANAGRA : un logiciel gratuit pour l'enseignement et la recherche , 2005, EGC.

[23]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[24]  Scott McDougall,et al.  Entrainment of material by debris flows , 2005 .

[25]  P. Reichenbach,et al.  Estimating the quality of landslide susceptibility models , 2006 .

[26]  T. Fernández,et al.  Evaluation and validation of landslide-susceptibility maps obtained by a GIS matrix method: examples from the Betic Cordillera (southern Spain) , 2007 .

[27]  M. Hutchinson,et al.  Digital terrain analysis. , 2008 .

[28]  H. A. Nefeslioglu,et al.  An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps , 2008 .

[29]  Domenico Guida,et al.  Typical source areas of May 1998 flow-like mass movements in the Campania region, Southern Italy , 2008 .

[30]  Chang-Jo Chung,et al.  On Blind Tests and Spatial Prediction Models , 2008 .

[31]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[32]  Şermin Tağil,et al.  GIS-Based Automated Landform Classification and Topographic, Landcover and Geologic Attributes of Landforms Around the Yazoren Polje, Turkey , 2008 .

[33]  Marco Borga,et al.  Analysing the influence of upslope bedrock outcrops on shallow landsliding , 2008 .

[34]  Mapping landslide susceptibility in The Faroe Islands , 2008 .

[35]  Yong Li,et al.  Relationships between debris flows and earth surface factors in Southwest China , 2008 .

[36]  S. Schnabel,et al.  Using and comparing two nonparametric methods (CART and MARS) to model the potential distribution of gullies , 2009 .

[37]  P. Reichenbach,et al.  Combined landslide inventory and susceptibility assessment based on different mapping units: an example from the Flemish Ardennes, Belgium , 2009 .

[38]  Giovanni B. Crosta,et al.  Techniques for evaluating the performance of landslide susceptibility models , 2010 .

[39]  Giuseppe Tito Aronica,et al.  Flash floods and debris flow in the city area of Messina, north-east part of Sicily, Italy in October 2009: the case of the Giampilieri catchment , 2010 .

[40]  P. Reichenbach,et al.  Optimal landslide susceptibility zonation based on multiple forecasts , 2010 .

[41]  M. Eeckhaut,et al.  Comparison of two landslide susceptibility assessments in the Champagne-Ardenne region (France). , 2010 .

[42]  A. Akgun A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir, Turkey , 2012, Landslides.

[43]  Christian Conoscenti,et al.  Exporting a Google Earth™ aided earth-flow susceptibility model: a test in central Sicily , 2012, Natural Hazards.

[44]  Christian Conoscenti,et al.  The role of the diagnostic areas in the assessment of landslide susceptibility models: a test in the sicilian chain , 2011 .

[45]  S. Reis,et al.  A GIS-based comparative study of frequency ratio, analytical hierarchy process, bivariate statistics , 2011 .

[46]  Peter Lehmann,et al.  Spatial statistical modeling of shallow landslides—Validating predictions for different landslide inventories and rainfall events , 2011 .

[47]  Fausto Guzzetti,et al.  Semi-automatic recognition and mapping of rainfall induced shallow landslides using optical satellite images , 2011 .

[48]  D. Bui,et al.  Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression , 2011 .

[49]  Neil C. Mitchell,et al.  Distribution and causes of landslides in the eastern Peloritani of NE Sicily and western Aspromonte of SW Calabria, Italy , 2011 .

[50]  T. Caloiero,et al.  A proposal for a methodological approach to the characterisation of Widespread Landslide Events: an application to Southern Italy , 2012 .

[51]  Aykut Akgün A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir, Turkey , 2012 .

[52]  Nicola Casagli,et al.  An integrated approach to the study of catastrophic debris-flows: geological hazard and human influence , 2012 .

[53]  G. Gigli,et al.  Landslide inventory map for the Briga and the Giampilieri catchments, NE Sicily, Italy , 2012 .

[54]  C. Irigaray,et al.  Factors selection in landslide susceptibility modelling on large scale following the gis matrix method: application to the river Beiro basin (Spain) , 2012 .

[55]  Boris Schröder,et al.  How can statistical models help to determine driving factors of landslides , 2012 .

[56]  Á. Felicísimo,et al.  Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study , 2013, Landslides.

[57]  C. Irigaray,et al.  Forward logistic regression for earth-flow landslide susceptibility assessment in the Platani river basin (southern Sicily, Italy) , 2014, Landslides.

[58]  A proposal for a methodological approach to the assessment of vulnerability , 2013 .

[59]  Salvatore Scudero,et al.  Landslide susceptibility assessment in the Peloritani Mts. (Sicily, Italy) and clues for tectonic control of relief processes , 2013 .

[60]  A. Brenning,et al.  Assessing the quality of landslide susceptibility maps – case study Lower Austria , 2014 .

[61]  P. Reichenbach,et al.  The Influence of Land Use Change on Landslide Susceptibility Zonation: The Briga Catchment Test Site (Messina, Italy) , 2013, Environmental Management.

[62]  E. Rotigliano,et al.  A test of transferability for landslides susceptibility models under extreme climatic events: application to the Messina 2009 disaster , 2014, Natural Hazards.

[63]  Biswajeet Pradhan,et al.  Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree , 2016, Landslides.

[64]  Alexander Brenning,et al.  Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling , 2015, Comput. Geosci..

[65]  J. Gonçalves,et al.  UAV photogrammetry for topographic monitoring of coastal areas , 2015 .