Spatial statistical modeling of shallow landslides—Validating predictions for different landslide inventories and rainfall events

Abstract Statistical models that exploit the correlation between landslide occurrence and geomorphic properties are often used to map the spatial occurrence of shallow landslides triggered by heavy rainfalls. In many landslide susceptibility studies, the true predictive power of the statistical model remains unknown because the predictions are not validated with independent data from other events or areas. This study validates statistical susceptibility predictions with independent test data. The spatial incidence of landslides, triggered by an extreme rainfall in a study area, was modeled by logistic regression. The fitted model was then used to generate susceptibility maps for another three study areas, for which event-based landslide inventories were also available. All the study areas lie in the northern foothills of the Swiss Alps. The landslides had been triggered by heavy rainfall either in 2002 or 2005. The validation was designed such that the first validation study area shared the geomorphology and the second the triggering rainfall event with the calibration study area. For the third validation study area, both geomorphology and rainfall were different. All explanatory variables were extracted for the logistic regression analysis from high-resolution digital elevation and surface models (2.5 m grid). The model fitted to the calibration data comprised four explanatory variables: (i) slope angle (effect of gravitational driving forces), (ii) vegetation type (grassland and forest; root reinforcement), (iii) planform curvature (convergent water flow paths), and (iv) contributing area (potential supply of water). The area under the Receiver Operating Characteristic (ROC) curve ( AUC ) was used to quantify the predictive performance of the logistic regression model. The AUC values were computed for the susceptibility maps of the three validation study areas (validation AUC ), the fitted susceptibility map of the calibration study area (apparent AUC : 0.80) and another susceptibility map obtained for the calibration study area by 20-fold cross-validation (cross-validation AUC : 0.74). The AUC values of the first and second validation study areas (0.72 and 0.69, respectively) and the cross-validation AUC matched fairly well, and all AUC values were distinctly smaller than the apparent AUC . Based on the apparent AUC one would have clearly overrated the predictive performance for the first two validation areas. Rather surprisingly, the AUC value of the third validation study area (0.82) was larger than the apparent AUC . A large part of the third validation study area consists of gentle slopes, and the regression model correctly predicted that no landslides occur in the flat parts. This increased the predictive performance of the model considerably. The predicted susceptibility maps were further validated by summing the predicted susceptibilities for the entire validation areas and by comparing the sums with the observed number of landslides. The sums exceeded the observed counts for all the validation areas. Hence, the logistic regression model generally over-estimated the risk of landslide occurrence. Obviously, a predictive model that is based on static geomorphic properties alone cannot take a full account of the complex and time dependent processes in the subsurface. However, such a model is still capable of distinguishing zones highly or less prone to shallow landslides.

[1]  R L Shuster,et al.  Landslides: Analysis and Control , 1978 .

[2]  R. Sidle,et al.  Landslides: Processes, Prediction, and Land Use , 2006 .

[3]  Toni P Labhart,et al.  Geologie der Schweiz , 1982 .

[4]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[5]  S. L. Kuriakose,et al.  Spatial data for landslide susceptibility, hazard, and vulnerability assessment: An overview , 2008 .

[6]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[7]  V. Doyuran,et al.  Data driven bivariate landslide susceptibility assessment using geographical information systems: a method and application to Asarsuyu catchment, Turkey , 2004 .

[8]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[9]  P. Reichenbach,et al.  Optimal landslide susceptibility zonation based on multiple forecasts , 2010 .

[10]  Isik Yilmaz,et al.  Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat - Turkey) , 2009, Comput. Geosci..

[11]  John C. Davis,et al.  Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA , 2003 .

[12]  C. Thorne,et al.  Quantitative analysis of land surface topography , 1987 .

[13]  A. Ribolini,et al.  Logistic regression versus artificial neural networks: landslide susceptibility evaluation in a sample area of the Serchio River valley, Italy , 2009 .

[14]  E. Yesilnacar,et al.  Landslide susceptibility mapping : A comparison of logistic regression and neural networks methods in a medium scale study, Hendek Region (Turkey) , 2005 .

[15]  O. Korup,et al.  The role of landslides in mountain range evolution. , 2010 .

[16]  H. Wang,et al.  Rainfall‐induced landslide hazard assessment using artificial neural networks , 2006 .

[17]  D. Montgomery,et al.  Parameterization of soil properties for a model of topographic controls on shallow landsliding: application to Rio de Janeiro , 2003 .

[18]  H. A. Nefeslioglu,et al.  An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps , 2008 .

[19]  M. García-Rodríguez,et al.  Susceptibility assessment of earthquake-triggered landslides in El Salvador using logistic regression , 2008 .

[20]  Rex L. Baum,et al.  Transient deterministic shallow landslide modeling: Requirements for susceptibility and hazard assessments in a GIS framework , 2008 .

[21]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[22]  L. Ermini,et al.  Artificial Neural Networks applied to landslide susceptibility assessment , 2005 .

[23]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[24]  C. Hegg,et al.  Ereignisanalyse Hochwasser 2005. Teil 1 - Prozesse, Schäden und erste Einordnung , 2007 .

[25]  Toru Higuchi,et al.  Development of progressive landslide failure in cohesive materials , 2005 .

[26]  F. Agterberg,et al.  Weights of evidence modelling: a new approach to mapping mineral potential , 1990 .

[27]  C. Chung,et al.  Predicting landslides for risk analysis — Spatial models tested by a cross-validation technique , 2008 .

[28]  David R. Montgomery,et al.  A process-based model for colluvial soil depth and shallow landsliding using digital elevation data , 1995 .

[29]  F. Graf,et al.  Effects of forests on shallow landslides - case studies in Switzerland , 2009 .

[30]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[31]  P. Allen,et al.  Sediment flux from a mountain belt derived by landslide mapping , 1997 .

[32]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[33]  R. Sidle,et al.  A distributed slope stability model for steep forested basins , 1995 .

[34]  D. Rieke-Zapp,et al.  Slip rates variability and sediment mobilization on a shallow landslide in the northern Swiss Alps , 2007 .

[35]  Edgar Berrezueta,et al.  Landslides in the Central Coalfield (Cantabrian Mountains, NW Spain): Geomorphological features, conditioning factors and methodological implications in susceptibility assessment , 2007 .

[36]  W. Dietrich,et al.  Testing a model for predicting the timing and location of shallow landslide initiation in soil‐mantled landscapes , 2003 .

[37]  Chang-Jo Chung,et al.  The representation of geoscience information for data integration , 1993 .

[38]  D. Tarboton A new method for the determination of flow directions and upslope areas in grid digital elevation models , 1997 .

[39]  Peter Lehmann,et al.  Quantifying lateral root reinforcement in steep slopes – from a bundle of roots to tree stands , 2010 .

[40]  D. Montgomery,et al.  A physically based model for the topographic control on shallow landsliding , 1994 .

[41]  P. Atkinson,et al.  Generalised linear modelling of susceptibility to landsliding in the Central Apennines, Italy , 1998 .

[42]  William J. Elliot,et al.  Spatial Prediction of Landslide Hazard Using Logistic Regression and ROC Analysis , 2006, Trans. GIS.

[43]  W. Briggs Statistical Methods in the Atmospheric Sciences , 2007 .

[44]  Saro Lee,et al.  Application and cross-validation of spatial logistic multiple regression for landslide susceptibility analysis , 2005 .

[45]  Ru-Hua Song,et al.  Modeling the potential distribution of shallow-seated landslides using the weights of evidence method and a logistic regression model: a case study of the Sabae Area, Japan , 2008 .

[46]  Andrea G. Fabbri,et al.  Validation of Spatial Prediction Models for Landslide Hazard Mapping , 2003 .

[47]  A. Shakoor,et al.  A GIS-based landslide susceptibility evaluation using bivariate and multivariate statistical analyses , 2010 .