Validation and evaluation of two multivariate statistical models for predictive shallow landslide susceptibility mapping of the Eastern Pyrenees (Spain)

This paper deals with the quality of two multivariate statistical models based on the Geographical Information System for shallow landslide susceptibility assessment in a test area at La Pobla de Lillet (Eastern Pyrenees, Spain). The quality, which was guaranteed by a rigorous methodology based on a suitable diagnosis, validation, and evaluation of the models, ensured a reliable contrast of the final susceptibility maps. This enables us to transfer the best results to the end user. Landslide susceptibility models were carried out by logistic regression and discriminant analysis of the significant conditioning factors related to the characteristics of the slope and the upslope contributing area captured from the digital elevation model and landslide distribution. The explanatory variables were tested (KS test, principal components and one-way and T-test) to select the most statistically significant ones before being introduced into the logistic and discriminant analyses. Accuracy statistics and the receiver operating characteristic curve used for diagnosis and validation showed similar prediction skills and a good fit to the data with more than 85% of unfailed cells properly classified for the two models. The evaluation of the study area and the correlation function (R2 = 0.83) between the models revealed that the discriminant model overestimated the susceptibility of the most stable zones with respect to the logistic model. Different methods of producing susceptibility maps showed marked differences in matching the models. Substantial spatial agreement (Kappa = 0.741) between binary maps produced by the standard cut-off value descended moderately (Kappa = 0.540) as a result of superimposing maps with five susceptibility levels defined by landslide percentage. Despite the fact that the two statistical models are similar in assessing susceptibility in the study area, the implications for hazard and risk management can be different because of the conservative nature of the discriminant model.

[1]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[2]  P. Frattini,et al.  Geomorphological and historical data in assessing landslide hazard , 2003 .

[3]  C. Westen,et al.  An approach towards deterministic landslide hazard analysis in GIS. A case study from Manizales (Colombia) , 1996 .

[4]  P. Reichenbach,et al.  Estimating the quality of landslide susceptibility models , 2006 .

[5]  H. A. Nefeslioglu,et al.  Susceptibility assessments of shallow earthflows triggered by heavy rainfall at three catchments by logistic regression analyses , 2005 .

[6]  E. Harp,et al.  A method for producing digital probabilistic seismic landslide hazard maps , 2000 .

[7]  Lalit Kumar,et al.  Comparative assessment of the measures of thematic classification accuracy , 2007 .

[8]  John C. Davis,et al.  Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA , 2003 .

[9]  M. Goldstein,et al.  Multivariate Analysis: Methods and Applications , 1984 .

[10]  D. Greenway,et al.  Vegetation and slope stability , 1987 .

[11]  R. Soeters,et al.  Landslide hazard and risk zonation—why is it still so difficult? , 2006 .

[12]  P. Reichenbach,et al.  Gis Technology in Mapping Landslide Hazard , 1995 .

[13]  J. Corominas,et al.  Assessment of shallow landslide susceptibility by means of multivariate statistical techniques , 2001 .

[14]  H. A. Nefeslioglu,et al.  An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps , 2008 .

[15]  Juan Remondo,et al.  Landslide Susceptibility Models Utilising Spatial Data Analysis Techniques. A Case Study from the Lower Deba Valley, Guipuzcoa (Spain) , 2003 .

[16]  Nicholas Sitar,et al.  Hydrologic conditions leading to debris-flow initiation , 1990 .

[17]  Saro Lee,et al.  Landslide susceptibility analysis and verification using the Bayesian probability model , 2002 .

[18]  M. Eeckhaut,et al.  Prediction of landslide susceptibility using rare events logistic regression: A case-study in the Flemish Ardennes (Belgium) , 2006 .

[19]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[20]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[21]  P. Frattini,et al.  Comparing models of debris-flow susceptibility in the alpine environment , 2008 .

[22]  J. H. Schuenemeyer,et al.  Statistical Methods For Geographers , 1986 .

[23]  Simone Sterlacchini,et al.  Landslide Representation Strategies in Susceptibility Studies using Weights-of-Evidence Modeling Technique , 2007 .

[24]  E. Yesilnacar,et al.  Landslide susceptibility mapping : A comparison of logistic regression and neural networks methods in a medium scale study, Hendek Region (Turkey) , 2005 .

[25]  J. Muñoz,et al.  Thrust sequences in the eastern Spanish Pyrenees , 1986 .

[26]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[27]  W. Marcus,et al.  Assessing landslide potential using GIS, soil wetness modeling and topographic attributes, Payette River, Idaho , 2001 .

[28]  J. Corominas,et al.  A GIS-Based Multivariate Statistical Analysis for Shallow Landslide Susceptibility Mapping in La Pobla de Lillet Area (Eastern Pyrenees, Spain) , 2003 .

[29]  Saro Lee,et al.  Determination and application of the weights for landslide susceptibility mapping using an artificial neural network , 2004 .

[30]  J. Malet,et al.  Landslide susceptibility assessment by bivariate methods at large scales: Application to a complex mountainous environment , 2007 .

[31]  Jay Gao Identification of topographic settings conducive to landsliding from dem in Nelson county, Virginia, U.S.A. , 1993 .

[32]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[33]  M. Uschold,et al.  Methods and applications , 1953 .

[34]  Santiago Beguería,et al.  Validation and Evaluation of Predictive Models in Hazard Assessment and Risk Management , 2006 .

[35]  H. Park Comparing Group Means: T-tests and One-way ANOVA Using Stata, SAS, R, and SPSS , 2009 .

[36]  H. A. Nefeslioglu,et al.  Application of logistic regression for landslide susceptibility zoning of Cekmece Area, Istanbul, Turkey , 2006 .

[37]  C. Chung,et al.  Probabilistic prediction models for landslide hazard mapping , 1999 .

[38]  Candan Gokceoglu,et al.  LANDSLIDE SUSCEPTIBILITY MAPPING OF CEKMECE AREA (ISTANBUL, TURKEY) BY CONDITIONAL PROBABILITY , 2005 .

[39]  Chang-Jo Chung,et al.  The representation of geoscience information for data integration , 1993 .

[40]  C. F. Lee,et al.  Landslide characteristics and, slope instability modeling using GIS, Lantau Island, Hong Kong , 2002 .

[41]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[42]  Randy B. Foltz,et al.  Spatial prediction of landslide hazard using discriminant analysis and GIS , 2000 .

[43]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[44]  Andy P. Field,et al.  Discovering Statistics Using SPSS , 2000 .

[45]  L. Ermini,et al.  Artificial Neural Networks applied to landslide susceptibility assessment , 2005 .

[46]  Lucia Luzi,et al.  Slope vulnerability to earthquakes at subregional scale, using probabilistic techniques and geographic information systems , 2000 .

[47]  Michael M. Strand Estimation of a Population Total under a “Bernoulli Sampling” Procedure , 1979 .

[48]  David M. Cruden,et al.  LANDSLIDE TYPES AND PROCESSES , 1958 .

[49]  Saro Lee,et al.  Statistical analysis of landslide susceptibility at Yongin, Korea , 2001 .

[50]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .