How can statistical models help to determine driving factors of landslides

Landslides are a hazard for humans and artificial structures. From an ecological point of view, they represent an important ecosystem disturbance, especially in tropical montane forests. Here, shallow translational landslides are a frequent natural phenomenon and one local determinant of high levels of biodiversity. In this paper, we apply weighted ensembles of advanced phenomenological models from statistics and machine learning to analyze the driving factors of natural landslides in a tropical montane forest in South Ecuador. We exclusively interpret terrain attributes, derived from a digital elevation model, as proxies to several driving factors of landslides and use them as predictors in our models which are trained on a set of five historical landslide inventories. We check the model generality by transferring them in time and use three common performance criteria (i.e. AUC, explained deviance and slope of model calibration curve) to, on the one hand, compare several state-of-the-art model approaches and on the other hand, to create weighted model ensembles. Our results suggest that it is important to consider more than one single performance criterion.

[1]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[2]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[3]  Tetsuro Esaki,et al.  GIS-Based Probabilistic Mapping of Landslide Hazard Using a Three-Dimensional Deterministic Model , 2004 .

[4]  R. Sidle,et al.  A distributed slope stability model for steep forested basins , 1995 .

[5]  J. Lehmann,et al.  The Vertical Pattern of Rooting and Nutrient Uptake at Different Altitudes of a South Ecuadorian Montane Forest , 2006, Plant and Soil.

[6]  Miroslav Dudík,et al.  A maximum entropy approach to species distribution modeling , 2004, ICML.

[7]  Weimin Wu,et al.  Simulating effects of timber harvesting on the temporal and spatial distribution of shallow landslides , 1999 .

[8]  John P. Wilson,et al.  Terrain analysis : principles and applications , 2000 .

[9]  Miroslav Dudík,et al.  Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation , 2008 .

[10]  Veerle Vanacker,et al.  Linking hydrological, infinite slope stability and land-use change models through GIS for assessing the impact of deforestation on slope stability in high Andean watersheds , 2003 .

[11]  Saro Lee,et al.  Landslide susceptibility analysis and verification using the Bayesian probability model , 2002 .

[12]  R. Heerdegen,et al.  Quantifying source areas through land surface curvature and shape , 1982 .

[13]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  S. Weiss,et al.  GLM versus CCA spatial modeling of plant species distribution , 1999, Plant Ecology.

[16]  R. Sidle,et al.  Distributed simulations of landslides for different rainfall conditions , 2004 .

[17]  J. Lehmann,et al.  Root Morphology and Anchorage of Six Native Tree Species from a Tropical Montane Forest and an Elfin Forest in Ecuador , 2005, Plant and Soil.

[18]  Robert P. Anderson,et al.  Maximum entropy modeling of species geographic distributions , 2006 .

[19]  G. Moisen,et al.  PresenceAbsence: An R Package for Presence Absence Analysis , 2008 .

[20]  Fuchu Dai,et al.  Landslide risk assessment and management: an overview , 2002 .

[21]  Jürgen Homeier,et al.  Growth Dynamics of Trees in Tropical Mountain Ecosystems , 2008 .

[22]  R. Bagnold An approach to the sediment transport problem from general physics , 1966 .

[23]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[24]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[25]  J. Connell Diversity in tropical rain forests and coral reefs. , 1978, Science.

[26]  Jane Elith,et al.  POC plots: calibrating species distribution models with presence-only data. , 2010, Ecology.

[27]  Christoph Leuschner,et al.  Large altitudinal increase in tree root/shoot ratio in tropical mountain forests of Ecuador , 2007 .

[28]  Erwin Beck,et al.  Reasons for an outstanding plant diversity in the tropical Andes of southern Ecuador. , 2009 .

[29]  Michael Richter,et al.  Landslides as Important Disturbance Regimes — Causes and Regeneration , 2008 .

[30]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[31]  Boris Schröder,et al.  Predicting the species composition of Nardus stricta communities by logistic regression modelling , 2004 .

[32]  K. Beven,et al.  A physically based, variable contributing area model of basin hydrology , 1979 .

[33]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[34]  R. Sidle,et al.  A conceptual model of changes in root cohesion in response to vegetation management. , 1991 .

[35]  Matthew C. Larsen,et al.  The frequency and distribution of recent landslides in three montane tropical regions of Puerto Rico , 1998 .

[36]  Jeroen M. Schoorl,et al.  Contribution of Topographically Based Landslide Hazard Modelling to the Analysis of the Spatial Distribution and Ecology of Kauri (Agathis australis) , 2005, Landscape Ecology.

[37]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  T. Wu,et al.  Strength of tree roots and landslides on Prince of Wales Island, Alaska , 1979 .

[39]  P. Reichenbach,et al.  Landslide hazard assessment in the Collazzone area, Umbria, Central Italy , 2006 .

[40]  B. Reineking,et al.  Constrain to perform: Regularization of habitat models , 2006 .

[41]  T. G. Freeman,et al.  Calculating catchment area with divergent flow based on a regular grid , 1991 .

[42]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[43]  Norman H. Pillsbury,et al.  A Risk Analysis Approach for Using Discriminant Functions to Manage Logging-Related Landslides on Granitic Terrain , 1985 .

[44]  R. Valentino,et al.  Experimental analysis and modelling of shallow landslides , 2007 .

[45]  Niklaus E. Zimmermann,et al.  Predicting tree species presence and basal area in Utah: A comparison of stochastic gradient boosting, generalized additive models, and tree-based methods , 2006 .

[46]  D. Montgomery,et al.  A physically based model for the topographic control on shallow landsliding , 1994 .

[47]  P. Reichenbach,et al.  Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy , 1999 .

[48]  David F. R. P. Burslem,et al.  Disturbing hypotheses in tropical forests , 2003 .

[49]  C. Dormann,et al.  Static species distribution models in dynamically changing systems: how good can predictions really be? , 2009 .

[50]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[51]  Andreas Huth,et al.  Impacts of recruitment limitation and canopy disturbance on tropical tree species richness , 2007 .

[52]  J. Friedman Multivariate adaptive regression splines , 1990 .

[53]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[54]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[55]  Wilfried Thuiller,et al.  Statistical consensus methods for improving predictive geomorphology maps , 2009, Comput. Geosci..

[56]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[57]  Lieven Claessens,et al.  Landsliding and Its Multiscale Influence on Mountainscapes , 2009 .

[58]  I. Moore,et al.  Digital terrain modelling: A review of hydrological, geomorphological, and biological applications , 1991 .

[59]  H. Mitásová,et al.  Interpolation by regularized spline with tension: I. Theory and implementation , 1993 .

[60]  Juan Remondo,et al.  Landslide Susceptibility Models Utilising Spatial Data Analysis Techniques. A Case Study from the Lower Deba Valley, Guipuzcoa (Spain) , 2003 .