Are new open building data useful for flood vulnerability modelling?

Abstract. Flood risk modelling aims to quantify the probability of flooding and the resulting consequences for exposed elements. The assessment of flood damage is a core task that requires the description of complex flood damage processes including the influences of flooding intensity and vulnerability characteristics. Multi-variable modelling approaches are better suited for this purpose than simple stage-damage functions. However, multi-variable flood vulnerability models also often have problems to predict damage for regions other than those for which they have been developed. A transfer of vulnerability models usually results in a drop of model predictive performance. Here we investigate the question of whether data from the open data source OpenStreetMap is suitable to model flood vulnerability of residential buildings and whether the underlying standardized data model is helpful to transfer models across regions. We develop a new data set by calculating numerical spatial measures for residential building footprint geometries and combine these variables with an empirical data set of observed flood damage. From this data set random forest regression models are learned using regional sub-sets and are tested for predicting flood damage in other regions. This regional split-sample validation approach reveals that the predictive performance of models based on OpenStreetMap data is comparable to alternative multi-variable models, which use comprehensive and detailed information about preparedness, socio-economic status and other aspects of residential building vulnerability. However, our results show that using numerical spatial measures derived from OpenStreetMap building geometries does not resolve all problems of model transfer. Still, we conclude that these variables are useful proxies for flood vulnerability modelling, because these data are consistent, openly accessible, and thus make it easier and more cost-effective to transfer vulnerability models to other regions.

[1]  R. Keim,et al.  Natural Hazards and Earth System Sciences , 2013 .

[2]  J. Chatterton,et al.  The Benefits of Flood Alleviation: A Manual of Assessment Techniques , 1978 .

[3]  Abbas Rajabifard,et al.  A framework for a microscale flood damage assessment and visualization for a building using BIM–GIS integration , 2016, Int. J. Digit. Earth.

[4]  B. Merz,et al.  Tracing the value of data for flood loss modelling , 2016 .

[5]  Bruno Merz,et al.  Review article "Assessment of economic flood damage" , 2010 .

[6]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[7]  Heidi Kreibich,et al.  Coping with floods in the city of Dresden, Germany , 2009 .

[8]  G. Blöschl,et al.  The June 2013 flood in the Upper Danube Basin, and comparisons with the 2002, 1954 and 1899 floods , 2013 .

[9]  Bruno Merz,et al.  How useful are complex flood damage models? , 2014 .

[10]  B. Merz,et al.  Estimation uncertainty of direct monetary flood damage to buildings , 2004 .

[11]  Paul C. Boutros,et al.  The parameter sensitivity of random forests , 2016, BMC Bioinformatics.

[12]  Bruno Merz,et al.  Insurability and Mitigation of Flood Losses in Private Households in Germany , 2006, Risk analysis : an official publication of the Society for Risk Analysis.

[13]  Heiko Apel,et al.  Flood risk analyses—how detailed do we need to be? , 2009 .

[14]  Brenden Jongman,et al.  Effective adaptation to rising flood risk , 2018, Nature Communications.

[15]  Stanley A. Changnon,et al.  Shifting Economic Impacts from Weather Extremes in the United States: A Result of Societal Changes, Not Global Warming , 2003 .

[16]  H. Winsemius,et al.  A framework for global river flood risk assessments , 2012 .

[17]  Peter Salamon,et al.  Modelling the socio-economic impact of river floods in Europe , 2016 .

[18]  H. Kreibich,et al.  Influence of flood frequency on residential building losses , 2010 .

[19]  Bruno Merz,et al.  The extreme flood in June 2013 in Germany , 2014 .

[20]  Balqis M. Rehan An innovative micro-scale approach for vulnerability and flood risk assessment with the application to property-level protection adoptions , 2018, Natural Hazards.

[21]  R. Figueiredo,et al.  Using Open Building Data in the Development of Exposure Datasets for Catastrophe Risk Modelling , 2015 .

[22]  Heidi Kreibich,et al.  A Review of Flood Loss Models as Basis for Harmonization and Benchmarking , 2016, PloS one.

[23]  Bruno Merz,et al.  Seamless Estimation of Hydrometeorological Risk Across Spatial Scales , 2019, Earth's Future.

[24]  Mathieu Basille,et al.  rpostgis: Linking R with a PostGIS Spatial Database , 2018, R J..

[25]  Andreas Paul Zischg,et al.  Are flood damage models converging to “reality”? Lessons learnt from a blind test , 2020, Natural Hazards and Earth System Sciences.

[26]  Xuan Linh Nguyen,et al.  Verification of novel integrations of swarm intelligence algorithms into deep learning neural network for flood susceptibility mapping , 2020, Journal of Hydrology.

[27]  Bruno Merz,et al.  What made the June 2013 flood in Germany an exceptional event? A hydro-meteorological evaluation , 2014 .

[28]  A. Blanco-Vogt,et al.  Assessment of the physical flood susceptibility of buildings on a large scale - conceptual and methodological frameworks , 2014 .

[29]  Xiaohong Chen,et al.  Flood hazard risk assessment model based on random forest , 2015 .

[30]  Jeroen C. J. H. Aerts,et al.  Comparative flood damage model assessment: towards a European approach , 2012 .

[31]  Robert Hecht,et al.  Measuring Completeness of Building Footprints in OpenStreetMap over Space and Time , 2013, ISPRS Int. J. Geo Inf..

[32]  G. Zhai,et al.  MODELING FLOOD DAMAGE: CASE OF TOKAI FLOOD 2000 1 , 2005 .

[33]  Hadley Wickham,et al.  Reshaping Data with the reshape Package , 2007 .

[34]  Aisling Irwin,et al.  No PhDs needed: how citizen science is transforming research , 2018, Nature.

[35]  P. Hoeppe Trends in weather related disasters – Consequences for insurers and society , 2016 .

[36]  Bruno Merz,et al.  Multi-variate flood damage assessment: a tree-based data-mining approach , 2013 .

[37]  Stefan Lüdtke,et al.  Flood loss estimation using 3D city models and remote sensing data , 2018, Environ. Model. Softw..

[38]  James B. Brown,et al.  Iterative random forests to discover predictive and stable high-order interactions , 2017, Proceedings of the National Academy of Sciences.

[39]  Animesh K. Gain,et al.  Multi-Variate Analyses of Flood Loss in Can Tho City, Mekong Delta , 2015 .

[40]  Andreas Paul Zischg,et al.  From global circulation to local flood loss: Coupling models across the scales. , 2018, The Science of the total environment.

[41]  U. Ulbrich,et al.  The central European floods of August 2002: Part 2 –Synoptic causes and considerations with respect to climatic change , 2003 .

[42]  Kohske Takahashi,et al.  Welcome to the Tidyverse , 2019, J. Open Source Softw..

[43]  Annegret H. Thieken,et al.  Review article: assessing the costs of natural hazards - state of the art and knowledge gaps , 2013 .

[44]  Sarah E. Kienzler,et al.  After the extreme flood in 2002: changes in preparedness, response and recovery of flood-affected residents in Germany between 2005 and 2011 , 2014 .

[45]  Heidi Kreibich,et al.  The flood of June 2013 in Germany: how much do we know about its impacts? , 2016 .

[46]  Martin Jung,et al.  LecoS - A python plugin for automated landscape ecology analysis , 2016, Ecol. Informatics.

[47]  Stefan Lüdtke,et al.  Regional and Temporal Transferability of Multivariable Flood Damage Models , 2018 .

[48]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[49]  Bruno Merz,et al.  Tree‐based flood damage modeling of companies: Damage processes and model performance , 2017 .

[50]  Francesco Dottori,et al.  INSYDE: a synthetic, probabilistic flood damage model based on explicit cost analysis , 2016 .

[51]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[52]  D. I. Smith Flood damage estimation - A review of urban stage-damage curves and loss functions , 1994 .

[53]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[54]  H. Kreibich,et al.  A Consistent Approach for Probabilistic Residential Flood Loss Modeling in Europe , 2019, Water Resources Research.

[55]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[56]  Heidi Kreibich,et al.  Multi-model ensembles for assessment of flood losses and associated uncertainty , 2018 .

[57]  Massimiliano Pittore,et al.  Perspectives on global dynamic exposure modelling for geo-risk assessment , 2017, Natural Hazards.

[58]  H. Kreibich,et al.  Estimating exposure of residential assets to natural hazards in Europe using open data , 2019, Natural Hazards and Earth System Sciences.

[59]  Dennis Wagenaar,et al.  Multi-variable flood damage modelling with limited data using supervised learning approaches , 2017 .

[60]  Anthony J. Jakeman,et al.  Flood inundation modelling: A review of methods, recent advances and uncertainty analysis , 2017, Environ. Model. Softw..

[61]  Z. Kundzewicz,et al.  River flood risk and adaptation in Europe—assessment of the present status , 2010 .

[62]  Bertrand Michel,et al.  Correlation and variable importance in random forests , 2013, Statistics and Computing.

[63]  E. Penning-Rowsell,et al.  Flood risk assessments at different spatial scales , 2015, Mitigation and Adaptation Strategies for Global Change.

[64]  A. Thieken,et al.  Adaptability and transferability of flood loss functions in residential areas , 2013 .

[65]  Bruno Merz,et al.  Hierarchical Bayesian Approach for Modeling Spatiotemporal Variability in Flood Damage Processes , 2019, Water Resources Research.

[66]  H. Kreibich,et al.  Data Collection for a Better Understanding of What Causes Flood Damage–Experiences with Telephone Surveys , 2017 .

[67]  Paola Zuccolotto,et al.  Variable Selection Using Random Forests , 2006 .

[68]  B. Merz,et al.  Flood damage and influencing factors: New insights from the August 2002 flood in Germany , 2005 .

[69]  Edzer Pebesma,et al.  Simple Features for R: Standardized Support for Spatial Vector Data , 2018, R J..

[70]  Adam Millard-Ball,et al.  The world’s user-generated road map is more than 80% complete , 2017, PloS one.