Flood Defense Standard Estimation Using Machine Learning and Its Representation in Large‐Scale Flood Hazard Modeling

We propose a machine learning‐based approach to estimate the flood defense standard (FDS) for unlabeled sites. We adopted random forest regression (RFR) to characterize the relationship between the declared FDS and 10 explanatory factors contained in publicly available data sets. We compared RFR with multiple linear regression (MLR) and demonstrated the proposed approach in the conterminous United States (CONUS) and England, respectively. The results showed the following: (a) RFR performed better than MLR, with a Nash–Sutcliffe efficiency of 0.85 in the CONUS and 0.76 in England. Unsatisfactory performances of MLR indicated that the relationship between the FDS and explanatory factors did not obey an explicit linear function. (b) RFR revealed river flood factors had higher importance than physical and socio‐economic factors in the FDS estimation. The proposed RFR achieved the highest performance using all factors for prediction and could not provide good predictions (NSE < 0.65) using physical or socio‐economic factors individually. (c) We estimated the FDS for all unlabeled sites in the CONUS and England. Approximately 80% and 29% of sites were identified as high or highest standard (>100‐year return period) in the CONUS and England, respectively. (d) We incorporated the estimated FDS in large‐scale flood modeling and compared the model results with official flood hazard maps in three case studies. We identified obvious overestimations in protected areas when flood defenses were not taken into account; flood defenses were successfully represented using the proposed approach.

[1]  J. Altringham,et al.  Variable ranking and selection with random forest for unbalanced data , 2022, Environmental Data Science.

[2]  B. Merz,et al.  Event and Catchment Controls of Heavy Tail Behavior of Floods , 2022, Water Resources Research.

[3]  E. Wohl,et al.  Levees don't protect, they disconnect: A critical review of how artificial levees impact floodplain functions. , 2022, The Science of the total environment.

[4]  E. Wohl,et al.  Identification of Artificial Levees in the Contiguous United States , 2022, Water Resources Research.

[5]  Bo Pang,et al.  Improving urban flood susceptibility mapping using transfer learning , 2021 .

[6]  Chaopeng Shen,et al.  Editorial: Broadening the Use of Machine Learning in Hydrology , 2021, Frontiers in Water.

[7]  C. Sampson,et al.  Estimating River Channel Bathymetry in Large Scale Flood Inundation Models , 2021, Water Resources Research.

[8]  Chaopeng Shen,et al.  Transferring Hydrologic Data Across Continents – Leveraging Data‐Rich Regions to Improve Hydrologic Prediction in Data‐Sparse Regions , 2021, Water Resources Research.

[9]  D. Edmonds,et al.  A global database of flood-protection levees on river deltas (openDELvE) , 2021 .

[10]  L. Samaniego,et al.  Machine learning methods to assess the effects of a non-linear damage spectrum taking into account soil moisture on winter wheat yields in Germany , 2021 .

[11]  P. Bates,et al.  Design flood estimation for global river networks based on machine learning models , 2020 .

[12]  Jerom P. M. Aerts,et al.  Comparison of estimates of global flood models for flood hazard and exposed gross domestic product: a China case study , 2020, Natural Hazards and Earth System Sciences.

[13]  Dan-dan Wang,et al.  Assessing Chinese flood protection and its social divergence , 2020, Natural Hazards and Earth System Sciences.

[14]  Tzu-Tsung Wong,et al.  Reliable Accuracy Estimates from k-Fold Cross Validation , 2020, IEEE Transactions on Knowledge and Data Engineering.

[15]  P. Bates,et al.  The Impact of Dams on Design Floods in the Conterminous US , 2019, Water Resources Research.

[16]  C. Sampson,et al.  A New Automated Method for Improved Flood Defense Representation in Large‐Scale Hydraulic Models , 2019, Water Resources Research.

[17]  W. Botzen,et al.  Cost–benefit analysis of flood‐zoning policies: A review of current practice , 2019, WIREs Water.

[18]  Chong-yu Xu,et al.  A general framework of design flood estimation for cascade reservoirs in operation period , 2019, Journal of Hydrology.

[19]  Dai Yamazaki,et al.  MERIT Hydro: A High‐Resolution Global Hydrography Map Based on Latest Topography Dataset , 2019, Water Resources Research.

[20]  A. Langousis,et al.  A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources , 2019, Water.

[21]  Janet E. Heffernan,et al.  The Spatial Dependence of Flood Hazard and Risk in the United States , 2019, Water Resources Research.

[22]  Zhaoyin Wang,et al.  Two-thousand years of debates and practices of Yellow River training strategies , 2019, International Journal of Sediment Research.

[23]  Freek van Leijen,et al.  Applicability of satellite radar imaging to monitor the conditions of levees , 2018, Journal of Flood Risk Management.

[24]  D. Lettenmaier,et al.  If Precipitation Extremes Are Increasing, Why Aren't Floods? , 2018, Water Resources Research.

[25]  A. Viglione,et al.  Reservoir Effects on Flood Peak Discharge at the Catchment Scale , 2018, Water Resources Research.

[26]  Kwok-wing Chau,et al.  Flood Prediction Using Machine Learning Models: Literature Review , 2018, Water.

[27]  J. Medellín-Azuara,et al.  Full Flood Cost: Insights from a Risk Analysis Perspective , 2018, Journal of Environmental Engineering.

[28]  Klaudija Bašić,et al.  Design criteria for levees , 2018 .

[29]  N. Hanan,et al.  Global Hydrologic Soil Groups (HYSOGs250m) for Curve Number-Based Runoff Modeling , 2018 .

[30]  Matti Kummu,et al.  Gridded global datasets for Gross Domestic Product and Human Development Index over 1990–2015 , 2018, Scientific Data.

[31]  Stephen E. Fick,et al.  WorldClim 2: new 1‐km spatial resolution climate surfaces for global land areas , 2017 .

[32]  Kris A. Johnson,et al.  Validation of a 30 m resolution flood hazard model of the conterminous United States , 2017 .

[33]  Jeroen C. J. H. Aerts,et al.  A global framework for future costs and benefits of river-flood protection in urban areas , 2017, Nature Climate Change.

[34]  S. Kanae,et al.  A high‐accuracy map of global terrain elevations , 2017 .

[35]  J. Sanyal Uncertainty in levee heights and its effect on the spatial pattern of flood hazard in a floodplain , 2017 .

[36]  D. Milodowski,et al.  Geomorphometric delineation of floodplains and terraces from objectively defined topographic thresholds , 2017 .

[37]  Yang Hong,et al.  Development of an NRCS curve number global dataset using the latest geospatial remote sensing data for worldwide hydrologic applications , 2017 .

[38]  D. Apley,et al.  Visualizing the effects of predictor variables in black box supervised learning models , 2016, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[39]  Jeroen C. J. H. Aerts,et al.  FLOPROS: an evolving global database of flood protection standards , 2016 .

[40]  Paul D. Bates,et al.  A high‐resolution global flood hazard model† , 2015, Water resources research.

[41]  P. Bates,et al.  Usefulness and limitations of global flood risk models , 2015 .

[42]  Kytt MacManus,et al.  Taking Advantage of the Improved Availability of Census Data: A First Look at the Gridded Population of the World, Version 4 , 2015 .

[43]  W. Huang,et al.  Levee reliability analyses for various flood return periods – a case study in southern Taiwan , 2015 .

[44]  J. Aerts,et al.  Declining vulnerability to river floods and the global benefits of adaptation , 2015, Proceedings of the National Academy of Sciences.

[45]  Paul D. Bates,et al.  Regional flood frequency analysis at the global scale , 2015 .

[46]  Yunjae Choung,et al.  Mapping Levees Using LiDAR Data and Multispectral Orthoimages in the Nakdong River Basins, South Korea , 2014, Remote. Sens..

[47]  Virpi Junttila,et al.  Algorithm for Extracting Digital Terrain Models under Forest Canopy from Airborne LiDAR Data , 2014, Remote. Sens..

[48]  Yang Hong,et al.  CONUS-Wide Evaluation of National Weather Service Flash Flood Guidance Products , 2014 .

[49]  Bernhard Lehner,et al.  Global river hydrography and network routing: baseline data and new approaches to study the world's large river systems , 2013 .

[50]  M. Bruen,et al.  Bankfull discharge and recurrence intervals in Irish rivers floods , 2013 .

[51]  R. Kingsford,et al.  Semi‐automated GIS techniques for detecting floodplain earthworks , 2013 .

[52]  P. Bates,et al.  A subgrid channel model for simulating river hydraulics and floodplain inundation over large and data sparse areas , 2012 .

[53]  Chris Aldrich,et al.  Interpretation of nonlinear relationships between process variables by use of random forests , 2012 .

[54]  Isric,et al.  Harmonized World Soil Database - HWSD (version 1.2) , 2012 .

[55]  L. Feyen,et al.  Fluvial flood risk in Europe in present and future climates , 2012, Climatic Change.

[56]  N. Pinter,et al.  Hydraulic and flood-loss modeling of levee, floodplain, and river management strategies, Middle Mississippi River, USA , 2012, Natural Hazards.

[57]  Jessica Ludy,et al.  Flood risk perception in lands “protected” by 100-year levees , 2012, Natural Hazards.

[58]  Neil D. Burgess,et al.  Delimiting tropical mountain ecoregions for conservation , 2011, Environmental Conservation.

[59]  Giulia Sofia,et al.  An objective approach for feature extraction: distribution analysis and statistical descriptors for scale choice and channel network identification , 2011 .

[60]  Christopher O. Justice,et al.  Estimating Global Cropland Extent with Multi-year MODIS Data , 2010, Remote. Sens..

[61]  P. Bates,et al.  A simple inertial formulation of the shallow water equations for efficient two-dimensional flood inundation modelling. , 2010 .

[62]  Richard N. Palmer,et al.  World Environmental and Water Resources Congress 2010 , 2010 .

[63]  Doug Bellomo,et al.  FEMA Flood Map Modernization Program , 2010 .

[64]  P. Bates,et al.  A technique for the calibration of hydraulic models using uncertain satellite observations of flood extent , 2009 .

[65]  S. Hanson,et al.  Ranking Port Cities with High Exposure and Vulnerability to Climate Extremes: Exposure Estimates , 2008 .

[66]  Xudong Fu,et al.  Response of bankfull discharge to discharge and sediment load in the Lower Yellow River , 2008 .

[67]  Michael K. Sharp,et al.  New Orleans Levee System Performance during Hurricane Katrina: London Avenue and Orleans Canal South , 2008 .

[68]  Jay R. Lund,et al.  Climate change, urbanization, and optimal long‐term floodplain protection , 2007 .

[69]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[70]  W. Zhu,et al.  Making bootstrap statistical inferences: a tutorial. , 1997, Research quarterly for exercise and sport.

[71]  G. Williams Bank‐full discharge of rivers , 1978 .

[72]  G. N. Alexander Effect of catchment area on flood magnitude , 1972 .

[73]  Naoya Shibata,et al.  Socio-hydrological modeling and its issues in Japan: a case study in Naganuma District, Nagano City , 2022, Hydrological Research Letters.

[74]  David J. Wood,et al.  Identification of River Defences from Digital Terrain Models using Deep Learning , 2021, Science and practice for an uncertain future.

[75]  H. Lange,et al.  Machine Learning Applications in Hydrology , 2020 .

[76]  Jim W Hall,et al.  Adaptation thresholds and pathways for tidal flood risk management in London , 2019, Climate Risk Management.

[77]  C. Sampson,et al.  Progress Toward Hyperresolution Models of Global Flood Hazard , 2018 .

[78]  Roberto Rudari,et al.  IMPROVEMENT OF THE GLOBAL FLOOD MODEL FOR THE GAR 2015 , 2015 .

[79]  Jeffrey G. Arnold,et al.  Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations , 2007 .

[80]  R. Nicholls Analysis of global impacts of sea-level rise: a case study of flooding , 2002 .

[81]  R. Lewis An Introduction to Classification and Regression Tree (CART) Analysis , 2000 .