Detection of areas prone to flood risk using state-of-the-art machine learning models

Abstract The present study aims to evaluate the susceptibility to floods in the river basin of Buzau in Romania through the following 6 machine learning models: Support Vector Machine (SVM), J48 decision tree, Adaptive Neuro-Fuzzy Inference System (ANFIS), Random Forest (RF), Artificial Neural Network (ANN) and Alternating Decision Tree (ADT). In the first stage of the study, an inventory of the areas affected by floods was made in the study area, and a number of 205 flood points were identified. Further, 12 flood predictors were selected to be used for final susceptibility mapping. The six models' training was performed by using 70% of the total flood points that have been associated with the values of flood predictors. The highest accuracy (0.973) was obtained by the RF model, while J48 had the lowest performance (0.825). Besides, by classifying flood predictors' values in flood and non-flood pixels, the six flood susceptibility maps were made. High and very high flood susceptibility values cover between 17.71% (MLP) and 27.93% (ANFIS) of the study area. The validation of the results, performed using the ROC Curve, shows that the most accurate flood susceptibility values are also assigned to the RF model (AUC = 0.996).

[1]  B. Samali,et al.  Influence of seismic incident angle on response uncertainty and structural performance of tall asymmetric structure , 2020, The Structural Design of Tall and Special Buildings.

[2]  D. Peptenatu,et al.  Flood Hazard Mapping Using the Flood and Flash-Flood Potential Index in the Buzău River Catchment, Romania , 2019, Water.

[3]  Biswajeet Pradhan,et al.  Modeling flood susceptibility using data-driven approaches of naïve Bayes tree, alternating decision tree, and random forest methods. , 2019, The Science of the total environment.

[4]  B. Pham,et al.  A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. , 2018, The Science of the total environment.

[5]  Lalit Kumar,et al.  A novel GIS-based ensemble technique for flood susceptibility mapping using evidential belief function and support vector machine: Brisbane, Australia , 2019, PeerJ.

[6]  Ximing Cai,et al.  Communicating the Impacts of Projected Climate Change on Heavy Rainfall Using a Weighted Ensemble Approach , 2018 .

[7]  B. Pradhan,et al.  Landslide Susceptibility Assessment in Vietnam Using Support Vector Machines, Decision Tree, and Naïve Bayes Models , 2012 .

[8]  Romulus Costache,et al.  Flood Susceptibility Assessment by Using Bivariate Statistics and Machine Learning Models - A Useful Tool for Flood Risk Management , 2019, Water Resources Management.

[9]  Daniel Peptenatu,et al.  Deforestation and Frequency of Floods in Romania , 2019, Water Resources Management in Romania.

[10]  Asli Celikyilmaz,et al.  Modeling Uncertainty with Improved Fuzzy Functions , 2009 .

[11]  Wenlong Shi,et al.  Experimental Investigation and Error Analysis of High Precision FBG Displacement Sensor for Structural Health Monitoring , 2020 .

[12]  Amir Mosavi,et al.  Integrated machine learning methods with resampling algorithms for flood susceptibility prediction. , 2019, The Science of the total environment.

[13]  P. Pellikka,et al.  Patch aggregation trends of the global climate landscape under future global warming scenario , 2020, International Journal of Climatology.

[14]  X. Bui,et al.  Predicting blast-induced peak particle velocity using BGAMs, ANN and SVM: a case study at the Nui Beo open-pit coal mine in Vietnam , 2019, Environmental Earth Sciences.

[15]  Hao Wang,et al.  Robustness of the Active Rotary Inertia Driver System for Structural Swing Vibration Control Subjected to Multi-Type Hazard Excitations , 2019, Applied Sciences.

[16]  B. Pham,et al.  Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods , 2017, Theoretical and Applied Climatology.

[17]  Jinchang Ren,et al.  ANN vs. SVM: Which one performs better in classification of MCCs in mammogram imaging , 2012, Knowl. Based Syst..

[18]  Arturo S. Leon,et al.  Controlling HEC-RAS using MATLAB , 2016, Environ. Model. Softw..

[19]  Yizhong Chen,et al.  Coupling system dynamics analysis and risk aversion programming for optimizing the mixed noise-driven shale gas-water supply chains , 2021 .

[20]  Quoc Bao Pham,et al.  Novel hybrid models between bivariate statistics, artificial neural networks and boosting algorithms for flood susceptibility assessment. , 2020, Journal of environmental management.

[21]  A. Kalra,et al.  Coupling HEC-RAS and HEC-HMS in Precipitation Runoff Modelling and Evaluating Flood Plain Inundation Map , 2017 .

[22]  Dieu Tien Bui,et al.  Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study , 2017, Geotechnical and Geological Engineering.

[23]  Zhuo Chen,et al.  Unified No-Reference Quality Assessment of Singly and Multiply Distorted Stereoscopic Images , 2019, IEEE Transactions on Image Processing.

[24]  Yang Yang,et al.  Omnidirectional Motion Classification With Monostatic Radar System Using Micro-Doppler Signatures , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[25]  Quan Pan,et al.  A survey on multi-sensor fusion based obstacle detection for intelligent ground vehicles in off-road environments , 2020, Frontiers of Information Technology & Electronic Engineering.

[26]  Chunhui Zhao,et al.  Convergent Multiagent Formation Control With Collision Avoidance , 2020, IEEE Transactions on Robotics.

[27]  Li He,et al.  Life cycle assessment of greenhouse gas emissions and water-energy optimization for shale gas supply chain planning based on multi-level approach: Case study in Barnett, Marcellus, Fayetteville, and Haynesville shales , 2017 .

[28]  Li He,et al.  Evaluating the global potential of aquifer thermal energy storage and determining the potential worldwide hotspots driven by socio-economic, geo-hydrologic and climatic conditions , 2019, Renewable and Sustainable Energy Reviews.

[29]  Gabriel Minea,et al.  Assessment of the flash flood potential of Bâsca River Catchment (Romania) based on physiographic factors , 2013 .

[30]  Zhijia Li,et al.  Ground observation-based analysis of soil moisture spatiotemporal variability across a humid to semi-humid transitional zone in China , 2019, Journal of Hydrology.

[31]  Jun Liu,et al.  Efficient Deployment With Geometric Analysis for mmWave UAV Communications , 2020, IEEE Wireless Communications Letters.

[32]  Ronghua Liu,et al.  Investigation of inducements and defenses of flash floods and urban waterlogging in Fuzhou, China, from 1950 to 2010 , 2018, Natural Hazards.

[33]  Thomas Blaschke,et al.  Multi-Hazard Exposure Mapping Using Machine Learning for the State of Salzburg, Austria , 2020, Remote. Sens..

[34]  Chenchen Wang,et al.  New pore space characterization method of shale matrix formation by considering organic and inorganic pores , 2015 .

[35]  Hongwei Lu,et al.  Large decrease in streamflow and sediment load of Qinghai–Tibetan Plateau driven by future climate change: A case study in Lhasa River Basin , 2020 .

[36]  S. Saidi,et al.  Spatiotemporal floodplain mapping and prediction using HEC-RAS - GIS tools: Case of the Mejerda river, Tunisia , 2018, Journal of African Earth Sciences.

[37]  Thomas Blaschke,et al.  Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection , 2019, Remote. Sens..

[38]  Thomas Blaschke,et al.  Flood susceptibility mapping with machine learning, multi-criteria decision analysis and ensemble using Dempster Shafer Theory , 2020 .

[39]  L. Tian,et al.  Transport of intensity phase retrieval and computational imaging for partially coherent fields: The phase space perspective , 2015 .

[40]  Romulus Costache,et al.  Novel Ensembles of Deep Learning Neural Network and Statistical Learning for Flash-Flood Susceptibility Mapping , 2020, Water.

[41]  R. Schwarze,et al.  Exploring the uptake of nature-based measures in flood risk management: Evidence from German federal states , 2020 .

[42]  Hélène Roux,et al.  Accounting for rainfall systematic spatial variability in flash flood forecasting , 2016 .

[43]  Jiake Li,et al.  Simulation of the hydrological and environmental effects of a sponge city based on MIKE FLOOD , 2018, Environmental Earth Sciences.

[44]  Romulus Costache,et al.  Spatial prediction of flood potential using new ensembles of bivariate statistics and artificial intelligence: A case study at the Putna river catchment of Romania. , 2019, The Science of the total environment.

[45]  Shouming Zhong,et al.  Secondary delay‐partition approach on robust performance analysis for uncertain time‐varying Lurie nonlinear control system , 2017 .

[46]  Thomas Blaschke,et al.  A new GIS-based technique using an adaptive neuro-fuzzy inference system for land subsidence susceptibility mapping , 2018, Journal of Spatial Science.

[47]  Gangyi Jiang,et al.  Optimizing Multistage Discriminative Dictionaries for Blind Image Quality Assessment , 2018, IEEE Transactions on Multimedia.

[48]  K. Heme,et al.  AN EXPERIMENTAL AND NUMERICAL INVESTIGATION , 1983 .

[49]  Shamsuddin Shahid,et al.  Climate variability and changes in the major cities of Bangladesh: observations, possible impacts and adaptation , 2016, Regional Environmental Change.

[50]  B. Pradhan,et al.  Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms , 2013, Arabian Journal of Geosciences.

[51]  Romulus Costache,et al.  Flash-Flood Potential assessment in the upper and middle sector of Prahova river catchment (Romania). A comparative approach between four hybrid models. , 2019, The Science of the total environment.

[52]  Leonardo Mancusi,et al.  Improving flood risk analysis for effectively supporting the implementation of flood risk management plans: The case study of “Serio” Valley , 2017 .

[53]  Wei Gao,et al.  The feasibility of genetic programming and ANFIS in prediction energetic performance of a building integrated photovoltaic thermal (BIPVT) system , 2019 .

[54]  Ruhollah Taghizadeh-Mehrjardi,et al.  Comparison of artificial neural network and decision tree models in estimating spatial distribution of snow depth in a semi-arid region of Iran , 2016 .

[55]  Yanli Wu,et al.  Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping , 2020 .

[56]  M. Borga,et al.  Flash flood warning based on rainfall thresholds and soil moisture conditions: An assessment for gauged and ungauged basins , 2008 .

[57]  Zhenhao Zhang,et al.  Application of probabilistic method in maximum tsunami height prediction considering stochastic seabed topography , 2020, Natural Hazards.

[58]  Hui Zhao,et al.  History Matching of Naturally Fractured Reservoirs Using a Deep Sparse Autoencoder , 2021 .

[59]  H. Abida,et al.  Monte Carlo simulation-aided analytical hierarchy process (AHP) for flood susceptibility mapping in Gabes Basin (southeastern Tunisia) , 2017, Environmental Earth Sciences.

[60]  Lanh Si Ho,et al.  A Comparative Study of Kernel Logistic Regression, Radial Basis Function Classifier, Multinomial Naïve Bayes, and Logistic Model Tree for Flash Flood Susceptibility Mapping , 2020, Water.

[61]  O. Ghorbanzadeh,et al.  Integration of interval rough AHP and fuzzy logic for assessment of flood prone areas at the regional scale , 2020, Acta Geophysica.

[62]  Yi Liu,et al.  A Survey on Blocking Technology of Entity Resolution , 2020, Journal of Computer Science and Technology.

[63]  Ke Zhang,et al.  Geographically weighted regression based methods for merging satellite and gauge precipitation , 2018 .

[64]  I. Yilmaz Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and support vector machine , 2010 .

[65]  W. Botzen,et al.  The safe development paradox: An agent-based model for flood risk under climate change in the European Union , 2020, Global Environmental Change.

[66]  Wang Jian,et al.  Data-Driven Niching Differential Evolution with Adaptive Parameters Control for History Matching and Uncertainty Quantification , 2021 .

[67]  Yimiao Huang,et al.  Large group activity security risk assessment and risk early warning based on random forest algorithm , 2021, Pattern Recognit. Lett..

[68]  Quoc Bao Pham,et al.  Spatial predicting of flood potential areas using novel hybridizations of fuzzy decision-making, bivariate statistics, and machine learning , 2020 .

[69]  Mohammad Ali Ghorbani,et al.  GIS-based comparative assessment of flood susceptibility mapping using hybrid multi-criteria decision-making approach, naïve Bayes tree, bivariate statistics and logistic regression: A case of Topľa basin, Slovakia , 2020 .

[70]  Wei Liu,et al.  Flood susceptibility assessment based on a novel random Naïve Bayes method: A comparison between different factor discretization methods , 2020 .

[71]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[72]  Hyung-Sup Jung,et al.  Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea , 2017 .

[73]  H. Hong,et al.  Exploring effectiveness of frequency ratio and support vector machine models in storm surge flood susceptibility assessment: A study of Sundarban Biosphere Reserve, India , 2020 .

[74]  B. Pradhan,et al.  Integrating multilayer perceptron neural nets with hybrid ensemble classifiers for deforestation probability assessment in Eastern India , 2020, Geomatics, Natural Hazards and Risk.

[75]  N. Trang,et al.  Prediction of economic loss of rice production due to flood inundation under climate change impacts using a modeling approach: A case study in Ha Tinh Province, Vietnam , 2019 .

[76]  Jan Adamowski,et al.  Incorporating multi-criteria decision-making and fuzzy-value functions for flood susceptibility assessment , 2019, Geocarto International.

[77]  I. Burhan Türksen,et al.  Modeling Uncertainty with Fuzzy Logic - With Recent Theory and Applications , 2009, Studies in Fuzziness and Soft Computing.

[78]  B. Samali,et al.  Experimental and numerical investigation on the complex behaviour of the localised seismic response in a multi-storey plan-asymmetric structure , 2020 .

[79]  C. Conoscenti,et al.  A comparison of statistical methods and multi-criteria decision making to map flood hazard susceptibility in Northern Iran. , 2019, The Science of the total environment.

[80]  Mehdi Raftari,et al.  Optimization of ANFIS with GA and PSO estimating α ratio in driven piles , 2019, Engineering with Computers.

[81]  Hongwei Lu,et al.  Analysis of microplastics in a remote region of the Tibetan Plateau: Implications for natural environmental response to human activities. , 2020, The Science of the total environment.

[82]  Romulus Costache,et al.  Flash-flood Potential Index mapping using weights of evidence, decision Trees models and their novel hybrid integration , 2019, Stochastic Environmental Research and Risk Assessment.

[83]  Dieu Tien Bui,et al.  An Integration of Least Squares Support Vector Machines and Firefly Optimization Algorithm for Flood Susceptible Modeling Using GIS , 2017 .

[84]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[85]  Bahareh Kalantar,et al.  Assessment of the effects of training data selection on the landslide susceptibility mapping: a comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN) , 2018 .

[86]  K. Gnana Sheela,et al.  Neural network based hybrid computing model for wind speed prediction , 2013, Neurocomputing.

[87]  G. Bürger,et al.  Climate change impact on regional floods in the Carpathian region , 2019, Journal of Hydrology: Regional Studies.

[88]  T. Kavzoglu,et al.  A comparative assessment of canonical correlation forest, random forest, rotation forest and logistic regression methods for landslide susceptibility mapping , 2018, Geocarto International.

[89]  B. Pradhan,et al.  Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines , 2015 .

[90]  Wei Chen,et al.  Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling. , 2018, The Science of the total environment.

[91]  K. Tan,et al.  Affine Transformation-Enhanced Multifactorial Optimization for Heterogeneous Problems , 2020, IEEE Transactions on Cybernetics.

[92]  Prima Riza Kadavi,et al.  Evaluation of landslide susceptibility mapping by evidential belief function, logistic regression and support vector machine models , 2018 .

[93]  H. A. Nefeslioglu,et al.  Evaluation of Floods and Landslides Triggered by a Meteorological Catastrophe (Ordu, Turkey, August 2018) Using Optical and Radar Data , 2020 .

[94]  Nadhir Al-Ansari,et al.  GIS Based Hybrid Computational Approaches for Flash Flood Susceptibility Assessment , 2020, Water.

[95]  Hossein Moayedi,et al.  The Feasibility of Three Prediction Techniques of the Artificial Neural Network, Adaptive Neuro-Fuzzy Inference System, and Hybrid Particle Swarm Optimization for Assessing the Safety Factor of Cohesive Slopes , 2019, ISPRS Int. J. Geo Inf..

[96]  Christos Chalkias,et al.  Adaptive neuro-fuzzy inference system (ANFIS) modeling for landslide susceptibility assessment in a Mediterranean hilly area , 2019, Bulletin of Engineering Geology and the Environment.

[97]  Ana Deletic,et al.  Assessment of Urban Pluvial Flood Risk and Efficiency of Adaptation Options Through Simulations – A New Generation of Urban Planning Tools , 2017 .