Tree-based machine learning algorithms in the Internet of Things environment for multivariate flood status prediction

Abstract Floods are one of the most common natural disasters in the world that affect all aspects of life, including human beings, agriculture, industry, and education. Research for developing models of flood predictions has been ongoing for the past few years. These models are proposed and built-in proportion for risk reduction, policy proposition, loss of human lives, and property damages associated with floods. However, flood status prediction is a complex process and demands extensive analyses on the factors leading to the occurrence of flooding. Consequently, this research proposes an Internet of Things-based flood status prediction (IoT-FSP) model that is used to facilitate the prediction of the rivers flood situation. The IoT-FSP model applies the Internet of Things architecture to facilitate the flood data acquisition process and three machine learning (ML) algorithms, which are Decision Tree (DT), Decision Jungle, and Random Forest, for the flood prediction process. The IoT-FSP model is implemented in MATLAB and Simulink as development platforms. The results show that the IoT-FSP model successfully performs the data acquisition and prediction tasks and achieves an average accuracy of 85.72% for the three-fold cross-validation results. The research finding shows that the DT scores the highest accuracy of 93.22%, precision of 92.85, and recall of 92.81 among the three ML algorithms. The ability of the ML algorithm to handle multivariate outputs of 13 different flood textual statuses provides the means of manifesting explainable artificial intelligence and enables the IoT-FSP model to act as an early warning and flood monitoring system.

[1]  A. Zhu,et al.  Application of fuzzy weight of evidence and data mining techniques in construction of flood susceptibility map of Poyang County, China. , 2018, The Science of the total environment.

[2]  Mohamed Elhoseny,et al.  An Adaptive Protection of Flooding Attacks Model for Complex Network Environments , 2021, Secur. Commun. Networks.

[3]  Mohamed Elhoseny,et al.  A New Multi-Agent Feature Wrapper Machine Learning Approach for Heart Disease Diagnosis , 2021, Computers, Materials & Continua.

[4]  Kwok-wing Chau,et al.  Flood Prediction Using Machine Learning Models: Literature Review , 2018, Water.

[5]  Bashar Ahmed Khalaf,et al.  A Comparison of Three Machine Learning Algorithms in the Classification of Network Intrusion , 2020, ACeS.

[6]  Wimol San-Um,et al.  A Conceptual Framework for the Design of an Urban Flood Early-Warning System Using a Context-Awareness Approach in Internet-of-Things Platform , 2016 .

[7]  Paul Muñoz,et al.  Flash-Flood Forecasting in an Andean Mountain Catchment—Development of a Step-Wise Methodology Based on the Random Forest Algorithm , 2018, Water.

[8]  H. Pourghasemi,et al.  A GIS-based flood susceptibility assessment and its mapping in Iran: a comparison between frequency ratio and weights-of-evidence bivariate statistical models with multi-criteria decision-making technique , 2016, Natural Hazards.

[9]  Mustafa Neamah Jebur,et al.  Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS , 2013 .

[10]  Biswajeet Pradhan,et al.  Modeling flood susceptibility using data-driven approaches of naïve Bayes tree, alternating decision tree, and random forest methods. , 2019, The Science of the total environment.

[11]  Purvi Prajapati,et al.  Study and Analysis of Decision Tree Based Classification Algorithms , 2018, International Journal of Computer Sciences and Engineering.

[12]  M. A. Hoque,et al.  An efficient heuristic to obtain a better initial feasible solution to the transportation problem , 2015, Appl. Soft Comput..

[13]  Hamid Reza Pourghasemi,et al.  Flood Spatial Modeling in Northern Iran Using Remote Sensing and GIS: A Comparison between Evidential Belief Functions and Its Ensemble with a Multivariate Logistic Regression Model , 2019, Remote. Sens..

[14]  Yong Qin,et al.  Ionic composition, geological signature and environmental impacts of coalbed methane produced water in China , 2021, Energy Sources, Part A: Recovery, Utilization, and Environmental Effects.

[15]  Junaida Sulaiman,et al.  Heavy Rainfall Forecasting Model Using Artificial Neural Network for Flood Prone Area , 2018 .

[16]  J. Adamowski,et al.  An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. , 2019, The Science of the total environment.

[17]  Dieu Tien Bui,et al.  A novel hybrid artificial intelligence approach for flood susceptibility assessment , 2017, Environ. Model. Softw..

[18]  Aida Mustapha,et al.  Experimenting Two Machine Learning Methods in Classifying River Water Quality , 2019, ACRIT.

[19]  Hyung-Sup Jung,et al.  Data Mining Approaches for Landslide Susceptibility Mapping in Umyeonsan, Seoul, South Korea , 2017 .

[20]  Wei Liu,et al.  Flood susceptibility assessment based on a novel random Naïve Bayes method: A comparison between different factor discretization methods , 2020 .

[21]  A-Xing Zhu,et al.  Flood susceptibility assessment in Hengfeng area coupling adaptive neuro-fuzzy inference system with genetic algorithm and differential evolution. , 2018, The Science of the total environment.

[22]  Asmidar Abu Bakar,et al.  Internet of Things (IoT) Architecture for Flood Data Management , 2018 .

[23]  Aida Mustapha,et al.  A fuzzy logic control in adjustable autonomy of a multi-agent system for an automated elderly movement monitoring application , 2018, Int. J. Medical Informatics.

[24]  Aida Mustapha,et al.  Modelling an Adjustable Autonomous Multi-agent Internet of Things System for Elderly Smart Home , 2019, AHFE.

[25]  Aida Mustapha,et al.  Evaluating the Performance of Three Classification Methods in Diagnosis of Parkinson's Disease , 2018, SCDM.

[26]  Lida Xu,et al.  An integrated information system for snowmelt flood early-warning based on internet of things , 2013, Information Systems Frontiers.

[27]  Aida Mustapha,et al.  Data Mining Approach to Herbs Classification , 2018, Indonesian Journal of Electrical Engineering and Computer Science.

[28]  Norlida Hassan,et al.  Comparative Performance of Supervised Learning Algorithms for Flood Prediction in Kemaman, Terengganu , 2021, Journal of Computer Science.

[29]  Osanaiye S. Babatunde A Smart Network Intrusion Detection System based on Network Data Analyzer and Support Vector Machine , 2020, International Journal of Emerging Trends in Engineering Research.

[30]  Zongxue Xu,et al.  Mapping flood susceptibility in mountainous areas on a national scale in China. , 2018, The Science of the total environment.

[31]  B. Pham,et al.  A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. , 2018, The Science of the total environment.

[32]  B. Pradhan,et al.  GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks , 2016, Environmental Earth Sciences.

[33]  Vijender Kumar Solanki,et al.  A Framework for IoT Sensor Data Acquisition and Analysis , 2018, EAI Endorsed Transactions on Internet of Things.