Towards a time and cost effective approach to water quality index class prediction

Abstract The development of water quality prediction models is an important step towards better water quality management of rivers. The traditional method for computing the water quality index (WQI) is prone to errors because the analysis of the water quality parameters is protracted, and gathering and analyzing water samples takes considerable effort and time. In addition, the cost of measuring some of the parameters through experimental testing is very high. The water quality of rivers in Malaysia is ranked into five classes based on the WQI. The WQI is a function of six water quality parameters: ammoniacal nitrogen (NH3-N), biochemical oxygen demand (BOD), chemical oxygen demand (COD), dissolved oxygen (DO), pH, and suspended solids (SS). In this research, the decision tree machine learning technique is used to predict the WQI for the Klang River, one of the most polluted rivers in Malaysia, and to classify it within a specific water quality class. Modeling experiments are designed to test the prediction and classification accuracy of the model under various scenarios composed of different combinations of water quality parameters. Results show that the proposed prediction model has promising potential to predict the WQI class. Moreover, the proposed model offers a more efficient and cost-effective approach for the computation and prediction of the WQI.
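
To make the scenario idea in the abstract concrete, the following is a minimal sketch of a decision tree classifier that predicts a WQI class from a chosen subset of parameters. It is not the authors' model or data: the sample values, the class labels, the feature order (DO, BOD, SS) and the helper names `gini`, `best_split`, `build_tree`, `predict` and `accuracy` are all assumptions made for illustration, and the tree is a plain Gini-based binary splitter written from scratch.

```python
from collections import Counter

# Hypothetical toy samples: (DO mg/L, BOD mg/L, SS mg/L) -> WQI class label.
# Values and labels are invented for illustration; they are NOT Klang River data.
SAMPLES = [
    ((7.5, 1.0, 20.0), "II"),
    ((7.0, 2.0, 30.0), "II"),
    ((5.0, 4.0, 80.0), "III"),
    ((4.5, 5.0, 120.0), "III"),
    ((2.5, 9.0, 200.0), "IV"),
    ((2.0, 11.0, 260.0), "IV"),
]

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, features):
    """Return (weighted impurity, feature index, threshold), or None if no split exists."""
    best = None
    for f in features:
        values = sorted({x[f] for x, _ in rows})
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for x, y in rows if x[f] <= t]
            right = [y for x, y in rows if x[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, features, depth=0, max_depth=3):
    """Build a tree as nested tuples (feature, threshold, left, right); leaves are labels."""
    labels = [y for _, y in rows]
    split = best_split(rows, features)
    if len(set(labels)) == 1 or depth == max_depth or split is None:
        return Counter(labels).most_common(1)[0][0]
    _, f, t = split
    left = [r for r in rows if r[0][f] <= t]
    right = [r for r in rows if r[0][f] > t]
    return (f, t,
            build_tree(left, features, depth + 1, max_depth),
            build_tree(right, features, depth + 1, max_depth))

def predict(tree, x):
    """Walk the tree from the root until a leaf label is reached."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree

def accuracy(tree, rows):
    """Fraction of rows whose predicted class matches the given label."""
    return sum(predict(tree, x) == y for x, y in rows) / len(rows)

# Two input scenarios: all three parameters versus suspended solids only.
full_tree = build_tree(SAMPLES, features=[0, 1, 2])
ss_only_tree = build_tree(SAMPLES, features=[2])
```

Calling `accuracy(full_tree, SAMPLES)` and `accuracy(ss_only_tree, SAMPLES)` compares the two scenarios on the training rows. A real study would score each scenario on held-out samples, since a cheaper input subset is only useful if its accuracy holds up on data the tree has not seen.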
