Smart Data Analysis for Water Quality in Catchment Area Monitoring

A water supply system that integrates catchment area monitoring, treatment and distribution are essential infrastructure both in urban and rural places all around the world. The water quality is becoming a key factor to evaluate life quality for most of the people. However, water quality is facing more and more challenges from industrial, agriculture and social pollution. Traditional water quality control research mostly focused on separate aspects, such as different types of physical, chemical or biological indicators. But their work lacks forward-looking for water quality estimation. This can bring huge influence for practical system on people's health. In this paper, we build a smart data analysis method to analyze and estimate the water quality, considering all the water quality standard indicators in a comprehensive environment. Considering as the water source, catchment area monitoring is the origins of water supply system, we select this area as our first stage for the data analysis process. We take an application from the water supply system in Oslo, Norway. Our method also provides Zero-Inflated Poisson Regression (ZIP) and Zero-inflated Negative Binomial Regression (ZINB) models to estimate biological water quality indicators based on the real historical data. For the results, we compare different influence factors, and show that our models can accurately estimate the biological indicators evolution process, especially for the trend. In addition, we also share interesting findings from the data analysis process, as well as providing future control decision support in water supply system.

[1]  Hossein Orouji,et al.  Modeling of Water Quality Parameters Using Data-Driven Models , 2013 .

[2]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[3]  Omid Bozorg-Haddad,et al.  Modeling Water-Quality Parameters Using Genetic Algorithm–Least Squares Support Vector Regression and Genetic Programming , 2017 .

[4]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[5]  E. Fallah-Mehdipour,et al.  Integration of Shuffled Frog Leaping Algorithm and Support Vector Regression for Prediction of Water Quality Parameters , 2016, Water Resources Management.

[6]  Wen Hu,et al.  Design and Deployment of a Remote Robust Sensor Network: Experiences from an Outdoor Water Quality Monitoring Network , 2007 .

[7]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[8]  Dermot Diamond,et al.  SmartCoast: A Wireless Sensor Network for Water Quality Monitoring , 2007, 32nd IEEE Conference on Local Computer Networks (LCN 2007).

[9]  Dimitri P. Bertsekas,et al.  Approximate Dynamic Programming , 2017, Encyclopedia of Machine Learning and Data Mining.

[10]  Fi-John Chang,et al.  Modeling water quality in an urban river using hydrological factors--data driven approaches. , 2015, Journal of environmental management.

[11]  Ibrahim A. Hameed,et al.  Comparison of Adaptive Neuro-Fuzzy Inference System (ANFIS) and Gaussian Process for Machine Learning (GPML) Algorithms for the Prediction of Norovirus Concentration in Drinking Water Supply , 2017, Trans. Large Scale Data Knowl. Centered Syst..

[12]  Ibrahim A. Hameed,et al.  Adaptive neuro-fuzzy inference system for predicting norovirus in drinking water supply , 2017, 2017 International Conference on Informatics, Health & Technology (ICIHT).

[13]  R. Mihăiescu,et al.  European Union Water framework directive. , 2009 .

[14]  R. Axler,et al.  Interactive Technologies for Collecting and Visualizing Water Quality Data , 2022 .

[15]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[16]  Matthieu Komorowski,et al.  Exploratory Data Analysis , 2016 .

[17]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[18]  René Heideman,et al.  A miniature porous aluminum oxide-based flow-cell for online water quality monitoring using bacterial sensor cells. , 2015, Biosensors & bioelectronics.