An Approach to Clustering Using the Expectation-Maximization and Selection of Attributes ReliefF Applied to Water Treatment Plants process

The water treatment process contains several physico-chemical parameters relevant to decision making and the water quality scenarios’ identification. Some scenarios are evident and can be observed without the application of mathematical or statistical techniques, however some of these scenarios are difficult to distinguish, and it is necessary to use computational intelligence techniques for solution. In this context, the paper aims to show the application of the expectation-maximization (EM) algorithm for data clusters of the coagulation process and the ReliefF algorithm to determine the importance of the physico-chemical parameters, using the WEKA tool to analyze historical dataset of a water treatment plant. The results were favorable to the scenarios’ identification and to determine the relevance of the parameters related to the process.

[1]  A Methodology for Clustering Lakes in Alberta on the basis of Water Quality Parameters , 2011 .

[2]  Soo-Woong Park,et al.  Decision model for coagulant dosage using genetic programming and multivariate statistical analysis for coagulation/flocculation at water treatment process , 2008 .

[3]  S. Heddam,et al.  ANFIS-based modelling for coagulant dosage in drinking water treatment plant: a case study , 2012, Environmental Monitoring and Assessment.

[4]  Andy Baker,et al.  New data mining and calibration approaches to the assessment of water treatment efficiency , 2012, Adv. Eng. Softw..

[5]  Richard Olawoyin,et al.  Application of artificial neural network (ANN)-self-organizing map (SOM) for the categorization of water, soil and sediment quality in petrochemical regions , 2013, Expert Syst. Appl..

[6]  Mika Liukkonen,et al.  Cluster analysis by self-organizing maps: An application to the modelling of water quality in a treatment process , 2013, Appl. Soft Comput..

[7]  S. J. Stanley,et al.  Development of a full-scale artificial neural network model for the removal of natural organic matter by enhanced coagulation , 1999 .

[8]  Sheikh Abdullah Siti Rozaimah,et al.  Pattern recognition of fractal profiles in coagulation-flocculation process of wastewater via neural network , 2007 .

[9]  R. Fenner,et al.  Weighted Least Squares with Expectation-Maximization Algorithm for Burst Detection in U.K. Water Distribution Systems , 2014 .

[10]  Abiodun Musa Aibinu,et al.  An Intelligent Modeling of Coagulant Dosing System for Water Treatment Plants based on Artificial Neural Network , 2012 .

[11]  A. M. Kalteh,et al.  Review of the self-organizing map (SOM) approach in water resources: Analysis, modelling and application , 2008, Environ. Model. Softw..

[12]  Louis Wehenkel,et al.  Data validation and missing data reconstruction using self-organizing map for water treatment , 2011, Neural Computing and Applications.

[13]  Zoran Kapelan,et al.  Evolutionary Algorithm and Expectation Maximization Strategies for Improved Detection of Pipe Bursts and Other Events in Water Distribution Systems , 2014 .

[14]  Fn Ogwueleka,et al.  OPTIMIZATION OF DRINKING WATER TREATMENT PROCESSES USING ARTIFICIAL NEURAL NETWORK , 2009 .

[15]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[16]  Andrew Blake,et al.  Using expectation-maximisation to learn dynamical models from visual data , 1999, Image Vis. Comput..

[17]  G. McLachlan,et al.  The EM Algorithm and Extensions: Second Edition , 2008 .

[18]  Sou-Sen Leu,et al.  Leak Prediction Model for Water Distribution Networks Created Using a Bayesian Network Learning Approach , 2016, Water Resources Management.