In this paper, a new classification method for production water is proposed, based on so real-time measured parameters. The classification method consists of three steps: 1) An initial classification of the Water Quality Index is computed using the method proposed by KUMAR; 2) Feature selection based on random forest (specifically based on the method varSelRF); and 3) Training of classifiers using different configurations of heuristic decision trees. A total of 4 datasets (5090 instances of 8 features each) representative of water samples from Portugal, Canada, Mexico, and Romania were used for method validation. The dataset was group in two families of different classes: binary (good and regular water) and multiclass (good, regular and bad water). Final classification accuracy reached 94.85% for the binary family and 91.73% for the multiclass family. The contribution consists of a continuous monitoring system to detect (in real time) dramatic changes in water quality and provide tools for historical studies behaviour in strategic points.
[1]
J. Camejo,et al.
Classifier for drinking water quality in real time
,
2013,
2013 International Conference on Computer Applications Technology (ICCAT).
[2]
R. Haught,et al.
Real-time contaminant detection and classification in a drinking water pipe using conventional water quality sensors: techniques and experimental results.
,
2009,
Journal of environmental management.
[3]
R D Harkins,et al.
An objective water quality index.
,
1974,
Journal - Water Pollution Control Federation.
[4]
W. Hoeffding,et al.
Rank Correlation Methods
,
1949
.
[5]
Babu J. Alappat,et al.
NSF-Water Quality Index: Does It Represent the Experts’ Opinion?
,
2009
.
[6]
M. Kendall.
Rank Correlation Methods
,
1949
.
[7]
Edward A. McBean,et al.
Real-Time Water Quality Monitoring: Assessment of Multisensor Data Using Bayesian Belief Networks
,
2012
.