Predictive modelling is a process used in predictive analytics to create a statistical model of future behaviour. Predictive analytics is the area of data mining concerned with forecasting probabilities and trends. On the other hand, Artificial Intelligence (AI) concerns itself with intelligent behaviour, i.e. the things that make us seem intelligent. Following this process of thinking, in this work the main goal is the assessment of the impact of using AI based tools for the development of intelligent predictive models, in particular those that may be used to establish the conditions in which the levels of manganese and turbidity in water supply are high. Indeed, one of the main problems that the water treatment plant at Monte Novo (in Evora, Portugal) uncovers is the appearance of high levels of manganese and turbidity in treated water, which sometimes exceed the parametric values established in Portuguese Law, respectively 50 μg dm -3 and 4 NTU. In this study we tried to find answers to the above problem by building predictive models. The models we developed shall enable the prediction of manganese and turbidity levels in treated water, in order to ensure that the water supply does not affect public health in a negative way and obeys the current legislation. The software used in this study was the Clementine 11.1. The C5.0 Algorithm was also used as a means of introducing Decision Trees and the KMeans Algorithm was used to construct clustering models. The data in the database was collected from 2005 to 2006 and includes reservoir water quality data, treated water data and volumes of water stored in the reservoir.
[1]
Efraim Turban,et al.
Decision support systems and intelligent systems
,
1997
.
[2]
Stan Matwin,et al.
Evaluating Data Mining Models: A Pattern Language
,
2002
.
[3]
Titrade Cristina-Maria.
Data Mining Technologies
,
2010
.
[4]
Bhavani Thuraisingham,et al.
Data Mining: Technologies, Techniques, Tools, and Trends
,
1998
.
[5]
Paul S. Bradley,et al.
Refining Initial Points for K-Means Clustering
,
1998,
ICML.
[6]
Dorian Pyle,et al.
Data Preparation for Data Mining
,
1999
.
[7]
Kwok-wing Chau,et al.
A review on integration of artificial intelligence into water quality modelling.
,
2006,
Marine pollution bulletin.
[8]
S. Džeroski.
Sciences: environmental sciences
,
2002
.
[9]
A. E. Greenberg,et al.
Standard Methods for the Examination of Water and Wastewater seventh edition
,
2013
.
[10]
Jan-Tai Kuo,et al.
USING ARTIFICIAL NEURAL NETWORK FOR RESERVOIR EUTROPHICATION PREDICTION
,
2007
.
[11]
Awwa,et al.
Standard Methods for the examination of water and wastewater
,
1999
.
[12]
Paulo Cortez,et al.
Ecological mining: a case study on dam water quality
,
2005
.
[13]
J. Ross Quinlan,et al.
Bagging, Boosting, and C4.5
,
1996,
AAAI/IAAI, Vol. 1.
[14]
Jiawei Han,et al.
Data Mining: Concepts and Techniques
,
2000
.
[15]
R. Suganya,et al.
Data Mining Concepts and Techniques
,
2010
.
[16]
A. Bachelor.
GLOSSARY OF TERMS GLOSSARY OF TERMS
,
2010
.