Modelling geosmin concentrations in three sources of raw water in Quebec, Canada

The presence of off-flavour compounds such as geosmin, often found in raw water, significantly reduces the organoleptic quality of distributed water and diverts the consumer from its use. To adapt water treatment processes to eliminate these compounds, it is necessary to be able to identify them quickly. Routine analysis could be considered a solution, but it is expensive and delays associated with obtaining the results of analysis are often important, thereby constituting a serious disadvantage. The development of decision-making tools such as predictive models seems to be an economic and feasible solution to counterbalance the limitations of analytical methods. Among these tools, multi-linear regression and principal component regression are easy to implement. However, due to certain disadvantages inherent in these methods (multicollinearity or non-linearity of the processes), the use of emergent models involving artificial neurons networks such as multi-layer perceptron could prove to be an interesting alternative. In a previous paper (Parinet et al., Water Res 44: 5847-5856, 2010), the possible parameters that affect the variability of taste and odour compounds were investigated using principal component analysis. In the present study, we expand the research by comparing the performance of three tools using different modelling scenarios (multi-linear regression, principal component regression and multi-layer perceptron) to model geosmin in drinking water sources using 38 microbiological and physicochemical parameters. Three very different sources of water, in terms of quality, were selected for the study. These sources supply drinking water to the Québec City area (Canada) and its vicinity, and were monitored three times per month over a 1-year period. Seven different modelling methods were tested for predicting geosmin in these sources. The comparison of the seven different models showed that simple models based on multi-linear regression provide sufficient predictive capacity with performance levels comparable to those obtained with artificial neural networks. The multi-linear regression model (R2 = 0.657, <0.001) used only four variables (phaeophytin, sum of green algae, chlorophyll-a and potential Redox) in comparison with ten variables (potassium, heterotrophic bacteria, organic nitrogen, total nitrogen, phaeophytin, total organic carbon, sum of green algae, potential Redox, UV absorbance at 254 nm and atypical bacteria) for the best model obtained with artificial neural networks (R2 = 0.843).

[1]  S. Watson,et al.  Actinomycetes in relation to taste and odour in drinking water: myths, tenets and truths. , 2006, Water research.

[2]  B. Rosen,et al.  Accumulation and Release of Geosmin during the Growth Phases of Anabaena circinalis (Kutz.) Rabenhorst , 1992 .

[3]  Manuel J. Rodríguez,et al.  Perception of drinking water in the Quebec City region (Canada): the influence of water quality and consumer location in the distribution system. , 2004, Journal of environmental management.

[4]  W. T. Blevins,et al.  Effects of carbon source, phosphorus concentration, and several micronutrients on biomass and geosmin production by Streptomyces halstedii , 2001, Journal of Industrial Microbiology and Biotechnology.

[5]  Tormod Næs,et al.  A unified description of classical classification methods for multicollinear data , 1998 .

[6]  Riyaz Shariff,et al.  Implementing artificial neural network models for real-time water colour forecasting in a water treatment plant , 2004 .

[7]  Bernard Legube,et al.  Principal component analysis: an appropriate tool for water quality evaluation and management—application to a tropical lake system , 2004 .

[8]  Manuel J. Rodríguez,et al.  Assessing empirical linear and non-linear modelling of residual chlorine in urban drinking water systems , 1998, Environ. Model. Softw..

[9]  A. Horne,et al.  Horizontal distribution of geosmin in a reservoir before and after copper treatment , 1999 .

[10]  Young-Seuk Park,et al.  Community patterning and identification of predominant factors in algal bloom in Daechung Reservoir (Korea) using artificial neural networks , 2007 .

[11]  W. T. Blevins,et al.  Environmental and nutritional factors affecting geosmin synthesis by Anabaena sp. , 2001, Water research.

[12]  D. Millie,et al.  EVALUATING THE RELATIONSHIP BETWEEN PHOTOPIGMENT SYNTHESIS AND 2‐METHYLISOBORNEOL ACCUMULATION IN CYANOBACTERIA , 1999 .

[13]  Manuel J. Rodríguez,et al.  Automated analysis of geosmin, 2-methyl-isoborneol, 2-isopropyl-3-methoxypyrazine, 2-isobutyl-3-methoxypyrazine and 2,4,6-trichloroanisole in water by SPME-GC-ITDMS/MS , 2011 .

[14]  C. Dionigi,et al.  Effects of temperature and oxygen concentration on geosmin production by Streptomyces tendae and Penicillium expansum , 1994 .

[15]  J. Yu,et al.  Occurrence of odour-causing compounds in different source waters of China , 2009 .

[16]  F. Jüttner,et al.  Dissolved and particle-bound geosmin in a mesotrophic lake (lake Zürich): spatial and seasonal distribution and the effect of grazers , 1999 .

[17]  Holger R. Maier,et al.  Neural network based modelling of environmental variables: A systematic approach , 2001 .

[18]  A. Bruchet Solved and unsolved cases of taste and odor episodes in the files of Inspector Cluzeau , 1999 .

[19]  R. John Linear Statistical Models: An Applied Approach , 1986 .

[20]  M. Gevrey,et al.  Review and comparison of methods to study the contribution of variables in artificial neural network models , 2003 .

[21]  Y. Ishibashi,et al.  Interactions between bacteria-free Anabaena macrospora clone and bacteria isolated from unialgal culture , 1995 .

[22]  Y. Tsuchiya,et al.  Characterization of producing 2-methylisoborneol and geosmin , 1999 .

[23]  Brad A. Myers A new model for handling input , 1990, TOIS.

[24]  Holger R. Maier,et al.  Selection of input variables for data driven models: An average shifted histogram partial mutual information estimator approach , 2009 .

[25]  Manuel J. Rodríguez,et al.  Influence of water quality on the presence of off-flavour compounds (geosmin and 2-methylisoborneol). , 2010, Water research.

[26]  D. Degobbis,et al.  Relationships between heterotrophic bacteria and cyanobacteria in the northern Adriatic in relation to the mucilage phenomenon. , 2005, The Science of the total environment.

[27]  J. Lund,et al.  The inverted microscope method of estimating algal numbers and the statistical basis of estimations by counting , 1958, Hydrobiologia.

[28]  Holger R. Maier,et al.  Use of artificial neural networks for predicting optimal alum doses and treated water quality parameters , 2004, Environ. Model. Softw..

[29]  O. Köster,et al.  Occurrence of dissolved and particle-bound taste and odor compounds in Swiss lake waters. , 2009, Water research.

[30]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[31]  S. Chellam,et al.  Disinfection by-product formation following chlorination of drinking water: artificial neural network models and changes in speciation with treatment. , 2010, The Science of the total environment.

[32]  Handan Çamdevýren,et al.  Use of principal component scores in multiple linear regression models for prediction of Chlorophyll-a in reservoirs , 2005 .

[33]  P. Levallois,et al.  Evaluation of consumer attitudes on taste and tap water alternatives in Québec , 1999 .

[34]  F. Jüttner,et al.  Biochemical and Ecological Control of Geosmin and 2-Methylisoborneol in Source Waters , 2007, Applied and Environmental Microbiology.

[35]  Seo Jin Ki,et al.  Determination of the optimal parameters in regression models for the prediction of chlorophyll-a: a case study of the Yeongsan Reservoir, Korea. , 2009, The Science of the total environment.

[36]  T. Maekawa,et al.  Assessment for the complicated occurrence of nuisance odours from phytoplankton and environmental factors in a eutrophic lake , 2004 .

[37]  S. J. Hayes,et al.  Geosmin As an Odorous Metabolite In Cultures of A Free‐Living Amoeba, Vannella Species (Gymnamoebia, Vannellidae) , 1991 .

[38]  J. Frisvad,et al.  Characterization of volatile metabolites from 47 Penicillium taxa , 1995 .

[39]  I. Suffet,et al.  The Drinking Water Taste and Odor Wheel for the Millennium: Beyond Geosmin and 2-Methylisoborneol , 1999 .

[40]  Gerald T. Blain,et al.  Managing Taste and Odor Problems in a Eutrophic Drinking Water Reservoir , 2002 .

[41]  Holger R. Maier,et al.  Input determination for neural network models in water resources applications. Part 1—background and methodology , 2005 .

[42]  A. Post,et al.  Transient states of geosmin, pigments, carbohydrates and proteins in continuous cultures of Oscillatoria brevis induced by changes in nitrogen supply , 1988, Archives of Microbiology.

[43]  W. T. Blevins,et al.  Comparative physiology of geosmin production by Streptomyces halstedii and anabaena sp. , 1995 .

[44]  R. Oliver,et al.  Physiology of Geosmin Production by Anabaena circinalis Isolated from the Murrumbidgee River, Australia , 1992 .

[45]  Chien-Hung Wei Analysis of artificial neural network models for freeway ramp metering control , 2001, Artif. Intell. Eng..

[46]  Frank Denoyelles,et al.  Development of predictive models for geosmin-related taste and odor in Kansas, USA, drinking water reservoirs. , 2009, Water research.

[47]  S. Watson,et al.  Periphyton: a primary source of widespread and severe taste and odour. , 2004, Water science and technology : a journal of the International Association on Water Pollution Research.

[48]  S. Watson,et al.  Odours from pulp mill effluent treatment ponds: the origin of significant levels of geosmin and 2-methylisoborneol (MIB). , 2003, Chemosphere.

[49]  Ximing Cai,et al.  Input variable selection for water resources systems using a modified minimum redundancy maximum relevance (mMRMR) algorithm , 2009 .

[50]  Can Ozan Tan,et al.  Methodological issues in building, training, and testing artificial neural networks in ecological applications , 2005, q-bio/0510017.