Comparison of self-organizing maps classification approach with cluster and principal components analysis for large environmental data sets.

Three classification techniques (loading and score projections based on principal components analysis (PCA), cluster analysis (CA) and self-organizing maps (SOM)) were applied to a large environmental data set of chemical indicators of river water quality. The study was carried out by using long-term water quality monitoring data. The advantages of SOM algorithm and its classification and visualization ability for large environmental data sets are stressed. The results obtained allowed detecting natural clusters of monitoring locations with similar water quality type and identifying important discriminant variables responsible for the clustering. SOM clustering allows simultaneous observation of both spatial and temporal changes in water quality. The chemometric approach revealed different patterns of monitoring sites conditionally named "tributary", "urban", "rural" or "background". This objective separation could lead to an optimization of river monitoring nets and to a better tracing natural and anthropogenic changes along the river stream.

[1]  Jürgen W. Einax,et al.  River Pollution Data Interpreted by Means of Chemometric Methods , 1998 .

[2]  I. Stanimirova,et al.  Environmetric modeling and interpretation of river water monitoring data , 2002, Analytical and bioanalytical chemistry.

[3]  Erkki Oja,et al.  Engineering applications of the self-organizing map , 1996, Proc. IEEE.

[4]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[5]  Nandish M. Mattikalli Time Series Analysis of Historical Surface Water Quality Data of the River Glen Catchment, U.K. , 1996 .

[6]  Varghese S. Jacob,et al.  Comparative performance of the FSCL neural net and K-means algorithm for market segmentation , 1996 .

[7]  David West,et al.  A comparison of SOM neural network and hierarchical clustering methods , 1996 .

[8]  Xuefeng Yan,et al.  SOM Integrated with CCA for the Feature Map and Classification of Complex Chemical Patterns , 2001, Comput. Chem..

[9]  Miklas Scholz,et al.  Application of the self-organizing map (SOM) to assess the heavy metal removal performance in experimental constructed wetlands. , 2006, Water research.

[10]  Stefan Tsakovski,et al.  Environmetrical Treatment of Water Quality Survey Data from Yantra River, Bulgaria , 2000 .

[11]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[12]  G. W. Milligan,et al.  An examination of the effect of six types of error perturbation on fifteen clustering algorithms , 1980 .

[13]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[14]  Bogusław Buszewski,et al.  Application of chemometrics in river water classification. , 2006, Water research.

[15]  C Cun,et al.  Time series analysis on chlorides, nitrates, ammonium and dissolved oxygen concentrations in the Seine river near Paris. , 1997, The Science of the total environment.

[16]  Darinka Brodnjak-Vončina,et al.  Chemometrics characterisation of the quality of river water , 2002 .

[17]  Emilio Marengo,et al.  How chemometrics can helpfully assist in evaluating environmental data. Lagoon water , 1995 .

[18]  Abhijit Mukherjee,et al.  Self-organizing neural network for identification of natural modes , 1998 .

[19]  W. D. Alberto,et al.  Pattern recognition techniques for the evaluation of spatial and temporal variations in water quality. A case study: Suquía River Basin (Cordoba-Argentina). , 2001 .

[20]  H. Reincke,et al.  Dioxin (PCDD/F) in the river Elbe--investigations of their origin by multivariate statistical methods. , 1998, Chemosphere.

[21]  Sovan Lek,et al.  A comparison of self-organizing map algorithm and some conventional statistical methods for ecological community ordination , 2001 .

[22]  Carolina Mendiguchía,et al.  Using chemometric tools to assess anthropogenic effects in river water. A case study: Guadalquivir River (Spain) , 2004 .

[23]  Shang-Lien Lo,et al.  Diagnosing reservoir water quality using self-organizing maps and fuzzy theory. , 2002, Water research.

[24]  Jürgen W. Einax,et al.  Chemometrics in Environmental Analysis , 1997 .

[25]  Young-Seuk Park,et al.  Application of a self-organizing map to select representative species in multivariate analysis: A case study determining diatom distribution patterns across France , 2006, Ecol. Informatics.