Chemometrical exploration of the wet precipitation chemistry from the Austrian Monitoring Network (1988-1999).

The present paper deals with the application of different chemometric methods to an environmental data set derived from the monitoring of wet precipitation in Austria (1988-1999). These methods are: principal component analysis (PCA); projection pursuit (PP); density-based spatial clustering of application with noise (DBSCAN); ordering points to identify the clustering structures (OPTICS); self-organizing maps (SOM), also called the Kohonen network; and the neural gas (NG) network. The aim of the study is to introduce some new approaches into environmetrics and to compare their usefulness with already existing techniques for the classification and interpretation of environmental data. The density-based approaches give information about the occurrence of natural clusters in the studied data set, which, however, do not occur in the case presented here; information about high-density zones (very similar samples) and extreme samples is also obtained. The partitioning techniques (clustering, but also neural gas and Kohonen networks) offer an opportunity to classify the objects of interest into several defined groups, the patterns of ionic concentration of which can be studied in detail. The visual aids, such as the color map and the Kohonen map, for each site are very helpful in understanding the relationships between samples and between samples and variables. All methods, and in particular projection pursuit, give information about samples with extreme characteristics.

[1]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[2]  Desire L. Massart,et al.  Looking for Natural Patterns in Analytical Data, 2. Tracing Local Density with OPTICS , 2002, J. Chem. Inf. Comput. Sci..

[3]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[4]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[5]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[6]  Eric R. Ziegel,et al.  Handbook of Chemometrics and Qualimetrics, Part B , 2000, Technometrics.

[7]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[8]  Gerrit Kateman,et al.  USING ARTIFICIAL NEURAL NETWORKS FOR SOLVING CHEMICAL PROBLEMS .2. KOHONEN SELF-ORGANIZING FEATURE MAPS AND HOPFIELD NETWORKS , 1994 .

[9]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[10]  Thomas Jackson,et al.  Neural Computing - An Introduction , 1990 .

[11]  Desire L. Massart,et al.  Effect of different preprocessing methods for principal component analysis applied to the composition of mixtures: Detection of impurities in HPLC—DAD , 1994 .

[12]  S. D. Jong,et al.  Handbook of Chemometrics and Qualimetrics , 1998 .

[13]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[14]  D. Massart,et al.  Looking for natural patterns in data: Part 1. Density-based approach , 2001 .

[15]  Hans Puxbaum,et al.  A high density network for wet only precipitation chemistry sampling in Austria , 1996 .

[16]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[17]  Desire L. Massart,et al.  Similarities and Differences among Multivariate Display Techniques Illustrated by Belgian Cancer Mortality Distribution Data , 1988 .

[18]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[19]  S. Tsakovski,et al.  Trend, seasonal and multivariate modelling study of wet precipitation data from the Austrian Monitoring Network (1990-1997). , 2000, Journal of environmental monitoring : JEM.

[20]  H. Sator,et al.  Cluster Analysis in Clinical Chemistry: A Model , 1987 .

[21]  Frederik Questier,et al.  The Neural-Gas network for classifying analytical data , 2002 .

[22]  Stefan Tsakovski,et al.  Classification and receptor modeling of wet precipitation data from central Austria (1984–1993) , 1999 .

[23]  D. Massart,et al.  Sequential projection pursuit using genetic algorithms for data mining of analytical data. , 2000, Analytical chemistry.

[24]  M. Hubert,et al.  A fast method for robust principal components with applications to chemometrics , 2002 .

[25]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[26]  Frederik Questier,et al.  Data structures and data transformations for clustering chemical data , 2001 .