Application of Self-Organizing Map in Aerosol Single Particles Data Clustering

In this paper, a self-organizing map (SOM) is used to visualize and cluster aerosol single-particle mass spectra collected by aerosol time-of-flight mass spectrometry (ATOFMS). Given the characteristics of aerosol particle data, the TF-IDF weighting scheme, widely used in document clustering, is employed for preprocessing. For the clustering analysis, a two-level framework is proposed: the SOM first clusters the input data to produce primary results, and these results are then clustered again by a semiautomatic k-means algorithm. To demonstrate the validity of the clustering, the chemical significance of the cluster centroids is also investigated; inorganic salts, "calcium-containing" particles, biogenic soot particles, carbonaceous particles, and other particle types are identified.
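
The two-level framework described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes each particle is a row of non-negative peak intensities over m/z bins (bins play the role of "terms", particles the role of "documents"), uses a smoothed IDF formula, and substitutes plain k-means with a user-chosen `k` for the semiautomatic variant, whose details the abstract does not give. All function names, the map size, and the training parameters are hypothetical choices for illustration.

```python
import numpy as np

def tfidf(spectra):
    """TF-IDF weighting: m/z bins act as terms, particles as documents."""
    spectra = np.asarray(spectra, dtype=float)
    n_particles = spectra.shape[0]
    row_sums = spectra.sum(axis=1, keepdims=True)
    tf = spectra / np.where(row_sums == 0, 1.0, row_sums)  # relative intensity
    df = (spectra > 0).sum(axis=0)                         # particles containing each peak
    idf = np.log((1 + n_particles) / (1 + df)) + 1.0       # smoothed IDF (an assumption)
    weighted = tf * idf
    norms = np.linalg.norm(weighted, axis=1, keepdims=True)
    return weighted / np.where(norms == 0, 1.0, norms)     # unit-length rows

def train_som(data, rows=4, cols=4, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Level 1: online SOM training; returns one prototype vector per map unit."""
    rng = np.random.default_rng(seed)
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    weights = data[rng.integers(0, len(data), rows * cols)].copy()
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for idx in rng.permutation(len(data)):
            frac = step / n_steps
            lr = lr0 * (1 - frac)                      # decaying learning rate
            sigma = max(sigma0 * (1 - frac), 0.5)      # shrinking neighborhood
            x = data[idx]
            bmu = np.argmin(((weights - x) ** 2).sum(axis=1))  # best-matching unit
            grid_d2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
            h = np.exp(-grid_d2 / (2 * sigma ** 2))    # Gaussian neighborhood on the grid
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights

def kmeans(points, k, iters=50, seed=0):
    """Level 2: plain k-means (stand-in for the semiautomatic variant)."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), k, replace=False)].copy()
    for _ in range(iters):
        d2 = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):                    # keep old centroid if cluster is empty
                centroids[j] = points[labels == j].mean(axis=0)
    return labels, centroids

def two_level_cluster(spectra, k, **som_kwargs):
    """Preprocess with TF-IDF, train the SOM, cluster its prototypes, label particles."""
    data = tfidf(spectra)
    prototypes = train_som(data, **som_kwargs)
    proto_labels, _ = kmeans(prototypes, k)
    d2 = ((data[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=2)
    return proto_labels[d2.argmin(axis=1)]             # each particle inherits its unit's cluster
```

Clustering the SOM prototypes instead of the raw spectra keeps the second stage cheap, since k-means only sees `rows * cols` vectors. Calling `two_level_cluster(spectra, k=5)` on an array of shape `(n_particles, n_mz_bins)` returns one cluster label per particle; cluster centroids in the original intensity space can then be obtained by averaging the spectra that share a label.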
