Using principal component analysis and fuzzy c–means clustering for the assessment of air quality monitoring

Determining whether a reduction can be made in the total number of monitoring stations within the Air Quality Monitoring Network is very important since in case of necessity, the devices at one group of stations having similar air pollution characteristics can be transferred to another zone. This would significantly decrease the capital investment and operational cost. Therefore, the objective of this study was grouping the monitoring stations that share similar air pollution characteristics by using the methods of principal component analysis (PCA) and fuzzy c– means (FCM). In addition, this study also enables determining the emission sources, evaluating the performances of the methods and examining the zone in terms of pollution. In the classification of monitoring stations, different groups were formed depending on both the method of analysis and the type of pollutants. As a result of PCA, 5 and 3 classes have been determined for SO2 and PM10, respectively. This shows that the number of monitoring stations can be decreased. When reduced classes were analyzed, it was observed that a clear distinction cannot be made considering the affected source type. During the implementation of the FCM method, in order to facilitate comparison with the PCA, the monitoring stations were classified into 5 and 3 groups for SO2 and PM10, respectively. When the results were analyzed, it was seen that the uncertainty in PCA was reduced. When the two methods are compared, FCM was found to provide more significant results than PCA. The evaluation in terms of pollution, the results of the study showed that PM10 exceeded the limit values at all the monitoring stations, and SO2 exceeded the limit values at only 3 of the 22 stations.

[1]  Wei-Zhen Lu,et al.  Performance assessment of air quality monitoring networks using principal component analysis and clu , 2011 .

[2]  R. Harrison,et al.  The use of trajectory cluster analysis to examine the long-range transport of secondary inorganic aerosol in the UK , 2005 .

[3]  A. Ç. Köne,et al.  A comparison for Turkish provinces’ performance of urban air pollution , 2012 .

[4]  Mar Viana,et al.  Determination of direct and fugitive PM emissions in a Mediterranean harbour by means of classic and novel tracer methods. , 2009, Journal of environmental management.

[5]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[6]  Michael D'Rosario,et al.  Share price clustering in Mexico , 2011 .

[7]  C. Jing,et al.  Principal component analysis of fluoride geochemistry of groundwater in Shanxi and Inner Mongolia, China , 2013 .

[8]  Witold Pedrycz,et al.  P-FCM: a proximity -- based fuzzy clustering , 2004, Fuzzy Sets Syst..

[9]  G. Tsekouras,et al.  A new approach for measuring the validity of the fuzzy c -means algorithm , 2004 .

[10]  Yadong Wang,et al.  Improving fuzzy c-means clustering based on feature-weight learning , 2004, Pattern Recognit. Lett..

[11]  Yang Yan,et al.  Fuzzy semi-supervised co-clustering for text documents , 2013, Fuzzy Sets Syst..

[12]  Kuo-Lung Wu,et al.  Unsupervised possibilistic clustering , 2006, Pattern Recognit..

[13]  Andrew B Lawson,et al.  Identifying natural and anthropogenic sources of metals in urban and rural soils using GIS-based data, PCA, and spatial interpolation. , 2009, Environmental pollution.

[14]  Miin-Shen Yang,et al.  Fuzzy clustering algorithms for mixed feature variables , 2004, Fuzzy Sets Syst..

[15]  S. Ramathilagam,et al.  Strong fuzzy c-means in medical image data analysis , 2012, J. Syst. Softw..

[16]  Wing-tat Hung,et al.  Interpretation of air quality in relation to monitoring station's surroundings , 2009 .

[17]  T. Warren Liao,et al.  II, A fuzzy c-means variant for the generation of fuzzy term sets , 2003, Fuzzy Sets Syst..

[18]  Shengrui Wang,et al.  FCM-Based Model Selection Algorithms for Determining the Number of Clusters , 2004, Pattern Recognit..

[19]  Bilge Özbay Modeling the Effects of Meteorological Factors on SO2 and PM10 Concentrations with Statistical Approaches , 2012 .

[20]  Gabriel Ibarra-Berastegi,et al.  Assessing spatial variability of SO2 field as detected by an air quality network using Self-Organizing Maps, cluster, and Principal Component Analysis , 2009 .

[21]  A. Furey,et al.  The assessment of the presence and main constituents of particulate matter ten microns (PM10) in Irish, rural and urban air , 2010 .

[22]  José C.M. Pires,et al.  Management of air quality monitoring using principal component and cluster analysis—Part I: SO2 and PM10 , 2008 .

[23]  Beatrice Lazzerini,et al.  A new fuzzy relational clustering algorithm based on the fuzzy C-means algorithm , 2005, Soft Comput..

[24]  S. Wold,et al.  Fuzzy clustering of 627 alcohols, guided by a strategy for cluster analysis of chemical compounds for combinatorial chemistry , 1998 .

[25]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[26]  Evelina Lamma,et al.  Unsupervised and supervised learning in cascade for petroleum geology , 2012, Expert Syst. Appl..

[27]  Yanling Li,et al.  An automatic fuzzy c-means algorithm for image segmentation , 2009, Soft Comput..