Impact of clustered meteorological parameters on air pollutants concentrations in the region of Annaba, Algeria

Abstract The main objective of this study is the characterization of meteorological conditions in the region of Annaba (Algeria) using clustering tools. The proposed two stages clustering approach is based on using the Self-Organizing Maps (SOMs) and the well known K-means clustering algorithm. Quantitative (using two categories of validity indices) and qualitative criteria were introduced to compare and verify the correctness of the results. The different experiments developed, extracted five classes, which were related to typical meteorological conditions in the area. The obtained meteorological clusters are then used to better elucidate the dependency of meteorology on air quality in the presence of seven measured pollutants. In the current paper, Artificial Neural Networks (ANNs), and more precisely, Multi-Layered Perceptron (MLP) is used for modeling air pollutants, as well as, simulating their behaviour in relation to the meteorological parameters of interest. This behaviour is also investigated with the aid of correlation coefficient, where only results are shown for comparison, several relations and conclusions have been drawn.

[1]  Andrew C. Comrie,et al.  Climatology and forecast modeling of ambient carbon monoxide in Phoenix, Arizona , 1999 .

[2]  S. Dudoit,et al.  A prediction-based resampling method for estimating the number of clusters in a dataset , 2002, Genome Biology.

[3]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[4]  Joydeep Ghosh,et al.  Relationship-based clustering and cluster ensembles for high-dimensional data mining , 2002 .

[5]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jaakko Astola,et al.  Analysis and Visualization of Gene Expression Microarray Data in Human Cancer Using Self-Organizing Maps , 2003, Machine Learning.

[7]  M. Kolehmainen,et al.  Neural networks and periodic components used in air quality forecasting , 2001 .

[8]  Kostas D. Karatzas,et al.  Air pollution modelling with the aid of computational intelligence methods in Thessaloniki, Greece , 2007, Simul. Model. Pract. Theory.

[9]  H. Elminir Dependence of urban air pollutants on meteorology. , 2005, The Science of the total environment.

[10]  Kuruvilla John,et al.  Spatio-temporal characteristics of gaseous and particulate pollutants in an urban region of Kolkata, India , 2008 .

[11]  J. Adame,et al.  Analysis of NO, NO2, NOx, O3 and oxidant (OX = O3 + NO2) levels measured in a metropolitan area in the southwest of Iberian Peninsula , 2012 .

[12]  Pragya Agarwal,et al.  Self-Organising Maps , 2008 .

[13]  S. Breton Une approche neuronale du contrôle robotique utilisant la vision binoculaire par reconstruction tridimensionnelle , 1999 .

[14]  Ignacio J. Turias,et al.  A competitive neural network approach for meteorological situation clustering , 2006 .

[15]  Gavin C. Cawley,et al.  Extensive evaluation of neural network models for the prediction of NO2 and PM10 concentrations, compared with a deterministic modelling system and measurements in central Helsinki , 2003 .

[16]  Damien Fay,et al.  Day Type Identification for Algerian Electricity Load using Kohonen Maps , 2008 .

[17]  Tarek Khadir,et al.  Self-Organizing Map and K-Means for Meteorological Day Type Identification for the Region of Annaba -Algeria- , 2008, 2008 7th Computer Information Systems and Industrial Management Applications.

[18]  Robert E. Davis,et al.  Statistics for the evaluation and comparison of models , 1985 .

[19]  Gülsen Aydin Keskin,et al.  Multivariate methods for ground-level ozone modeling , 2011 .

[20]  Sampsa Laine,et al.  Visualization of particle size and shape distributions using self-organizing maps , 2002 .

[21]  Marina Meila,et al.  Comparing clusterings: an axiomatic view , 2005, ICML.

[22]  M. Gardner,et al.  Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London , 1999 .

[23]  M. Zelenka An analysis of the meteorological parameters affecting ambient concentrations of acid aerosols in Uniontown, Pennsylvania☆ , 1997 .

[24]  Mikko T. Kolehmainen,et al.  Data exploration with self-organizing maps in environmental informatics and bioinformatics , 2004 .

[25]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[26]  Gabriel Ibarra-Berastegi,et al.  Rainfall yield characteristics of electrical storm observed in the Spanish Basque Country area during the period 1992 1996 , 2008 .

[27]  Hsin-Chung Lu,et al.  Prediction of daily maximum ozone concentrations from meteorological conditions using a two-stage neural network , 2006 .

[28]  Qing Yang,et al.  Modeling the effects of meteorology on ozone in Houston using cluster analysis and generalized additive models , 1998 .

[29]  Hanefi Bayraktar,et al.  Impacts of some meteorological parameters on SO2 and TSP concentrations in Erzurum, Turkey. , 2005, Chemosphere.

[30]  Esa Alhoniemi,et al.  Self-Organizing Map for Data Mining in MATLAB: The SOM Toolbox , 1999 .

[31]  Wei Wang,et al.  Characteristics of gaseous pollutants near a main traffic line in Beijing and its influencing factors , 2009 .

[32]  Hamdy K. Elminir,et al.  ESTIMATION OF AIR POLLUTANT CONCENTRATIONS FROM METEOROLOGICAL PARAMETERS USING ARTIFICIAL NEURAL NETWORK , 2006 .

[33]  Jerry M. Davis,et al.  An Automated Classification Scheme Designed to Better Elucidate the Dependence of Ozone on Meteorology , 1994 .

[34]  A. Gupta,et al.  Source apportionment of PM10 at residential and industrial sites of an urban region of Kolkata, India , 2007 .

[35]  Isabelle Guyon,et al.  A Stability Based Method for Discovering Structure in Clustered Data , 2001, Pacific Symposium on Biocomputing.

[36]  Milt Statheropoulos,et al.  Principal component and canonical correlation analysis for examining air pollution and meteorological data , 1998 .

[37]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[38]  R. K. Srivastava,et al.  Development and performance evaluation of statistical models correlating air pollutants and meteorological variables at Pantnagar, India , 2011 .

[39]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Christos Zerefos,et al.  Forecasting peak pollutant levels from meteorological variables , 1995 .

[41]  Patrick Rousset Applications des algorithmes d'auto-organisation à la classification et à la prévision , 1999 .

[42]  Melody Y. Kiang,et al.  An extended self-organizing map network for market segmentation - a telecommunication example , 2006, Decis. Support Syst..