Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model

Due to the health impacts caused by exposures to air pollutants in urban areas, monitoring and forecasting of air quality parameters have become popular as an important topic in atmospheric and environmental research today. The knowledge on the dynamics and complexity of air pollutants behavior has made artificial intelligence models as a useful tool for a more accurate pollutant concentration prediction. This paper focuses on an innovative method of daily air pollution prediction using combination of Support Vector Machine (SVM) as predictor and Partial Least Square (PLS) as a data selection tool based on the measured values of CO concentrations. The CO concentrations of Rey monitoring station in the south of Tehran, from Jan. 2007 to Feb. 2011, have been used to test the effectiveness of this method. The hourly CO concentrations have been predicted using the SVM and the hybrid PLS–SVM models. Similarly, daily CO concentrations have been predicted based on the aforementioned four years measured data. Results demonstrated that both models have good prediction ability; however the hybrid PLS–SVM has better accuracy. In the analysis presented in this paper, statistic estimators including relative mean errors, root mean squared errors and the mean absolute relative error have been employed to compare performances of the models. It has been concluded that the errors decrease after size reduction and coefficients of determination increase from 56 to 81% for SVM model to 65–85% for hybrid PLS–SVM model respectively. Also it was found that the hybrid PLS–SVM model required lower computational time than SVM model as expected, hence supporting the more accurate and faster prediction ability of hybrid PLS–SVM model.

[1]  Stanislaw Osowski,et al.  Forecasting of the daily meteorological pollution using wavelets and support vector machine , 2007, Eng. Appl. Artif. Intell..

[2]  P. J. García Nieto,et al.  Application of an SVM-based regression model to the air quality study at local scale in the Avilés urban area (Spain) , 2011, Math. Comput. Model..

[3]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[4]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[5]  Wei-Zhen Lu,et al.  Prediction of PM10 concentrations at urban traffic intersections using semi-empirical box modelling with instantaneous velocity and acceleration , 2009 .

[6]  Dong-Sool Kim,et al.  A new method of ozone forecasting using fuzzy expert and neural network systems. , 2004, The Science of the total environment.

[7]  Wei-Zhen Lu,et al.  Potential assessment of the "support vector machine" method in forecasting ambient air pollutant trends. , 2005, Chemosphere.

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  Scott M. Robeson,et al.  Evaluation and comparison of statistical forecast models for daily maximum ozone concentrations , 1990 .

[10]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[11]  C. Chan,et al.  Effect of meteorology and air pollutant transport on ozone episodes at a subtropical coastal Asian city, Hong Kong , 2000 .

[12]  Nicolas Moussiopoulos,et al.  Numerical simulation of photochemical smog formation in Athens, Greece—A case study , 1995 .

[13]  I. D. Wilson,et al.  Predicting the geo-temporal variations of crime and disorder , 2003 .

[14]  W Z Lu,et al.  Using Improved Neural Network Model to Analyze RSP, NOx and NO2 Levels in Urban Air in Mong Kok, Hong Kong , 2003, Environmental monitoring and assessment.

[15]  José David Martín-Guerrero,et al.  Neural networks for analysing the relevance of input variables in the prediction of tropospheric ozone concentration , 2006 .

[16]  S. I. V. Sousa,et al.  Multiple linear regression and artificial neural networks based on principal components to predict ozone concentrations , 2007, Environ. Model. Softw..

[17]  Mukesh Khare,et al.  A Review of Deterministic, Stochastic and Hybrid Vehicular Exhaust Emission Models , 2004 .

[18]  Yi Lin,et al.  Support Vector Machines for Classification in Nonstandard Situations , 2002, Machine Learning.

[19]  Feng Luan,et al.  Prediction of retention time of a variety of volatile organic compounds based on the heuristic method and support vector machine , 2005 .

[20]  David D. Parrish,et al.  Assessment of pollutant emission inventories by principal component analysis of ambient air measurements , 1992 .

[21]  M. Stone Continuum regression: Cross-validated sequentially constructed prediction embracing ordinary least s , 1990 .

[22]  V. Prybutok,et al.  A neural network model forecasting for prediction of daily maximum ozone concentration in an industrialized urban area. , 1996, Environmental pollution.

[23]  Wei-Zhen Lu,et al.  Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme. , 2008, The Science of the total environment.

[24]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[25]  Archontoula Chaloulakou,et al.  Comparative assessment of neural networks and regression models for forecasting summertime ozone in Athens. , 2003, The Science of the total environment.

[26]  Sheng Chen,et al.  Sparse support vector regression based on orthogonal forward selection for the generalised kernel model , 2006, Neurocomputing.

[27]  Wenjian Wang,et al.  Online prediction model based on support vector machine , 2008, Neurocomputing.