A Novel Method for Improving Air Pollution Prediction Based on Machine Learning Approaches: A Case Study Applied to the Capital City of Tehran

Environmental pollution has mainly been attributed to urbanization and industrial developments across the globe. Air pollution has been marked as one of the major problems of metropolitan areas around the world, especially in Tehran, the capital of Iran, where its administrators and residents have long been struggling with air pollution damage such as the health issues of its citizens. As far as the study area of this research is concerned, a considerable proportion of Tehran air pollution is attributed to PM10 and PM2.5 pollutants. Therefore, the present study was conducted to determine the prediction models to determine air pollutions based on PM10 and PM2.5 pollution concentrations in Tehran. To predict the air-pollution, the data related to day of week, month of year, topography, meteorology, and pollutant rate of two nearest neighbors as the input parameters and machine learning methods were used. These methods include a regression support vector machine, geographically weighted regression, artificial neural network and auto-regressive nonlinear neural network with an external input as the machine learning method for the air pollution prediction. A prediction model was then proposed to improve the afore-mentioned methods, by which the error percentage has been reduced and improved by 57%, 47%, 47% and 94%, respectively. The most reliable algorithm for the prediction of air pollution was autoregressive nonlinear neural network with external input using the proposed prediction model, where its one-day prediction error reached 1.79 µg/m3. Finally, using genetic algorithm, data for day of week, month of year, topography, wind direction, maximum temperature and pollutant rate of the two nearest neighbors were identified as the most effective parameters in the prediction of air pollution.

[1]  Jin Li,et al.  A review of comparative studies of spatial interpolation methods in environmental sciences: Performance and impact factors , 2011, Ecol. Informatics.

[2]  Filippo Sorbello,et al.  Three hours ahead prevision of SO2 pollutant concentration using an Elman neural based forecaster , 2008 .

[3]  Harri Niska,et al.  Methods for imputation of missing values in air quality data sets , 2004 .

[4]  Shikha Gupta,et al.  Identifying pollution sources and predicting urban air quality using ensemble learning methods , 2013 .

[5]  S. Moebus,et al.  Spatio-temporal modelling of residential exposure to particulate matter and gaseous pollutants for the Heinz Nixdorf Recall Cohort , 2014 .

[6]  Luca Palmeri,et al.  A comparison of methods for the assessment of odor impacts on air quality: Field inspection (VDI 3940) and the air dispersion model CALPUFF , 2012 .

[7]  J. Seinfeld,et al.  Atmospheric Chemistry and Physics: From Air Pollution to Climate Change , 1997 .

[8]  F. Kelly,et al.  Monitoring air pollution: Use of early warning systems for public health , 2012, Respirology.

[9]  M. Boznar,et al.  Perceptron neural network-based model predicts air pollution , 1997, Proceedings Intelligent Information Systems. IIS'97.

[10]  Ronald W. Schafer,et al.  What Is a Savitzky-Golay Filter? [Lecture Notes] , 2011, IEEE Signal Processing Magazine.

[11]  P Hyde,et al.  Forecasting PM10 in metropolitan areas: Efficacy of neural networks. , 2012, Environmental pollution.

[12]  Baca Cabrera,et al.  A geostatistical method for the analysis and prediction of air quality time series: application to the Aburrá Valley region , 2016 .

[13]  Qi Li,et al.  Artificial neural networks forecasting of PM2.5 pollution using air mass trajectory based geographic model and wavelet transformation , 2015 .

[14]  N. A. Mazzeo,et al.  A simple model for calculating air pollution within street canyons , 2014 .

[15]  Yafeng Yin,et al.  Prediction of hourly air pollutant concentrations near urban arterials using artificial neural network approach , 2009 .

[16]  Nadjet Djebbri,et al.  Artificial neural networks based air pollution monitoring in industrial sites , 2017, 2017 International Conference on Engineering and Technology (ICET).

[17]  J. Carretero,et al.  Assessment of ozone variations and meteorological effects in an urban area in the Mediterranean Coast. , 2002, The Science of the total environment.

[18]  Bijan Yeganeh,et al.  Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model , 2012 .

[19]  Yong Liu,et al.  A novel hybrid forecasting model for PM₁₀ and SO₂ daily concentrations. , 2015, The Science of the total environment.

[20]  Johan Lindström,et al.  Comparing universal kriging and land-use regression for predicting concentrations of gaseous oxides of nitrogen (NOx) for the Multi-Ethnic Study of Atherosclerosis and Air Pollution (MESA Air). , 2011, Atmospheric environment.

[21]  Marcello Farina,et al.  Forecasting peak air pollution levels using NARX models , 2009, Eng. Appl. Artif. Intell..

[22]  Ayse Betül Oktay,et al.  Forecasting air pollutant indicator levels with geographic models 3 days in advance using neural networks , 2010, Expert Syst. Appl..

[23]  D. Dockery,et al.  Health Effects of Fine Particulate Air Pollution: Lines that Connect , 2006, Journal of the Air & Waste Management Association.

[24]  Anikender Kumar,et al.  Forecasting of daily air quality index in Delhi. , 2011, The Science of the total environment.

[25]  Tsun-Jen Cheng,et al.  Spatiotemporal modeling with temporal-invariant variogram subgroups to estimate fine particulate matter PM2.5 concentrations , 2012 .

[26]  W. T. Davis,et al.  Air Pollution: Its Origin and Control , 1976 .