Revisiting urban air quality forecasting: a regression approach

We address air quality (AQ) forecasting as a regression problem employing computational intelligence (CI) methods for the Gdańsk Metropolitan Area (GMA) in Poland and the Thessaloniki Metropolitan Area (TMA) in Greece. Linear Regression as well as Artificial Neural Network models are developed, accompanied by Random Forest models, for five locations per study area and for a dataset of limited feature dimensionality. An ensemble approach is also used for generating and testing AQ forecasting models. Results indicate good model performance with a correlation coefficient between forecasts and measurements for the daily mean $$\hbox {PM}_{10}$$PM10 concentration one day in advance reaching 0.765 for one of the TMA locations and 0.64 for one of the GMA locations. Overall results suggest that the specific modelling approach can support the provision of air quality forecasts on the basis of limited feature space dimensionality and by employing simple linear regression models.

[1]  J. Kukkonen,et al.  Intercomparison of air quality data using principal component analysis, and forecasting of PM₁₀ and PM₂.₅ concentrations using artificial neural networks, in Thessaloniki and Helsinki. , 2011, The Science of the total environment.

[2]  Kostas D. Karatzas,et al.  Air pollution modelling with the aid of computational intelligence methods in Thessaloniki, Greece , 2007, Simul. Model. Pract. Theory.

[3]  N. S. Loboda,et al.  Short-range forecast of atmospheric pollutants using non-linear prediction method , 2008 .

[4]  Cezary Orlowski,et al.  Adaptation of an ANN-Based Air Quality Forecasting Model to a New Application Area , 2017, ACIIDS.

[5]  Brian Bukoski,et al.  Air Quality Forecasting , 2014 .

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Jianzhou Wang,et al.  A hybrid model for PM₂.₅ forecasting based on ensemble empirical mode decomposition and a general regression neural network. , 2014, The Science of the total environment.

[8]  S. Riffat,et al.  Future cities and environmental sustainability , 2017 .

[9]  Krzysztof Siwek,et al.  Improving the accuracy of prediction of PM10 pollution by the wavelet transformation and an ensemble of neural predictors , 2012, Eng. Appl. Artif. Intell..

[10]  Marcella Busilacchio,et al.  Recursive neural network model for analysis and forecast of PM10 and PM2.5 , 2017 .

[11]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[12]  Cezary Orlowski,et al.  Urban Air Quality Forecasting: A Regression and a Classification Approach , 2017, ACIIDS.

[13]  N Moussiopoulos,et al.  Air quality status in Greater Thessaloniki Area and the emission reductions needed for attaining the EU air quality legislation. , 2009, The Science of the total environment.

[14]  Pericles A. Mitkas,et al.  Sparse episode identification in environmental datasets: The case of air quality assessment , 2011, Expert Syst. Appl..

[15]  Harri Niska,et al.  Forecasting daily pollen concentrations using data-driven modeling methods in Thessaloniki, Greece , 2010 .

[16]  Jaakko Kukkonen,et al.  A review of operational, regional-scale, chemical weather forecasting models in Europe , 2012 .

[17]  Georgi Nikolov,et al.  Air quality monitoring in urban environments , 2016, 2016 39th International Spring Seminar on Electronics Technology (ISSE).

[18]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[19]  S. Dawe,et al.  A Systems Approach to Smart City Infrastructure : A Small City Perspective , 2016 .