Predicting hourly air pollutant levels using artificial neural networks coupled with uncertainty analysis by Monte Carlo simulations

Recent progress in developing artificial neural network (ANN) metamodels has paved the way for reliable use of these models in the prediction of air pollutant concentrations in urban atmosphere. However, improvement of prediction performance, proper selection of input parameters and model architecture, and quantification of model uncertainties remain key challenges to their practical use. This study has three main objectives: to select an ensemble of input parameters for ANN metamodels consisting of meteorological variables that are predictable by conventional weather forecast models and variables that properly describe the complex nature of pollutant source conditions in a major city, to optimize the ANN models to achieve the most accurate hourly prediction for a case study (city of Tehran), and to examine a methodology to analyze uncertainties based on ANN and Monte Carlo simulations (MCS). In the current study, the ANNs were constructed to predict criteria pollutants of nitrogen oxides (NOx), nitrogen dioxide (NO2), nitrogen monoxide (NO), ozone (O3), carbon monoxide (CO), and particulate matter with aerodynamic diameter of less than 10 μm (PM10) in Tehran based on the data collected at a monitoring station in the densely populated central area of the city. The best combination of input variables was comprehensively investigated taking into account the predictability of meteorological input variables and the study of model performance, correlation coefficients, and spectral analysis. Among numerous meteorological variables, wind speed, air temperature, relative humidity and wind direction were chosen as input variables for the ANN models. The complex nature of pollutant source conditions was reflected through the use of hour of the day and month of the year as input variables and the development of different models for each day of the week. After that, ANN models were constructed and validated, and a methodology of computing prediction intervals (PI) and probability of exceeding air quality thresholds was developed by combining ANNs and MCSs based on Latin Hypercube Sampling (LHS). The results showed that proper ANN models can be used as reliable metamodels for the prediction of hourly air pollutants in urban environments. High correlations were obtained with R2 of more than 0.82 between modeled and observed hourly pollutant levels for CO, NOx, NO2, NO, and PM10. However, predicted O3 levels were less accurate. The combined use of ANNs and MCSs seems very promising in analyzing air pollution prediction uncertainties. Replacing deterministic predictions with probabilistic PIs can enhance the reliability of ANN models and provide a means of quantifying prediction uncertainties.

[1]  R. H. Myers Classical and modern regression with applications , 1986 .

[2]  Joaquín B. Ordieres Meré,et al.  Development and comparative analysis of tropospheric ozone prediction models using linear and artificial intelligence-based models in Mexicali, Baja California (Mexico) and Calexico, California (US) , 2008, Environ. Model. Softw..

[3]  Marija Zlata Boznar,et al.  A neural network-based method for short-term predictions of ambient SO2 concentrations in highly polluted industrial areas of complex terrain , 1993 .

[4]  Amir F. Atiya,et al.  Comprehensive Review of Neural Network-Based Prediction Intervals and New Advances , 2011, IEEE Transactions on Neural Networks.

[5]  Chris Chatfield,et al.  Time series forecasting with neural networks: a comparative study using the air line data , 2008 .

[6]  M. Molina,et al.  Megacities and Atmospheric Pollution , 2004, Journal of the Air & Waste Management Association.

[7]  Jorge Reyes,et al.  An integrated neural network model for PM10 forecasting , 2006 .

[8]  M. Gardner,et al.  Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London , 1999 .

[9]  Gavin C. Cawley,et al.  Extensive evaluation of neural network models for the prediction of NO2 and PM10 concentrations, compared with a deterministic modelling system and measurements in central Helsinki , 2003 .

[10]  T. Fukuchi,et al.  Evaluation of polarization angle rotation of propagating light in a partially ionized atmosphere under discharge conditions , 2008 .

[11]  Ari Karppinen,et al.  A modelling system for predicting urban air pollution: model description and applications in the Helsinki metropolitan area , 2000 .

[12]  M. Kolehmainen,et al.  Neural networks and periodic components used in air quality forecasting , 2001 .

[13]  Holger R. Maier,et al.  Neural networks for the prediction and forecasting of water resource variables: a review of modelling issues and applications , 2000, Environ. Model. Softw..

[14]  Holger R. Maier,et al.  State of the Art of Artificial Neural Networks in Geotechnical Engineering , 2008 .

[15]  Keith W. Hipel Stochastic and statistical methods in hydrology and environmental engineering , 1994 .

[16]  Dirk P. Kroese,et al.  Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[17]  Reuven Y. Rubinstein,et al.  Simulation and the Monte Carlo Method , 1981 .

[18]  Brian D. Ripley,et al.  Stochastic Simulation , 2005 .

[19]  S. Vitabile,et al.  Two-days ahead prediction of daily maximum concentrations of SO2, O3, PM10, NO2, CO in the urban area of Palermo, Italy , 2007 .

[20]  Gabriel Ibarra-Berastegi,et al.  From diagnosis to prognosis for forecasting air pollution using neural networks: Air pollution monitoring in Bilbao , 2008, Environ. Model. Softw..

[21]  Jorge Reyes,et al.  Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile , 2000 .

[22]  Predrag Hercog,et al.  Neural network forecasting of air pollutants hourly concentrations using optimised temporal averages of meteorological variables and pollutant concentrations , 2009 .

[23]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[24]  L. Dawidowski,et al.  Artificial neural network for the identification of unknown air pollution sources , 1999 .

[25]  J. Seinfeld,et al.  Atmospheric Chemistry and Physics: From Air Pollution to Climate Change , 1997 .

[26]  Darko Koracin,et al.  Application of artificial neural networks to modeling the transport and dispersion of tracers in complex terrain , 2002 .

[27]  Ari Karppinen,et al.  A modelling system for predicting urban air pollution:: comparison of model predictions with the data of an urban measurement network in Helsinki , 2000 .

[28]  A. McMichael,et al.  The urban environment and health in a world of increasing globalization: issues for developing countries. , 2000, Bulletin of the World Health Organization.

[29]  Sean McKee,et al.  Monte Carlo Methods for Applied Scientists , 2005 .

[30]  Anastasia K Paschalidou,et al.  Forecasting hourly PM10 concentration in Cyprus through artificial neural networks and multiple regression models: implications to local environmental management , 2011, Environmental science and pollution research international.

[31]  C Borrego,et al.  Procedures for estimation of modelling uncertainty in air quality assessment. , 2008, Environment international.

[32]  Yun Zeng,et al.  Progress in developing an ANN model for air pollution index forecast , 2004 .

[33]  J. Hooyberghs,et al.  A neural network forecast for daily average PM10 concentrations in Belgium , 2005 .