Prediction models of CO, SPM and SO2 concentrations in the Campo de Gibraltar Region, Spain: a multiple comparison strategy

The ‘Campo de Gibraltar’ region is a very industrialized area where very few air pollution studies have been carried out. Up to date, no model has been developed in order to predict air pollutant levels in the different towns spread in the region. Carbon monoxide (CO), Sulphur dioxide (SO2) and suspended particulate matter (SPM) series have been investigated (years 1999–2000–2001). Multilayer perceptron models (MLPs) with backpropagation learning rule have been used. A resampling strategy with two-fold crossvalidation allowed the statistical comparison of the different models considered in this study. Artificial neural networks (ANN) models were compared with Persistence and ARIMA models and also with models based on standard Multiple Linear Regression (MLR) over test sets with data that had not been used in the training stage. The models based on ANNs showed better capability of generalization than those based on MLR. The designed procedure of random resampling permits an adequate and robust multiple comparison of the tested models. Principal component analysis (PCA) is used to reduce the dimensionality of data and to transform exogenous variables into significant and independent components. Short-term predictions were better than medium-term predictions in the case of CO and SO2 series. Conversely, medium-term predictions were better in the case of SPM concentrations. The predictions are significantly promising (e.g., dSPM 24-ahead = 0.906, dCO 1-ahead = 0.891, dSO2 1-ahead = 0.851).

[1]  W. Wilson,et al.  Fine particles and coarse particles: concentration relationships relevant to epidemiologic studies. , 1997, Journal of the Air & Waste Management Association.

[2]  LiMin Fu,et al.  Neural networks in computer intelligence , 1994 .

[3]  Martin T. Hagan,et al.  Neural network design , 1995 .

[4]  Elisa Guerrero Vázquez,et al.  Multiple comparison procedures applied to model selection , 2002, Neurocomputing.

[5]  Gavin C. Cawley,et al.  Modelling SO2 concentration at a point with statistical approaches , 2004, Environ. Model. Softw..

[6]  A. J. Feelders,et al.  On the Statistical Comparison of Inductive Learning Methods , 1995, AISTATS.

[7]  Jorge Reyes,et al.  Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile , 2000 .

[8]  Harri Niska,et al.  Methods for imputation of missing values in air quality data sets , 2004 .

[9]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[10]  Asha B. Chelani,et al.  Prediction of sulphur dioxide concentration using artificial neural networks , 2002, Environ. Model. Softw..

[11]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[12]  A. Meystel,et al.  Intelligent Systems , 2001 .

[13]  Linear regression analyses of ozone and sulphur dioxide in ambient air , 1986 .

[14]  Giuseppe Nunnari,et al.  The application of neural techniques to the modelling of time-series of atmospheric pollution data , 1998 .

[15]  J. Jobson Applied Multivariate Data Analysis , 1995 .

[16]  J. Schwartz,et al.  Is Daily Mortality Associated Specifically with Fine Particles? , 1996, Journal of the Air & Waste Management Association.

[17]  A. Hofzumahaus Measurement of Photolysis Frequencies in the Atmosphere , 2007 .

[18]  G. Kallos,et al.  Saharan dust contributions to PM10 and TSP levels in Southern and Eastern Spain , 2001 .

[19]  Timothy Masters,et al.  Advanced algorithms for neural networks: a C++ sourcebook , 1995 .

[20]  Armando Pelliccioni,et al.  Use of Neural Net Models to Forecast Atmospheric Pollution , 2000 .

[21]  M. Gardner,et al.  Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London , 1999 .

[22]  Jose Torres-Jimenez,et al.  Short-term ozone forecasting by artificial neural networks , 1995 .

[23]  Mukesh Khare,et al.  ARTIFICIAL NEURAL NETWORK BASED LINE SOURCE MODELS FOR VEHICULAR EXHAUST EMISSION PREDICTIONS OF AN URBAN ROADWAY , 2004 .

[24]  Andrew C. Comrie,et al.  Climatology and forecast modeling of ambient carbon monoxide in Phoenix, Arizona , 1999 .

[25]  G. Cobb Introduction to Design and Analysis of Experiments , 1997 .

[26]  Giorgio Corani,et al.  Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning , 2005 .

[27]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[28]  I. J. Myung,et al.  GUEST EDITORS' INTRODUCTION: Special Issue on Model Selection , 2000 .

[29]  Zucchini,et al.  An Introduction to Model Selection. , 2000, Journal of mathematical psychology.

[30]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[31]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[32]  Geok See Ng,et al.  Democracy in pattern classifications: combinations of votes from various pattern classifiers , 1998, Artif. Intell. Eng..

[33]  Christos Zerefos,et al.  Forecasting peak pollutant levels from meteorological variables , 1995 .

[34]  Georgios Grivas,et al.  Artificial neural network models for prediction of PM10 hourly concentrations, in the Greater Area of Athens, Greece , 2006 .

[35]  A. Tamhane,et al.  Multiple Comparison Procedures , 2009 .

[36]  I. Jolliffe Principal Component Analysis , 2002 .

[37]  William H. Press,et al.  Numerical Recipes in C, 2nd Edition , 1992 .

[38]  Kazuhiko Sakamoto,et al.  Influence of ozone, relative humidity, and flow rate on the deposition and oxidation of sulfur dioxide on yellow sand , 2004 .

[39]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[40]  J C Bailar,et al.  The association between daily mortality and ambient air particle pollution in Montreal, Quebec. 1. Nonaccidental mortality. , 2001, Environmental research.

[41]  Mahmut Bayramoglu,et al.  Adaptive neuro-fuzzy based modelling for prediction of air pollution daily levels in city of Zonguldak. , 2006, Chemosphere.

[42]  Gavin C. Cawley,et al.  Extensive evaluation of neural network models for the prediction of NO2 and PM10 concentrations, compared with a deterministic modelling system and measurements in central Helsinki , 2003 .

[43]  Dwayne E. Heard,et al.  Analytical techniques for atmospheric measurement , 2006 .

[44]  A. Tamhane,et al.  Multiple Comparison Procedures , 1989 .

[45]  P. Viotti,et al.  Atmospheric urban pollution: applications of an artificial neural network (ANN) to the city of Perugia , 2002 .

[46]  Vladimir Vapnik,et al.  Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .