Hybrid Neural Networks and Boosted Regression Tree Models for Predicting Roadside Particulate Matter

This paper examines the application of artificial neural network (ANN) and boosted regression tree (BRT) methods to air quality modelling. The methods were used to develop models for predicting roadside particle mass concentrations (PM10, PM2.5) and particle number counts (PNC) from air pollution, traffic and meteorological data from Marylebone Road in London. Elastic net, Lasso and principal components analysis were used as feature selection methods for the ANN models to reduce the number of predictor variables and improve generalisation. The performance of the ANN models with feature selection (ANN hybrid) and of the BRT models was evaluated and compared using the following statistical metrics: root mean square error (RMSE), fraction of predictions within a factor of two of the observations (FAC2), mean bias (MB), mean gross error (MGE), correlation coefficient (R) and coefficient of efficiency (CoE). The input variables selected by the elastic net produced the best-performing ANN models. The ANN hybrid models performed only slightly better than the BRT models. The R values of the ANN elastic net and BRT models were, respectively, 0.96 and 0.95 for PM10, 0.96 and 0.96 for PM2.5, and 0.89 and 0.87 for PNC. The corresponding CoE values were 0.72 and 0.70 for PM10, 0.74 and 0.76 for PM2.5, and 0.81 and 0.71 for PNC. About 80–99% of all model predictions were within a factor of two of the observed particle concentrations. The BRT models offer greater advantages for model interpretation and permit feature selection. The study therefore recommends BRT over ANN where model interpretation is a priority.
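The six evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. The formulas below are assumptions, since the abstract does not define them: bias is taken as prediction minus observation, R is the Pearson correlation, FAC2 skips zero observations, and CoE follows the absolute-error form 1 - Σ|P - O| / Σ|O - mean(O)|. The function name `evaluate_model` and the sample numbers are hypothetical and are not taken from the study.

```python
import math


def evaluate_model(observed, predicted):
    """Return RMSE, FAC2, MB, MGE, R and CoE for paired observations and predictions.

    Assumed definitions (not given in the abstract):
      error = predicted - observed
      FAC2  = share of pairs with 0.5 <= predicted / observed <= 2.0
      R     = Pearson correlation coefficient
      CoE   = 1 - sum(|error|) / sum(|observed - mean(observed)|)
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length series with at least 2 values")

    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n

    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mb = sum(errors) / n                      # mean bias
    mge = sum(abs(e) for e in errors) / n     # mean gross error

    # FAC2: the ratio is undefined for zero observations, so those pairs are skipped.
    ratios = [p / o for o, p in zip(observed, predicted) if o != 0]
    fac2 = sum(1 for r in ratios if 0.5 <= r <= 2.0) / len(ratios)

    # Pearson correlation from centred sums (undefined if either series is constant).
    cov = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_pred = sum((p - mean_pred) ** 2 for p in predicted)
    r = cov / math.sqrt(var_obs * var_pred)

    # Coefficient of efficiency: 1 is a perfect fit, 0 is no better than the observed mean.
    coe = 1.0 - sum(abs(e) for e in errors) / sum(abs(o - mean_obs) for o in observed)

    return {"RMSE": rmse, "FAC2": fac2, "MB": mb, "MGE": mge, "R": r, "CoE": coe}


# Hypothetical hourly PM10 values, for illustration only.
observed_pm10 = [10.0, 20.0, 30.0, 40.0]
predicted_pm10 = [12.0, 18.0, 33.0, 37.0]
for name, value in evaluate_model(observed_pm10, predicted_pm10).items():
    print(f"{name}: {value:.3f}")
```

Under these assumed definitions, R measures only how closely predictions track the observations linearly, whereas CoE also penalises the size of the errors, which may be why the two metrics can rank a pair of models differently.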
