Extending Urban Air Quality Maps Beyond the Coverage of a Mobile Sensor Network: Data Sources, Methods, and Performance Evaluation

Targeting the problem of generating high-resolution air quality maps for cities, we leverage four different sources of data: (i) in-situ air quality measurements produced by our mobile sensor network deployed on public transportation vehicles, (ii) explanatory air-quality and meteorological variables obtained from two static monitoring stations, (iii) land-use data of the city, and (iv) traffic statistics. We propose two novel approaches for estimating the targeted pollutant level at desired time-location pairs, extending also to areas of the city that are beyond the coverage of our mobile sensor network. The first is a log-linear regression model which is built over a virtual dependency graph based on land-use data. The second is a deep learning framework that automatically captures the dependencies of the data based on autoencoders. We have evaluated the two proposed approaches against three canonical modeling techniques considering metrics of coefficient of determination (R²), root mean square error (RMSE), and the fraction of predictions within a factor of two of observations (FAC2). Using more than 45 million real measurements in the models, the results show consistently superior performance in respect to the canonical techniques.

[1]  Lothar Thiele,et al.  Deriving high-resolution urban air pollution maps using mobile sensor nodes , 2015 .

[2]  Lester L. Yuan,et al.  Comparison of spatial interpolation methods for the estimation of air quality data , 2004, Journal of Exposure Analysis and Environmental Epidemiology.

[3]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[4]  Ari Karppinen,et al.  Meteorological dependence of size-fractionated number concentrations of urban aerosol particles , 2006 .

[5]  Lothar Thiele,et al.  On-the-Fly Calibration of Low-Cost Gas Sensors , 2012, EWSN.

[6]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.

[7]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[8]  Vijay Sivaraman,et al.  Air Pollution Exposure Estimation and Finding Association with Human Activity using Wearable Sensor Network , 2014, MLSDA'14.

[9]  Ali Marjovi,et al.  Model-based rendezvous calibration of mobile sensor networks for monitoring air quality , 2015, 2015 IEEE SENSORS.

[10]  F. Paccaud,et al.  Determinants and burden of chronic kidney disease in the population-based CoLaus study: a cross-sectional analysis. , 2013, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[11]  G. Lowry,et al.  Towards a definition of inorganic nanoparticles from an environmental, health and safety perspective. , 2009, Nature nanotechnology.

[12]  M. G. Estes,et al.  Estimating ground-level PM(2.5) concentrations in the southeastern U.S. using geographically weighted regression. , 2013, Environmental research.

[13]  Michael Brauer,et al.  Application of land use regression to estimate long-term concentrations of traffic-related nitrogen oxides and fine particulate matter. , 2007, Environmental science & technology.

[14]  Ali Marjovi,et al.  High Resolution Air Pollution Maps in Urban Environments Using Mobile Sensor Networks , 2015, 2015 International Conference on Distributed Computing in Sensor Systems.

[15]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[16]  Karl Aberer,et al.  A middleware for fast and flexible sensor network deployment , 2006, VLDB.

[17]  Anthony J. Jakeman,et al.  Modeling distributions of air pollutant concentrations—III. The hybrid deterministic-statistical distribution approach , 1988 .

[18]  Elia Georgiana Dragomir,et al.  Air Quality Index Prediction using K-Nearest Neighbor Technique , 2010 .

[19]  S. Low Choy,et al.  Using the Generalised Additive Model to model the particle number count of ultrafine particles , 2011 .

[20]  F. Paccaud,et al.  Short-Term Increase in Particulate Matter Blunts Nocturnal Blood Pressure Dipping and Daytime Urinary Sodium Excretion , 2012, Hypertension.

[21]  Bo Huang,et al.  Modeling the spatio-temporal heterogeneity in the PM10-PM2.5 relationship , 2015 .

[22]  Bernard De Baets,et al.  Prediction of ultrafine particle number concentrations in urban environments by means of Gaussian process regression based on measurements of oxides of nitrogen , 2014, Environ. Model. Softw..

[23]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[24]  Jukka Corander,et al.  Forecasting size-fractionated particle number concentrations in the urban atmosphere , 2012 .

[25]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[26]  Mikko Kolehmainen,et al.  Evolving the neural network model for forecasting air pollution time series , 2004, Eng. Appl. Artif. Intell..

[27]  T. Xia,et al.  Toxic Potential of Materials at the Nanolevel , 2006, Science.

[28]  Lothar Thiele,et al.  Health-optimal routing in urban areas , 2015, IPSN.

[29]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[30]  V. Prybutok,et al.  A neural network model forecasting for prediction of daily maximum ozone concentration in an industrialized urban area. , 1996, Environmental pollution.

[31]  Ali Marjovi,et al.  Mitigating Slow Dynamics of Low-Cost Chemical Sensors for Mobile Air Quality Monitoring Sensor Networks , 2016, EWSN.

[32]  Wei Dong,et al.  Mosaic: A low-cost mobile sensing system for urban air quality monitoring , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[33]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[34]  J. Schwartz,et al.  Fine particles are more strongly associated than coarse particles with acute respiratory health effects in schoolchildren. , 2000, Epidemiology.

[35]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[36]  Boi Faltings,et al.  A Region-Based Model for Estimating Urban Air Pollution , 2014, AAAI.

[37]  Boi Faltings,et al.  Estimating Urban Ultrafine Particle Distributions with Gaussian Process Models , 2014 .