Street-level Air Quality Inference Based on Geographically Context-aware Random Forest Using Opportunistic Mobile Sensor Network

The spatial heterogeneity and temporal variability of air pollution in urban environments make air quality inference for fine-grained air pollution monitoring extremely challenging. Most of the existing work estimates the air quality using sparse measurements collected from a limited number of fixed monitoring stations or make use of computationally demanding physicochemical models simulating the source and fate of pollutants across multiple spatial scales. In this work, we propose a geographically context-aware random forest model for street-level air quality inference using high spatial resolution data collected by an opportunistic mobile sensor network. Compared with a traditional random forest model, the proposed method builds a local model for each location by considering the neighbors in both geographical and feature space. The model is evaluated on our real air quality dataset collected from mobile sensors in Antwerp, Belgium. The experimental results show that the proposed method outperforms a series of commonly used methods including Ordinary Kriging (OK), Inverse Distance Weighting (IDW) and Random forest (RF).

[1]  Silvia Lenaerts,et al.  Cyclist exposure to black carbon, ultrafine particles and heavy metals: An experimental study along two commuting routes near Antwerp, Belgium , 2018, Environmental research.

[2]  Johan Lindström,et al.  Comparing universal kriging and land-use regression for predicting concentrations of gaseous oxides of nitrogen (NOx) for the Multi-Ethnic Study of Atherosclerosis and Air Pollution (MESA Air). , 2011, Atmospheric environment.

[3]  Edith C.-H. Ngai,et al.  Real-time air pollution monitoring with sensors on city bus , 2020, Digit. Commun. Networks.

[4]  Bijan Yeganeh,et al.  Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model , 2012 .

[5]  Joris Van den Bossche,et al.  Mobile monitoring for mapping spatial variation in urban air quality: Development and validation of a methodology based on an extensive dataset , 2015 .

[6]  L. Knibbs,et al.  A machine learning method to estimate PM2.5 concentrations across China with remote sensing, meteorological and land use information. , 2018, The Science of the total environment.

[7]  N. A. Mazzeo,et al.  A simple model for calculating air pollution within street canyons , 2014 .

[8]  M. Shima,et al.  Spatiotemporal land use random forest model for estimating metropolitan NO2 exposure in Japan. , 2018, The Science of the total environment.

[9]  Bert Brunekreef,et al.  Spatial variation of ultrafine particles and black carbon in two cities: results from a short-term measurement campaign. , 2015, The Science of the total environment.

[10]  William D. Philpot,et al.  Small-Scale Climate Maps: A Sensitivity Analysis of Some Common Assumptions Associated with Grid-Point Interpolation and Contouring , 1985 .

[11]  P. Monks,et al.  The spatio-temporal evolution of black carbon in the North-West European ‘air pollution hotspot’ , 2020 .

[12]  D A Savitz,et al.  Pulmonary function and ambient particulate matter: epidemiological evidence from NHANES I. , 1991, Archives of environmental health.

[13]  K. Pericleous,et al.  Modelling air quality in street canyons : a review , 2003 .

[14]  J. Schwartz,et al.  Lung function and chronic exposure to air pollution: a cross-sectional analysis of NHANES II. , 1989, Environmental research.

[15]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[16]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[17]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[18]  L. Knibbs,et al.  Spatiotemporal patterns of PM10 concentrations over China during 2005-2016: A satellite-based estimation using the random forests approach. , 2018, Environmental pollution.

[19]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[20]  Jennifer K Vanos,et al.  Low-cost mobile air pollution monitoring in urban environments: a pilot study in Lubbock, Texas , 2018, Environmental technology.

[21]  Rui Ji,et al.  Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: a systematic review and meta-analysis , 2020, International Journal of Infectious Diseases.

[22]  Piet Demeester,et al.  City of things: An integrated and multi-technology testbed for IoT smart city experiments , 2016, 2016 IEEE International Smart Cities Conference (ISC2).

[23]  B. Ripley Classification and Regression Trees , 2015 .

[24]  Paul S. Monks,et al.  Ultrafine particles in four European urban environments: Results from a new continuous long-term monitoring network , 2016 .

[25]  N. Kh. Arystanbekova,et al.  Application of Gaussian plume models for air pollution simulation at instantaneous emissions , 2004, Math. Comput. Simul..

[26]  R. Webster,et al.  Kriging: a method of interpolation for geographical information systems , 1990, Int. J. Geogr. Inf. Sci..