An improved low-power measurement of ambient NO2 and O3 combining electrochemical sensor clusters and machine learning

Abstract. Low-cost sensors (LCSs) are an appealing solution to the problem of spatial resolution in air quality measurement, but they currently do not have the same analytical performance as regulatory reference methods. Individual sensors can be susceptible to analytical cross-interferences; have random signal variability; and experience drift over short, medium and long timescales. To overcome some of the performance limitations of individual sensors we use a clustering approach using the instantaneous median signal from six identical electrochemical sensors to minimize the randomized drifts and inter-sensor differences. We report here on a low-power analytical device (< 200 W) that is comprised of clusters of sensors for NO2, Ox, CO and total volatile organic compounds (VOCs) and that measures supporting parameters such as water vapour and temperature. This was tested in the field against reference monitors, collecting ambient air pollution data in Beijing, China. Comparisons were made of NO2 and Ox clustered sensor data against reference methods for calibrations derived from factory settings, in-field simple linear regression (SLR) and then against three machine learning (ML) algorithms. The parametric supervised ML algorithms, boosted regression trees (BRTs) and boosted linear regression (BLR), and the non-parametric technique, Gaussian process (GP), used all available sensor data to improve the measurement estimate of NO2 and Ox. In all cases ML produced an observational value that was closer to reference measurements than SLR alone. In combination, sensor clustering and ML generated sensor data of a quality that was close to that of regulatory measurements (using the RMSE metric) yet retained a very substantial cost and power advantage.

[1]  Toyosaka Moriizumi,et al.  Gas identification using micro gas sensor array and neural-network pattern recognition , 1996 .

[2]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[3]  Gian Carlo Cardinali,et al.  An electronic nose based on solid state sensor arrays for low-cost indoor air quality monitoring applications , 2004 .

[4]  C. Chan,et al.  Air pollution in mega cities in China , 2008 .

[5]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[6]  S Roberts,et al.  Gaussian processes for time-series modelling , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[7]  J. Salmond,et al.  Validation of low-cost ozone measurement instruments suitable for use in an air-quality monitoring network , 2013 .

[8]  Gb Stewart,et al.  The use of electrochemical sensors for monitoring urban air quality in low-cost, high-density networks , 2013 .

[9]  U. Lerner,et al.  On the feasibility of measuring urban air pollution by wireless distributed sensor networks. , 2015, The Science of the total environment.

[10]  Michael Hannigan,et al.  Quantification Method for Electrolytic Sensors in Long-Term Monitoring of Ambient Air Quality , 2015, Sensors.

[11]  A. Lewis,et al.  Evaluating the performance of low cost chemical sensors for air pollution research. , 2016, Faraday discussions.

[12]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[13]  Jiming Hao,et al.  Air pollution and control action in Beijing , 2016 .

[14]  S. De Vito,et al.  Dynamic neural network architectures for on field stochastic calibration of indicative low cost air quality sensing systems , 2016 .

[15]  Alexandre Caron,et al.  Performances and limitations of electronic gas sensors to investigate an indoor air quality event , 2016 .

[16]  G. Hagler,et al.  Community Air Sensor Network (CAIRSENSE) project: evaluation of low-cost sensor performance in a suburban environment in the southeastern United States. , 2016, Atmospheric measurement techniques.

[17]  David M. Broday,et al.  Wireless Distributed Environmental Sensor Networks for Air Pollution Measurement—The Promise and the Current Reality , 2017, Sensors.

[18]  A. Lewis,et al.  Clustering approaches to improve the performance of low cost air pollution sensors. , 2017, Faraday discussions.

[19]  Yong Qi,et al.  An accident prediction approach based on XGBoost , 2017, 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE).

[20]  Jonathan P. Franklin,et al.  Calibration and assessment of electrochemical air quality sensors by co-location with regulatory-grade instruments , 2017 .

[21]  Grant R. McKercher,et al.  Characteristics and applications of small, portable gaseous air pollution monitors. , 2017, Environmental pollution.

[22]  L. Spinelle,et al.  Field calibration of a cluster of low-cost commercially available sensors for air quality monitoring. Part B: NO, CO and CO2 , 2017 .

[23]  Allen L. Robinson,et al.  A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring , 2018 .

[24]  Andrea Polidori,et al.  Air Quality Sensors and Data Adjustment Algorithms: When Is It No Longer a Measurement? , 2018, Environmental science & technology.

[25]  Wei Dong,et al.  Calibrating Low-Cost Sensors by a Two-Phase Learning Approach for Urban Air Quality Measurement , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..