Efficient PM2.5 forecasting using geographical correlation based on integrated deep learning algorithms

This paper proposes a deep learning model that integrates a convolutional neural network and a gated recurrent unit with groups of neighboring stations to accurately predict PM2.5 concentrations at 25 stations in Seoul, South Korea. The deep learning model uses observations obtained from one Korea Meteorological Administration (KMA) station, 25 National Institute of Environmental Research (NIER) stations, and 28 automatic weather stations (AWSs) throughout Seoul. To train the deep learning model, we use all available meteorological and air quality data observed between 2015 and 2017. With the trained model, we predict PM2.5 concentrations at all 25 NIER stations in Seoul for 2018. This study also proposes a geographical polygon group model that determines the optimal number of neighboring NIER stations required to increase the accuracy of PM2.5 concentration predictions at the target station. Comparing the model measures for each of the 25 monitoring sites in 2018, we find that the geographical polygon group model achieves an index of agreement of 0.82–0.89 and a Pearson correlation coefficient of 0.70–0.83. Compared to the method using only meteorological and air quality data from one target station (average IOA = 0.77) to predict PM2.5 concentrations at the 25 stations in Seoul, the proposed method using geographical correlation-based neighboring NIER stations as polygonal groups (average IOA = 0.85) improves the PM2.5 prediction accuracy by an average of about 10%. This approach, based on deep learning, can be updated to predict air pollution or air quality indices up to several days in advance.

[1]  Yanting Chen,et al.  Spatial distribution and sources identification of elements in PM2.5 among the coastal city group in the Western Taiwan Strait region, China. , 2013, The Science of the total environment.

[2]  Jorge Reyes,et al.  Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile , 2000 .

[3]  D. Byun,et al.  Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System , 2006 .

[4]  Yoshua Bengio,et al.  Light Gated Recurrent Units for Speech Recognition , 2018, IEEE Transactions on Emerging Topics in Computational Intelligence.

[5]  Yunsoo Choi,et al.  A data ensemble approach for real-time air quality forecasting using extremely randomized trees and deep neural networks , 2019, Neural Computing and Applications.

[6]  Ebrahim Eslami,et al.  Using a deep convolutional neural network to predict 2017 ozone concentrations, 24 hours in advance , 2020, Neural Networks.

[7]  Shaoning Pang,et al.  Spatio-temporal PM2.5 prediction by spatial data aided incremental support vector regression , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[8]  John N. McHenry,et al.  Development and implementation of a remote-sensing and in situ data-assimilating version of CMAQ for operational PM2.5 forecasting. Part 1: MODIS aerosol optical depth (AOD) data-assimilation design and testing , 2015, Journal of the Air & Waste Management Association.

[9]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[10]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[11]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[12]  S. Christopher,et al.  Remote Sensing of Particulate Pollution from Space: Have We Reached the Promised Land? , 2009, Journal of the Air & Waste Management Association.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Jiansheng Wu,et al.  Applying land use regression model to estimate spatial variation of PM2.5 in Beijing, China , 2015, Environmental Science and Pollution Research.

[15]  Weiqi Zhou,et al.  Impact of urbanization level on urban air quality: a case of fine particles (PM(2.5)) in Chinese cities. , 2014, Environmental pollution.

[16]  Ebrahim Eslami,et al.  Real-time 7-day forecast of pollen counts using a deep convolutional neural network , 2019, Neural Computing and Applications.

[17]  A. Cohen,et al.  Exposure assessment for estimation of the global burden of disease attributable to outdoor air pollution. , 2012, Environmental science & technology.

[18]  C. Sioutas,et al.  Particulate Air Pollution, Ambulatory Heart Rate Variability, and Cardiac Arrhythmia in Retirement Community Residents with Coronary Artery Disease , 2013, Environmental health perspectives.

[19]  Robert E. Davis,et al.  Statistics for the evaluation and comparison of models , 1985 .

[20]  C. Song,et al.  Concentration Trajectory Route of Air pollution with an Integrated Lagrangian model (C-TRAIL Model v1.0) derived from the Community Multiscale Air Quality Model (CMAQ Model v5.2) , 2020 .

[21]  Qi Ying,et al.  Spatial and temporal variations of six criteria air pollutants in 31 provincial capital cities in China during 2013-2014. , 2014, Environment international.

[22]  Xiaoyan Ma,et al.  Can MODIS AOD be employed to derive PM2.5 in Beijing-Tianjin-Hebei over China? , 2016 .

[23]  Mahmut Bayramoglu,et al.  Adaptive neuro-fuzzy based modelling for prediction of air pollution daily levels in city of Zonguldak. , 2006, Chemosphere.

[24]  R. Martin,et al.  Toward the next generation of air quality monitoring: Particulate Matter , 2013 .

[25]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[26]  Jun Ma,et al.  Deep learning-based PM2.5 prediction considering the spatiotemporal correlations: A case study of Beijing, China. , 2020, The Science of the total environment.

[27]  Xiang Li,et al.  Deep learning architecture for air quality predictions , 2016, Environmental Science and Pollution Research.

[28]  H. Kim,et al.  Development of a daily PM10 and PM2.5 prediction system using a deep long short-term memory neural network model , 2019, Atmospheric Chemistry and Physics.

[29]  Jorge Reyes,et al.  An integrated neural network model for PM10 forecasting , 2006 .

[30]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[31]  Xiang Li,et al.  Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. , 2017, Environmental pollution.

[32]  Jhoon Kim,et al.  The Impact of the Direct Effect of Aerosols on Meteorology and Air Quality Using Aerosol Optical Depth Assimilation During the KORUS‐AQ Campaign , 2019, Journal of geophysical research. Atmospheres : JGR.

[33]  Yang Zhang,et al.  Real-time air quality forecasting, part II: State of the science, current research needs, and future prospects , 2012 .

[34]  J. Herman,et al.  First Top‐Down Estimates of Anthropogenic NOx Emissions Using High‐Resolution Airborne Remote Sensing Observations , 2018 .

[35]  Yang Liu,et al.  Limitations of Remotely Sensed Aerosol as a Spatial Proxy for Fine Particulate Matter , 2009, Environmental health perspectives.

[36]  C. L. Philip Chen,et al.  Predictive Deep Boltzmann Machine for Multiperiod Wind Speed Forecasting , 2015, IEEE Transactions on Sustainable Energy.

[37]  Liangpei Zhang,et al.  Estimating Ground‐Level PM2.5 by Fusing Satellite and Station Observations: A Geo‐Intelligent Deep Learning Approach , 2017, 1707.03558.

[38]  M. Brauer,et al.  Risk of Nonaccidental and Cardiovascular Mortality in Relation to Long-term Exposure to Low Concentrations of Fine Particulate Matter: A Canadian National-Level Cohort Study , 2012, Environmental health perspectives.

[39]  Yunsoo Choi,et al.  A real-time hourly ozone prediction system using deep convolutional neural network , 2019, Neural Computing and Applications.

[40]  Ke Lu,et al.  Missing data imputation by K nearest neighbours based on grey relational structure and mutual information , 2015, Applied Intelligence.

[41]  Marcella Busilacchio,et al.  Recursive neural network model for analysis and forecast of PM10 and PM2.5 , 2017 .

[42]  J. Mandel,et al.  Quantifying the Impact of Biomass Burning Emissions on Major Inorganic Aerosols and Their Precursors in the U.S. , 2017 .

[43]  Jianjun He,et al.  Annual and diurnal variations of gaseous and particulate pollutants in 31 provincial capital cities based on in situ air quality monitoring data from China National Environmental Monitoring Center. , 2016, Environment international.

[44]  Huan Liu,et al.  Feasibility and difficulties of China's new air quality standard compliance: PRD case of PM 2.5 and ozone from 2010 to 2025 , 2013 .

[45]  N Moussiopoulos,et al.  Statistical analysis of environmental data as the basis of forecasting: an air quality application. , 2002, The Science of the total environment.