Distributed Deep Features Extraction Model for Air Quality Forecasting

Several studies in environmental engineering emphasize the importance of air quality forecasting for sustainable development around the world. In this paper, we studied a new approach for air quality forecasting in Busan metropolitan city. We proposed a convolutional Bi-Directional Long-Short Term Memory (Bi-LSTM) autoencoder model trained using a distributed architecture to predict the concentration of the air quality particles (PM2.5 and PM10). The proposed deep learning model can automatically learn the intrinsic correlation among the pollutants in different location. Also, the meteorological and the pollution gas information at each location are fully utilized, which is beneficial for the performance of the model. We used multiple one-dimension convolutional neural network (CNN) layers to extract the local spatial features and a stacked Bi-LSTM layer to learn the spatiotemporal correlation of air quality particles. In addition, we used a stacked deep autoencoder to encode the essential transformation patterns of the pollution gas and the meteorological data, since they are very important for providing useful information that can significantly improve the prediction of the air quality particles. Finally, in order to reduce the training time and the resource consumption, we used a distributed deep leaning approach called data parallelism, which has never been used to tackle the problem of air quality forecasting. We evaluated our approach with extensive experiments based on the data collected in Busan metropolitan city. The results reveal the superiority of our framework over ten baseline models and display how the distributed deep learning model can significantly improve the training time and even the prediction accuracy.

[1]  Amar Nath Gill,et al.  Predicting air quality using ARIMA, ARFIMA and HW smoothing , 2014, Model. Assist. Stat. Appl..

[2]  Arwa S. Sayegh,et al.  Comparing the Performance of Statistical Models for Predicting PM10 Concentrations , 2014 .

[3]  Jianming Xu,et al.  Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai , 2020 .

[4]  Ping-Huan Kuo,et al.  A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities , 2018, Sensors.

[5]  José M. Cecilia,et al.  Air-Pollution Prediction in Smart Cities through Machine Learning Methods: A Case of Study in Murcia, Spain , 2018, J. Univers. Comput. Sci..

[6]  Jiachen Zhao,et al.  Long short-term memory - Fully connected (LSTM-FC) neural network for PM2.5 concentration prediction. , 2019, Chemosphere.

[7]  Yong Li,et al.  Hourly PM2.5 concentration forecast using stacked autoencoder model with emphasis on seasonality , 2019, Journal of Cleaner Production.

[8]  Guodong Guo,et al.  A survey on deep learning based face recognition , 2019, Comput. Vis. Image Underst..

[9]  Dong Yang,et al.  PM2.5 concentration prediction using hidden semi-Markov model-based times series data mining , 2009, Expert Syst. Appl..

[10]  Wenjian Wang,et al.  Online prediction model based on support vector machine , 2008, Neurocomputing.

[11]  A. Inés,et al.  DeepClas4Bio: Connecting bioimaging tools with deep learning frameworks for image classification , 2019, Comput. Biol. Medicine.

[12]  Congcong Wen,et al.  A novel spatiotemporal convolutional long short-term neural network for air pollution prediction. , 2019, The Science of the total environment.

[13]  Carsten Maple,et al.  Comparative Analysis of Machine Learning Techniques for Predicting Air Quality in Smart Cities , 2019, IEEE Access.

[14]  Qi Li,et al.  A hybrid model for spatiotemporal forecasting of PM2.5 based on graph convolutional neural network and long short-term memory. , 2019, The Science of the total environment.