Prediction of outdoor PM2.5 concentrations based on a three-stage hybrid neural network model

Abstract PM2.5 concentrations forecasting has become an effective method to deal with the severe air pollution ahead. This study proposes a three-stage hybrid neural network model to forecast outdoor PM2.5 concentrations. It subsumes outlier correction, decomposition, neural network, and metaheuristic optimization to guarantee high accuracy. The adopted methods serve different purposes and ultimately aim at improving the performance of model. Firstly, a robust outlier correction preprocessing method is employed to filter the abnormal information in the original PM2.5 concentrations series and reduce its complexity. Then, the processed series is adaptively decomposed into several subseries by empirical wavelet transform. The subseries are more stationary and their trends are more obvious than original series. To forecast each subseries and generate multi-step forecasting outputs, weighted regularized extreme learning machine whose parameters are optimally determined by neural network algorithm is used. Finally, inverse empirical wavelet transform is implemented to reconstruct the subseries and correct unexpected forecasting values beyond the frequency bands. The proposed model is tested by the actual data collected from four cities in China. Results demonstrate that: (1) the proposed model has accurate forecasting performance, whose root mean square errors of 5-step daily forecasting results in Beijing, Tianjin, Shijiazhuang, and Tangshan are 4.7555μg/m3, 3.3024μg/m3, 6.4326μg/m3, and 4.3191μg/m3, respectively; (2) all components of the hybrid model are indispensable to promote accuracy; (3) the proposed model significantly outperforms seven benchmark models and five existing models; (4) the forecasting results can be utilized to analyze the trend of pollution and perform better management measures.

[1]  Yuexia Sun,et al.  Indoor air quality and occupants' ventilation habits in China: Seasonal measurement and long-term monitoring , 2018, Building and Environment.

[2]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[3]  Norden E. Huang,et al.  Ensemble Empirical Mode Decomposition: a Noise-Assisted Data Analysis Method , 2009, Adv. Data Sci. Adapt. Anal..

[4]  Witold Pedrycz,et al.  Multivariate time series anomaly detection: A framework of Hidden Markov Models , 2017, Appl. Soft Comput..

[5]  Dominique Zosso,et al.  Variational Mode Decomposition , 2014, IEEE Transactions on Signal Processing.

[6]  Olivier Grunder,et al.  A novel hybrid model for air quality index forecasting based on two-phase decomposition technique and modified extreme learning machine. , 2017, The Science of the total environment.

[7]  Yong Cheng,et al.  Hybrid algorithm for short-term forecasting of PM2.5 in China , 2019, Atmospheric Environment.

[8]  Rodrigo Herrera,et al.  A dynamic multiple equation approach for forecasting PM2.5 pollution in Santiago, Chile , 2018, International Journal of Forecasting.

[9]  Shaolong Sun,et al.  A secondary-decomposition-ensemble learning paradigm for forecasting PM2.5 concentration , 2018, Atmospheric Pollution Research.

[10]  Qiao Junfei,et al.  Predicting PM2.5 Concentrations at a Regional Background Station Using Second Order Self-Organizing Fuzzy Neural Network , 2017 .

[11]  Lei Zhao,et al.  Long-term monitoring of indoor CO2 and PM2.5 in Chinese homes: Concentrations and their relationships with outdoor environments , 2018, Building and Environment.

[12]  Majid Salari,et al.  Statistical models for multi-step-ahead forecasting of fine particulate matter in urban areas , 2019, Atmospheric Pollution Research.

[13]  Paul J. Gemperline,et al.  Nonlinear multivariate calibration using principal components regression and artificial neural networks , 1991 .

[14]  B. Mai,et al.  Vertical distribution of PAHs in the indoor and outdoor PM2.5 in Guangzhou, China , 2005 .

[15]  Onur Seref,et al.  Relaxed support vector regression , 2018, Annals of Operations Research.

[16]  Norden E. Huang,et al.  Complementary Ensemble Empirical Mode Decomposition: a Novel Noise Enhanced Data Analysis Method , 2010, Adv. Data Sci. Adapt. Anal..

[17]  P. Gupta,et al.  Application of Moderate Resolution Imaging Spectroradiometer (MODIS) Aerosol Optical Depth (AOD) and Weather Research Forecasting (WRF) model meteorological data for assessment of fine particulate matter (PM2.5) over India , 2019, Atmospheric Pollution Research.

[18]  Haiping Wu,et al.  An intelligent hybrid model for air pollutant concentrations forecasting: Case of Beijing in China , 2019, Sustainable Cities and Society.

[19]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[20]  Jean-Christophe Golaz,et al.  The roles of aerosol direct and indirect effects in past and future climate change , 2013 .

[21]  Jane Lin,et al.  Trend analysis and forecast of PM2.5 in Fuzhou, China using the ARIMA model , 2018, Ecological Indicators.

[22]  Li-Chiu Chang,et al.  Explore a deep learning multi-output neural network for regional multi-step-ahead air quality forecasts , 2019, Journal of Cleaner Production.

[23]  Jérôme Gilles,et al.  Empirical Wavelet Transform , 2013, IEEE Transactions on Signal Processing.

[24]  Shaolong Sun,et al.  Application of decomposition-ensemble learning paradigm with phase space reconstruction for day-ahead PM2.5 concentration forecasting. , 2017, Journal of environmental management.

[25]  Patricio Perez,et al.  Forecasting hourly PM2.5 in Santiago de Chile with emphasis on night episodes , 2016 .

[26]  Hui Liu,et al.  Air PM2.5 concentration multi-step forecasting using a new hybrid modeling method: Comparing cases for four cities in China , 2019, Atmospheric Pollution Research.

[27]  Hong Zhang,et al.  A novel hybrid-Garch model based on ARIMA and SVM for PM2.5 concentrations forecasting , 2017 .

[28]  Marcella Busilacchio,et al.  Recursive neural network model for analysis and forecast of PM10 and PM2.5 , 2017 .

[29]  Martha Cobo,et al.  Discovering relationships and forecasting PM10 and PM2.5 concentrations in Bogotá, Colombia, using Artificial Neural Networks, Principal Component Analysis, and k-means clustering , 2018, Atmospheric Pollution Research.

[30]  Chao Chen,et al.  A hybrid framework for forecasting PM2.5 concentrations using multi-step deterministic and probabilistic strategy , 2019, Air Quality, Atmosphere & Health.

[31]  Anupam Yadav,et al.  A dynamic metaheuristic optimization model inspired by biological nervous systems: Neural network algorithm , 2018, Appl. Soft Comput..

[32]  Filippo Sorbello,et al.  Three hours ahead prevision of SO2 pollutant concentration using an Elman neural based forecaster , 2008 .

[33]  Brian Broderick,et al.  A GIS model for personal exposure to PM10 for Dublin commuters , 2015 .

[34]  K. Matthews,et al.  Chronic PM2.5 exposure and inflammation: determining sensitive subgroups in mid-life women. , 2014, Environmental research.

[35]  Wei Sun,et al.  Daily PM2.5 concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm. , 2017, Journal of environmental management.

[36]  Ping-Huan Kuo,et al.  A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities , 2018, Sensors.

[37]  Peizhi Li,et al.  The analysis and application of a new hybrid pollutants forecasting model using modified Kolmogorov-Zurbenko filter. , 2017, The Science of the total environment.

[38]  Ronald K. Pearson,et al.  Outliers in process modeling and identification , 2002, IEEE Trans. Control. Syst. Technol..

[39]  Subutai Ahmad,et al.  Unsupervised real-time anomaly detection for streaming data , 2017, Neurocomputing.

[40]  Ling Yang,et al.  PM2.5 forecasting using SVR with PSOGSA algorithm based on CEEMD, GRNN and GCA considering meteorological factors , 2018, Atmospheric Environment.

[41]  Chao Chen,et al.  Improved pollution forecasting hybrid algorithms based on the ensemble method , 2019, Applied Mathematical Modelling.

[42]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[43]  Liangzhu Wang,et al.  Forecasting simulations of indoor environment using data assimilation via an Ensemble Kalman Filter , 2013 .

[44]  Haixiang Guo,et al.  Research and application of a novel hybrid decomposition-ensemble learning paradigm with error correction for daily PM 10 forecasting , 2018 .

[45]  Michael J. Devaney,et al.  Bearing damage detection via wavelet packet decomposition of the stator current , 2004, IEEE Transactions on Instrumentation and Measurement.

[46]  Yanjun Shi,et al.  Two-step-hybrid model based on data preprocessing and intelligent optimization algorithms (CS and GWO) for NO2 and SO2 forecasting , 2019, Atmospheric Pollution Research.

[47]  Guilherme Carrilho da Graça,et al.  Impact of PM2.5 in indoor urban environments: A review , 2018, Sustainable Cities and Society.

[48]  Wei Gao,et al.  Numerical air quality forecasting over eastern China: An operational application of WRF-Chem , 2017 .