Daily PM2.5 concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm.

Increased attention has been paid to PM2.5 pollution in China. Due to its detrimental effects on environment and health, it is important to establish a PM2.5 concentration forecasting model with high precision for its monitoring and controlling. This paper presents a novel hybrid model based on principal component analysis (PCA) and least squares support vector machine (LSSVM) optimized by cuckoo search (CS). First PCA is adopted to extract original features and reduce dimension for input selection. Then LSSVM is applied to predict the daily PM2.5 concentration. The parameters in LSSVM are fine-tuned by CS to improve its generalization. An experiment study reveals that the proposed approach outperforms a single LSSVM model with default parameters and a general regression neural network (GRNN) model in PM2.5 concentration prediction. Therefore the established model presents the potential to be applied to air quality forecasting systems.

[1]  J. Chow,et al.  A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile , 2008 .

[2]  Xin-She Yang,et al.  Cuckoo Search via Lévy flights , 2009, 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC).

[3]  G H Huang,et al.  A coupled factorial-analysis-based interval programming approach and its application to air quality management , 2013, Journal of the Air & Waste Management Association.

[4]  Yang Zhang,et al.  Real-time air quality forecasting, part I: History, techniques, and current status , 2012 .

[5]  Weidong Zhang,et al.  Prediction of 24-hour-average PM(2.5) concentrations using a hidden Markov model with different emission distributions in Northern California. , 2013, The Science of the total environment.

[6]  Ari Karppinen,et al.  Evaluation of a multiple regression model for the forecasting of the concentrations of NOx and PM10 in Athens and Helsinki. , 2011, The Science of the total environment.

[7]  Fong-Ching Yuan,et al.  Using least square support vector regression with genetic algorithm to forecast beta systematic risk , 2015, J. Comput. Sci..

[8]  Shikha Gupta,et al.  Linear and nonlinear modeling approaches for urban air quality prediction. , 2012, The Science of the total environment.

[9]  Qi Wu,et al.  Hybrid model based on wavelet support vector machine and modified genetic algorithm penalizing Gaussian noises for power load forecasts , 2011, Expert Syst. Appl..

[10]  Shiming Zheng,et al.  The impacts of provincial energy and environmental policies on air pollution control in China , 2015 .

[11]  Dongsheng Chen,et al.  A neural network based ensemble approach for improving the accuracy of meteorological fields used for regional air quality modeling. , 2012, Journal of environmental management.

[12]  W. You,et al.  Estimating PM2.5 in Xi'an, China using aerosol optical depth: a comparison between the MODIS and MISR retrieval models. , 2015, The Science of the total environment.

[13]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[14]  Qi Li,et al.  Artificial neural networks forecasting of PM2.5 pollution using air mass trajectory based geographic model and wavelet transformation , 2015 .

[15]  Prem B. Parajuli,et al.  A STELLA Model to Estimate Soil CO2 Emissions from a Short-Rotation Woody Crop , 2012, Water, Air, & Soil Pollution.

[16]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[17]  Shahbaz Gul Hassan,et al.  Prediction of the temperature in a Chinese solar greenhouse based on LSSVM optimized by improved PSO , 2016, Comput. Electron. Agric..

[18]  Jiming Hao,et al.  Air pollution and control action in Beijing , 2016 .

[19]  Guohe Huang,et al.  Interactive Fuzzy Boundary Interval Programming for Air Quality Management Under Uncertainty , 2013, Water, Air, & Soil Pollution.

[20]  Guohe Huang,et al.  A sequential factorial analysis approach to characterize the effects of uncertainties for supporting air quality management , 2013 .

[21]  Yuehua Huang,et al.  Short-term wind power prediction based on LSSVM–GSA model , 2015 .

[22]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[23]  Xiaolei Li,et al.  Traffic Flow Forecasting by a Least Squares Support Vector Machine with a Fruit Fly Optimization Algorithm , 2016 .

[24]  N. Pérez,et al.  Neural network model for the prediction of PM10 daily concentrations in two sites in the Western Mediterranean. , 2013, The Science of the total environment.

[25]  Bijan Yeganeh,et al.  Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model , 2012 .

[26]  Yong Liu,et al.  A novel hybrid forecasting model for PM₁₀ and SO₂ daily concentrations. , 2015, The Science of the total environment.

[27]  Le Jian,et al.  An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. , 2012, The Science of the total environment.

[28]  Yan Wang,et al.  Air quality assessment by contingent valuation in Ji'nan, China. , 2009, Journal of environmental management.

[29]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[30]  Jianzhou Wang,et al.  A hybrid model for PM₂.₅ forecasting based on ensemble empirical mode decomposition and a general regression neural network. , 2014, The Science of the total environment.

[31]  Mohsen Safari,et al.  A novel PSO-LSSVM model for predicting liquid rate of two phase flow through wellhead chokes , 2015 .

[32]  Siti Sakira Kamaruddin,et al.  Gasoline Price Forecasting: An Application of LSSVM with Improved ABC , 2014 .

[33]  Yang Zhang,et al.  Real-time air quality forecasting, part II: State of the science, current research needs, and future prospects , 2012 .

[34]  Huiru Zhao,et al.  Annual Electric Load Forecasting by a Least Squares Support Vector Machine with a Fruit Fly Optimization Algorithm , 2012 .

[35]  Bruce Misstear,et al.  Real time air quality forecasting using integrated parametric and non-parametric regression techniques , 2015 .