Data Mining and ANFIS Application to Particulate Matter Air Pollutant Prediction. A Comparative Study

The paper analyzes two artificial intelligence methods for particulate matter air pollutant prediction, namely data mining and adaptive neuro-fuzzy inference system (ANFIS). Both methods provide predictive knowledge under the form of rule base, the first method, data mining, as an explicit rule base, and ANFIS as an internal fuzzy rule base used to perform predictions. In order to determine the optimal number of prediction model inputs, we have perform a correlation analysis between particulate matter and other air pollutants. This operation imposed NO2 and CO concentrations as inputs of the prediction model, together with four values of PM10 concentration (from current hour to three hours ago), the output of the model being the prediction of the next hour PM10 concentration. The two prediction models are investigated through simulation in different structures and configurations using SAS® and MATLAB® respectively. The results are compared in terms of statistical parameters (RMSE, MAPE) and simulation time.

[1]  Krzysztof Siwek,et al.  Evolving the ensemble of predictors model for forecasting the daily average PM10 , 2011 .

[2]  Rajan Chattamvelli Data Mining Methods , 2009 .

[3]  M. Oprea,et al.  Particulate Matter Air Pollutants Forecasting using Inductive Learning Approach , 2016 .

[4]  Kemal Polat,et al.  Usage of output-dependent data scaling in modeling and prediction of air pollution daily concentration values (PM10) in the city of Konya , 2011, Neural Computing and Applications.

[5]  Helena Gómez-Adorno,et al.  Forecast of Air Quality Based on Ozone by Decision Trees and Neural Networks , 2012, MICAI.

[6]  Petr Hájek,et al.  Predicting Common Air Quality Index-The Case of Czech Microregions , 2015 .

[7]  Mihaela Oprea,et al.  Particulate matter prediction using ANFIS modelling techniques , 2015, 2015 19th International Conference on System Theory, Control and Computing (ICSTCC).

[8]  Andreas Kerschbaumer,et al.  A new structure identification scheme for ANFIS and its application for the simulation of virtual air pollution monitoring stations in urban areas , 2015, Eng. Appl. Artif. Intell..

[9]  S. Osowski,et al.  Data mining methods for prediction of air pollution , 2016, Int. J. Appl. Math. Comput. Sci..

[10]  Shadi Ausati,et al.  Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM 2.5 , 2016 .

[11]  Amit Kumar Gorai,et al.  Development of ANFIS models for air quality forecasting and input optimization for reducing the computational cost and time , 2016 .

[12]  S. Samarasinghe,et al.  Complex time series analysis of PM10 and PM2.5 for a coastal site using artificial neural network modelling and k-means clustering , 2014 .

[13]  Jesús Ariel Carrasco-Ochoa,et al.  Assessment and prediction of air quality using fuzzy logic and autoregressive models , 2012 .

[14]  Patricio Perez,et al.  PM2.5 forecasting in a large city: Comparison of three methods , 2008 .

[15]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[16]  Marek Wojtylak,et al.  APPLICATION DATA MINING FOR FORECASTING OF HIGHT-LEVEL AIR POLLUTION IN URBAN-INDUSTRIAL AREA IN SOUTHERN POLAND , 2007 .

[17]  M. Imteaz,et al.  Seasonal rainfall forecasting by adaptive network-based fuzzy inference system (ANFIS) using large scale climate signals , 2016, Climate Dynamics.

[18]  Dobrivoje Popovic,et al.  Computational Intelligence in Time Series Forecasting: Theory and Engineering Applications (Advances in Industrial Control) , 2005 .

[19]  P. Goyal,et al.  Artificial intelligence based approach to forecast PM2.5 during haze episodes: A case study of Delhi, India , 2015 .

[20]  Marija Savic,et al.  Adaptive-network-based fuzzy inference system (ANFIS) modelbased prediction of the surface ozone concentration , 2014 .

[21]  Koji Zettsu,et al.  Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2.5 , 2015, Neural Computing and Applications.

[22]  Mihaela Oprea,et al.  A comparative study of computational intelligence techniques applied to PM2.5 air pollution forecasting , 2016, 2016 6th International Conference on Computers Communications and Control (ICCCC).

[23]  Roozbeh Shad,et al.  Predicting air pollution using fuzzy genetic linear membership kriging in GIS , 2009, Comput. Environ. Urban Syst..

[24]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.

[25]  Diana Domanska,et al.  Application of fuzzy time series models for forecasting pollution concentrations , 2012, Expert Syst. Appl..

[26]  Yang Zhang,et al.  Real-time air quality forecasting, part I: History, techniques, and current status , 2012 .

[27]  Sascha Schubert,et al.  Time Series Data Mining with SAS ® Enterprise Miner ™ , 2011 .

[28]  Jae Kwon Bae,et al.  Combining models from neural networks and inductive learning algorithms , 2011, Expert Syst. Appl..

[29]  Pericles A. Mitkas,et al.  Development and evaluation of data mining models for air quality prediction in Athens, Greece , 2009, ITEE.