Fuzzy inductive reasoning forecasting strategies able to cope with missing data: A smart grid application

Dealing with missing data is of great practical and theoretical interest in forecasting applications. In this study, we deal with the problem of forecasting with missing data in smart grid and BEMS applications, where the information from home area sensors and/or smart meters is sometimes missing, which may hinder or even prevent the forecasting of the next hours and days. In concrete, we focus in a Soft Computing technique called Fuzzy Inductive Reasoning (FIR) and its improved version that can cope with missing information in the forecasting process: flexible FIR. In this article eight different strategies for flexible FIR forecasting are defined and studied taking into account: causal relevance of input variables, consistency of predictions, inertia criterion and K-Nearest Neighbours. Furthermore, we evaluate the implications of prediction accuracy and number of predictions, when the number of Missing Values (MVs) in the training dataset is increased progressively. To this end, a real smart grid forecasting application, i.e. electricity load forecasting, has been chosen in this study. The results show that all eight strategies proposed are able to cope with MVs and take advantage of the inherent information in the data, with better results in those strategies making use of causal relevance. In addition, the robustness of flexible FIR and its eight strategies are proved taking into account that the percentage of electricity load predictions from the test dataset is on average 96.15% when the %MVs in training dataset was around 73%.

[1]  MengChu Zhou,et al.  Generating Highly Accurate Predictions for Missing QoS Data via Aggregating Nonnegative Latent Factor Models , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[3]  Àngela Nebot,et al.  Hybrid methodologies for electricity load forecasting: Entropy-based feature selection with machine learning and soft computing techniques , 2015 .

[4]  Witold Pedrycz,et al.  Training a feed-forward network with incomplete data due to missing input variables , 2003, Appl. Soft Comput..

[5]  Àngela Nebot,et al.  Local Maximum Ozone Concentration Prediction Using Soft Computing Methodologies , 2003 .

[6]  Feng Wan,et al.  Pre-processing for missing data: A hybrid approach to air pollution prediction in Macau , 2010, 2010 IEEE International Conference on Automation and Logistics.

[7]  Enrico Zio,et al.  Reconstruction of missing data in multidimensional time series by fuzzy similarity , 2015, Appl. Soft Comput..

[8]  Àngela Nebot,et al.  A flexible fuzzy inductive reasoning approach for load modelling able to cope with missing data , 2015, SimuTools.

[9]  Àngela Nebot,et al.  Causal relevance to improve the prediction accuracy of dynamical systems using inductive reasoning , 2009, Int. J. Gen. Syst..

[10]  F. Castro,et al.  Causal Relevancy Approaches to Improve the Students' Prediction Performance in an e-Learning Environment , 2007, 2007 Sixth Mexican International Conference on Artificial Intelligence, Special Session (MICAI).

[11]  Robert P. Goldman,et al.  Imputation of Missing Data Using Machine Learning Techniques , 1996, KDD.

[12]  Volker Tresp,et al.  Training Neural Networks with Deficient Data , 1993, NIPS.

[13]  Tshilidzi Marwala,et al.  Modeling of missing data prediction: Computational intelligence and optimization algorithms , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[14]  Àngela Nebot,et al.  Confidence measures for predictions in fuzzy inductive reasoning , 2010, Int. J. Gen. Syst..

[15]  Cesare Alippi,et al.  Model ensemble for an effective on-line reconstruction of missing data in sensor networks , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[16]  Leonardo Franco,et al.  Missing data imputation using statistical and machine learning methods in a real breast cancer problem , 2010, Artif. Intell. Medicine.

[17]  Àngela Nebot,et al.  Modeling and Simulation of the Central Nervous System Control with Generic Fuzzy Models , 2003, Simul..

[18]  Àngela Nebot,et al.  Fuzzy inductive reasoning: a consolidated approach to data-driven construction of complex dynamical systems , 2012, Int. J. Gen. Syst..

[19]  Àngela Nebot,et al.  Short-term electric load forecasting using computational intelligence methods , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[20]  Àngela Nebot,et al.  Visual-FIR: A tool for model identification and prediction of dynamical complex systems , 2008, Simul. Model. Pract. Theory.

[21]  H. J. Van Zuylen,et al.  Accurate freeway travel time prediction with state-space neural networks under missing data , 2005 .

[22]  Spyros Makridakis,et al.  Accuracy measures: theoretical and practical concerns☆ , 1993 .

[23]  Àngela Nebot,et al.  A soft computing decision support framework to improve the e-learning experience , 2008, SpringSim '08.

[24]  George J. Klir,et al.  Architecture of Systems Problem Solving , 1985, Springer US.