Feature Selection and Evolutionary Rule Learning for Big Data in Smart Building Energy Management

Since buildings are one of the largest sources of energy consumption in most cities of the world, energy management is one of the major concerns in their design. To ameliorate this problem, buildings are becoming smarter by the incorporation of intelligent supervision and control systems. Data captured by the sensors can be interpreted and processed by rule-based computation methods of biological inspiration (such as genetic fuzzy systems, GFS) for predicting the future behavior of the building in a knowledge-based interpretable human-like manner. GFS are computational models inspired in human cognition which use evolutionary computation (inspired in the natural evolution) to automatically learn fuzzy rules which contain explicit imprecise knowledge about a system or process. This knowledge, represented using fuzzy rules that involve fuzzy linguistic variables and values, is used to perform approximate reasoning on the input values for obtaining inferred values for the output variables. In energy management of buildings, these rules allow a smart control of the system actuators to reduce the building average energy consumption. However, the large amount of data produced on a per second basis complicates the generation of accurate and interpretable models by means of traditional methods. In this paper, we present an evolutionary computation-based approach, namely a genetic fuzzy system, to build scalable and interpretable knowledge bases for predicting energy consumption in smart buildings. For accomplishing this task, we propose a cognitive computation system for multi-step prediction based on S-FRULER, a state-of-the-art scalable distributed GFS, coupled with a feature subset selection method to automatically select the most relevant features for different time steps. S-FRULER is able to learn a fuzzy rule-based system made up of Takagi-Sugeno-Kang (TSK) rules that are able to predict the output values using both linguistic imprecise knowledge (represented by fuzzy sets) and fuzzy inference. Experiments with real data on two different problems related with the energy management revealed an average improvement of 6% on accuracy with respect to S-FRULER without feature selection, and with knowledge bases with a lower number of variables.

[1]  Alan Meier,et al.  Rating the energy performance of buildings , 2004 .

[2]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[3]  Rafael Marcos Luque Baena,et al.  Study and classification of plum varieties using image analysis and deep learning techniques , 2018, Progress in Artificial Intelligence.

[4]  M. Woloszyn,et al.  Numerical prediction of indoor air humidity and its effect on indoor environment , 2003 .

[5]  Jie Zhao,et al.  Occupant behavior and schedule modeling for building energy simulation through office appliance power consumption data mining , 2014 .

[6]  Miriam A. M. Capretz,et al.  Machine Learning With Big Data: Challenges and Approaches , 2017, IEEE Access.

[7]  Athanasios V. Vasilakos,et al.  Big data: From beginning to future , 2016, Int. J. Inf. Manag..

[8]  Nursyarizal Mohd Nor,et al.  A review on optimized control systems for building energy and comfort management of smart sustainable buildings , 2014 .

[9]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[10]  María José del Jesús,et al.  Revisiting Evolutionary Fuzzy Systems: Taxonomy, applications, new trends and challenges , 2015, Knowl. Based Syst..

[11]  Hong Qiao,et al.  A Novel Manifold Regularized Online Semi-supervised Learning Model , 2018, Cognitive Computation.

[12]  María José del Jesús,et al.  Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks , 2014, WIREs Data Mining Knowl. Discov..

[13]  Amaury Lendasse,et al.  Anomaly-Based Intrusion Detection Using Extreme Learning Machine and Aggregation of Network Traffic Statistics in Probability Space , 2018, Cognitive Computation.

[14]  Manuel Mucientes,et al.  FRULER: Fuzzy Rule Learning through Evolution for Regression , 2015, Inf. Sci..

[15]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[16]  Francisco Herrera,et al.  A Fast and Scalable Multiobjective Genetic Fuzzy System for Linguistic Fuzzy Modeling in High-Dimensional Regression Problems , 2011, IEEE Transactions on Fuzzy Systems.

[17]  Sylvain Robert,et al.  State of the art in building modelling and energy performances prediction: A review , 2013 .

[18]  Gianluca Bontempi,et al.  Machine Learning Strategies for Time Series Forecasting , 2012, eBISS.

[19]  F. Gomide,et al.  Ten years of genetic fuzzy systems: current framework and new trends , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[20]  Manuel Mucientes,et al.  Scalable modeling of thermal dynamics in buildings using fuzzy rules for regression , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[21]  Francisco Herrera,et al.  Genetic fuzzy systems: taxonomy, current research trends and prospects , 2008, Evol. Intell..

[22]  H. Ishibuchi,et al.  Empirical study on learning in fuzzy systems by rice taste analysis , 1994 .

[23]  Davide Anguita,et al.  Big Data Analytics in the Cloud: Spark on Hadoop vs MPI/OpenMP on Beowulf , 2015, INNS Conference on Big Data.

[24]  Manuel Mucientes,et al.  S-FRULER: Scalable fuzzy rule learning through evolution for regression , 2016, Knowl. Based Syst..

[25]  Antonio A. Márquez,et al.  An efficient adaptive fuzzy inference system for complex and high dimensional regression problems in linguistic fuzzy modelling , 2013, Knowl. Based Syst..

[26]  Elena Marchiori,et al.  Class Conditional Nearest Neighbor for Large Margin Instance Selection , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Tao Lu,et al.  Prediction of indoor temperature and relative humidity using neural network models: model comparison , 2009, Neural Computing and Applications.

[28]  Hossam Faris,et al.  Simultaneous Feature Selection and Support Vector Machine Optimization Using the Grasshopper Optimization Algorithm , 2018, Cognitive Computation.

[29]  George Baird,et al.  Energy Performance Buildings , 2017 .

[30]  Jesús Alcalá-Fdez,et al.  Genetic learning of accurate and compact fuzzy rule based systems based on the 2-tuples linguistic representation , 2007, Int. J. Approx. Reason..

[31]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[32]  Yi Huang,et al.  A Semi-blind Model with Parameter Identification for Building Temperature Estimation , 2017, Cognitive Computation.

[33]  Manuel Mucientes,et al.  Reducing the complexity in genetic learning of accurate regression TSK rule-based systems , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[34]  Lala Septem Riza,et al.  frbs: Fuzzy Rule-Based Systems for Classification and Regression in R , 2015 .

[35]  Baojun Zhao,et al.  Conditional Random Mapping for Effective ELM Feature Representation , 2018, Cognitive Computation.

[36]  Francisco Herrera,et al.  Big Data: Tutorial and guidelines on information and process fusion for analytics algorithms with MapReduce , 2018, Inf. Fusion.

[37]  Bertil Thomas,et al.  Artificial neural network models for indoor temperature prediction: investigations in two buildings , 2006, Neural Computing and Applications.

[38]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[39]  Haibo Wang,et al.  Observer-Based Stabilization Control of Time-Delay T-S Fuzzy Systems via the Non-Uniform Delay Partitioning Approach , 2017, Cognitive Computation.

[40]  Jesús Alcalá-Fdez,et al.  A Proposal for the Genetic Lateral Tuning of Linguistic Fuzzy Systems and Its Interaction With Rule Selection , 2007, IEEE Transactions on Fuzzy Systems.

[41]  António E. Ruano,et al.  Prediction of building's temperature using neural networks models , 2006 .

[42]  Natasha Balac,et al.  Large Scale predictive analytics for real-time energy management , 2013, 2013 IEEE International Conference on Big Data.

[43]  Frédéric Magoulès,et al.  A review on the prediction of building energy consumption , 2012 .

[44]  Mohcine Zouak,et al.  A comparison of linear and neural network ARX models applied to a prediction of the indoor temperature of a building , 2004, Neural Computing & Applications.

[45]  Manuel Mucientes,et al.  An instance selection algorithm for regression and its application in variance reduction , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[46]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[47]  Shengwei Wang,et al.  Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques , 2014 .