k-Shape clustering algorithm for building energy usage patterns analysis and forecasting model accuracy improvement

Abstract Clustering algorithms have been successfully applied in analyzing building energy consumption data. It has proven to be an effective technique to identify representative energy consumption patterns as well as being a pre-processing step for other techniques. In this paper, we propose a clustering method based on k-shape algorithm, which is a relatively novel method to identify shape patterns in time-series data. In the experiment, clustering is performed for each individual building according to its hourly consumption. The novelty of this paper is that a new k-shape algorithm is applied to detect building-energy usage patterns at different levels, and the clustering result is further utilized to improve the accuracy of forecasting models. Ten institutional buildings covering three different typologies are used as case studies and a set of hourly and weekly energy consumption data is further analyzed in this paper. The experimental results reveal that this proposed method can detect building energy usage patterns in different time granularity effectively and also proves that the forecasting accuracy of SVR model is significantly improved by utilizing the results of the proposed clustering method.

[1]  Siew Eang Lee,et al.  Retrofitting solutions for two different occupancy levels of educational buildings in tropics , 2018 .

[2]  Michael Conlon,et al.  A clustering approach to domestic electricity load profile characterisation using smart metering data , 2015 .

[3]  Cláudia Antunes,et al.  Temporal Data Mining: an overview , 2001 .

[4]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[5]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[6]  Iñaki Albisua,et al.  SEP/COP: An efficient method to find the best partition in hierarchical clustering based on a new cluster validity index , 2010, Pattern Recognit..

[7]  Tingting Guo,et al.  Short-Term Load Forecasting for Electric Power Systems Using the PSO-SVR and FCM Clustering Techniques , 2011 .

[8]  Mohammad Yusri Hassan,et al.  A review on applications of ANN and SVM for building electrical energy consumption forecasting , 2014 .

[9]  Jin Wen,et al.  Net-zero energy building clusters emulator for energy planning and operation evaluation , 2017, Comput. Environ. Urban Syst..

[10]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[11]  Peng Xu,et al.  Demand reduction in building energy systems based on economic model predictive control , 2012 .

[12]  Ian F. C. Smith,et al.  A Bounded Index for Cluster Validity , 2007, MLDM.

[13]  Luis Gravano,et al.  k-Shape: Efficient and Accurate Clustering of Time Series , 2015, SIGMOD Conference.

[14]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[15]  S. Dolnicar,et al.  An examination of indexes for determining the number of clusters in binary data sets , 2002, Psychometrika.

[16]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[17]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[18]  Pierre Gançarski,et al.  A global averaging method for dynamic time warping, with applications to clustering , 2011, Pattern Recognit..

[19]  Siew Eang Lee,et al.  Review of occupancy sensing systems and occupancy modeling methodologies for the application in institutional buildings , 2016 .

[20]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[21]  G. Chicco,et al.  Comparisons among clustering techniques for electricity customer classification , 2006, IEEE Transactions on Power Systems.

[22]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[23]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  G. W. Milligan,et al.  An examination of procedures for determining the number of clusters in a data set , 1985 .

[25]  R. L. Thorndike Who belongs in the family? , 1953 .

[26]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[27]  Diego Klabjan,et al.  Clustering time-series energy data from smart meters , 2015, 1603.07602.

[28]  G. K. Papagiannis,et al.  Load profiling in the deregulated electricity markets: A review of the applications , 2012, 2012 9th International Conference on the European Energy Market.

[29]  Nima Amjady,et al.  Short-term hourly load forecasting using time-series modeling with peak load estimation capability , 2001 .

[30]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Johan A. K. Suykens,et al.  Identifying Customer Profiles in Power Load Time Series Using Spectral Clustering , 2009, ICANN.

[32]  Z. Vale,et al.  Typical load profiles in the smart grid context — A clustering methods comparison , 2012, 2012 IEEE Power and Energy Society General Meeting.

[33]  Minho Kim,et al.  New indices for cluster validity assessment , 2005, Pattern Recognit. Lett..

[34]  Fan Zhang,et al.  Time series forecasting for building energy consumption using weighted Support Vector Regression with differential evolution optimization technique , 2016 .

[35]  Belén Carro,et al.  Classification and Clustering of Electricity Demand Patterns in Industrial Parks , 2012 .

[36]  Fan Zhang,et al.  A review on time series forecasting techniques for building energy consumption , 2017 .

[37]  Eamonn J. Keogh,et al.  Time Series Epenthesis: Clustering Time Series Streams Requires Ignoring Some Data , 2011, 2011 IEEE 11th International Conference on Data Mining.

[38]  Sylvain Robert,et al.  State of the art in building modelling and energy performances prediction: A review , 2013 .

[39]  Fábio Gonçalves Jota,et al.  Building load management using cluster and statistical analyses , 2011 .

[40]  Tania Cerquitelli,et al.  Fault Detection Analysis of Building Energy Consumption Using Data Mining Techniques , 2013 .

[41]  Chirag Deb,et al.  Forecasting diurnal cooling energy load for institutional buildings using Artificial Neural Networks , 2016 .

[42]  Frédéric Magoulès,et al.  A review on the prediction of building energy consumption , 2012 .

[43]  Tim Oates,et al.  Identifying distinctive subsequences in multivariate time series by clustering , 1999, KDD '99.

[44]  Shubha Singh,et al.  A Survey of Clustering Techniques , 2010 .

[45]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.