A Novel Clustering Index to Find Optimal Clusters Size With Application to Segmentation of Energy Consumers

Increased deployment of residential smart meters has made it possible to record energy consumption data on short intervals. These data, if used efficiently, carry valuable information for managing power demand and increasing energy consumption efficiency. An efficient way to analyze these data is to first identify clusters of energy consumers, and then focus on analyzing these clusters. However deciding on the optimal number of clusters is a challenging task. In this article, we propose a clustering index that effectively finds the optimal number of clusters. The proposed index is an entropy-based measure that is obtained from eigenvalue analysis of the correlation matrix of time series of consumption data. A genetic algorithm based feature selection is used to reduce the number of features, which are then fed into clustering algorithms. We apply the proposed clustering index on two ground truth synthetic data sets and two real world energy consumption data set. The numerical simulations reveal the effectiveness of the proposed method and its superiority to a number of existing clustering indices.

[1]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[2]  Damminda Alahakoon,et al.  Multi-granular electricity consumer load profiling for smart homes using a scalable big data algorithm , 2018, Sustainable Cities and Society.

[3]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[4]  Maria G. Knyazeva,et al.  Assessment of EEG synchronization based on state-space analysis , 2005, NeuroImage.

[5]  Furong Li,et al.  Multi-resolution load profile clustering for smart metering data , 2016, 2017 IEEE Power & Energy Society General Meeting.

[6]  H. Vincent Poor,et al.  Multiobjective Optimization for Demand Side Management Program in Smart Grid , 2018, IEEE Transactions on Industrial Informatics.

[7]  Guoqiang Hu,et al.  A Cooperative Demand Response Scheme Using Punishment Mechanism and Application to Industrial Refrigerated Warehouses , 2014, IEEE Transactions on Industrial Informatics.

[8]  Peter A. Flach,et al.  Feature Construction and Calibration for Clustering Daily Load Curves from Smart-Meter Data , 2016, IEEE Trans. Ind. Informatics.

[9]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Minho Kim,et al.  New indices for cluster validity assessment , 2005, Pattern Recognit. Lett..

[11]  Reto Meuli,et al.  Dysconnection Topography in Schizophrenia Revealed with State-Space Analysis of EEG , 2007, PloS one.

[12]  Ram Rajagopal,et al.  Household Energy Consumption Segmentation Using Hourly Data , 2014, IEEE Transactions on Smart Grid.

[13]  Mahdi Jalili,et al.  Demand Response Planning Tool using Markov Decision Process , 2018, 2018 IEEE 16th International Conference on Industrial Informatics (INDIN).

[14]  Mahdi Jalili,et al.  Application of Deep Learning Long Short-Term Memory in Energy Demand Forecasting , 2019, EANN.

[15]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[16]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[17]  Ignacio González Alonso,et al.  Using the Big Data generated by the Smart Home to improve energy efficiency management , 2015, Energy Efficiency.

[18]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[19]  Peter Grindrod,et al.  Analysis and Clustering of Residential Customers Energy Behavioral Demand Using Smart Meter Data , 2016, IEEE Transactions on Smart Grid.

[20]  Daswin De Silva,et al.  A Data Mining Framework for Electricity Consumption Analysis From Meter Data , 2011, IEEE Transactions on Industrial Informatics.

[21]  Chongqing Kang,et al.  Sparse and Redundant Representation-Based Smart Meter Data Compression and Pattern Extraction , 2017, IEEE Transactions on Power Systems.

[22]  Xinghuo Yu,et al.  Smart Electricity Meter Data Intelligence for Future Energy Systems: A Survey , 2016, IEEE Transactions on Industrial Informatics.

[23]  Xiufeng Liu,et al.  A Hybrid ICT-Solution for Smart Meter Data Analytics , 2016, ArXiv.

[24]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[25]  Michalis Vazirgiannis,et al.  Clustering validity assessment: finding the optimal partitioning of a data set , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[26]  Pedro Faria,et al.  Aggregation and Remuneration of Electricity Consumers and Producers for the Definition of Demand-Response Programs , 2016, IEEE Transactions on Industrial Informatics.

[27]  Yu-Hsiang Hsiao,et al.  Household Electricity Demand Forecast Based on Context Information and User Daily Schedule Analysis From Meter Data , 2015, IEEE Transactions on Industrial Informatics.

[28]  Gianfranco Chicco,et al.  Overview and performance assessment of the clustering methods for electrical load pattern grouping , 2012 .

[29]  Latifa Oukhellou,et al.  A Dedicated Mixture Model for Clustering Smart Meter Data: Identification and Analysis of Electricity Consumption Behaviors , 2017 .

[30]  Xinghuo Yu,et al.  Advanced analytics for harnessing the power of smart meter big data , 2013, 2013 IEEE International Workshop on Inteligent Energy Systems (IWIES).