Data size reduction with symbolic aggregate approximation for electrical load pattern grouping

Data size reduction techniques may be helpful in the process of categorising the electrical load consumption patterns on the basis of their shape. Starting from a macro-class of consumers defined according to certain general criteria on the type of consumers and the period of the year or week, the representative load pattern (RLP) of each consumer can be built by considering the data points in the time domain or a reduced number of features. This study exploits the effects of using the symbolic aggregate approximation (SAX) method to form the reduced set of features. The portion of the time–amplitude plane defining the RLP is partitioned into sub-portions on the basis of the characteristics of the whole data set. A specific partitioning of the time axis is proposed on the basis of the cumulative distribution function of the RLP variations in time. Each RLP is then coded according to the SAX principles. The resulting codes are then used into a hierarchical clustering procedure. The validity of the clustering results obtained by using the SAX data representation with the proposed non-uniform partitioning of the time axis is presented and discussed, including comparisons with the results obtained from other data size reduction techniques.

[1]  B.D. Pitt,et al.  Application of data mining techniques to load profiling , 1999, Proceedings of the 21st International Conference on Power Industry Computer Applications. Connecting Utilities. PICA 99. To the Millennium and Beyond (Cat. No.99CH36351).

[2]  Z. Styczynski,et al.  Application of statistical and neural approaches to the daily load profiles modelling in power distribution systems , 1999, 1999 IEEE Transmission and Distribution Conference (Cat. No. 99CH36333).

[3]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[4]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[5]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[6]  Z. Vale,et al.  An electric energy consumer characterization framework based on data mining techniques , 2005, IEEE Transactions on Power Systems.

[7]  N.D. Hatziargyriou,et al.  Two-Stage Pattern Recognition of Load Curves for Classification of Electricity Customers , 2007, IEEE Transactions on Power Systems.

[8]  Eamonn J. Keogh,et al.  iSAX: indexing and mining terabyte sized time series , 2008, KDD.

[9]  G. Chicco,et al.  Support Vector Clustering of Electrical Load Pattern Data , 2009, IEEE Transactions on Power Systems.

[10]  D. Kirschen,et al.  Quantifying the Effect of Demand Response on Electricity Markets , 2007, IEEE Transactions on Power Systems.