An Adaptive Weighted Pearson Similarity Measurement Method for Load Curve Clustering

Load curve data from advanced metering infrastructure record consumers’ electricity usage behavior. User consumption models support more intelligent power provisioning, and clustering load data is one of the popular approaches for building these models. Similarity measurement is central to any clustering model, but load curves are time series, and traditional measurement methods are not well suited to them. To cluster load curve data more accurately, this paper applies an enhanced Pearson similarity. The method introduces the concept of a ‘trend alteration point’ and integrates it with the Pearson similarity. By weighting the Pearson distance, the method preserves both the overall contour of the load data and partial (local) similarity. Based on the weighted Pearson distance, a weighted Pearson-based hierarchical clustering algorithm is proposed. Several years of load curve data are used for evaluation, and several user consumption models are identified and analyzed. Results show that the proposed method improves the accuracy of load data clustering.
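
The abstract does not give the exact weighting scheme, so the following is only a minimal sketch of the general idea, under stated assumptions. A trend alteration point is assumed here to be an index where the first difference of the curve changes sign, and such points are assumed to receive a larger weight (the hypothetical parameter `boost`) in a weighted Pearson correlation. The distance is taken as one minus that correlation, and plain average-linkage agglomerative clustering stands in for the paper's hierarchical algorithm. All function names are illustrative and not taken from the paper.

```python
import math

def trend_alteration_points(curve):
    """Indices where the first difference changes sign (assumed definition)."""
    points = []
    for i in range(1, len(curve) - 1):
        left = curve[i] - curve[i - 1]
        right = curve[i + 1] - curve[i]
        if left * right < 0:
            points.append(i)
    return points

def make_weights(a, b, boost=2.0):
    """Weight 1 everywhere, `boost` at trend alteration points of either curve (assumption)."""
    weights = [1.0] * len(a)
    for i in set(trend_alteration_points(a)) | set(trend_alteration_points(b)):
        weights[i] = boost
    return weights

def weighted_pearson(a, b, w):
    """Standard weighted Pearson correlation coefficient."""
    total = sum(w)
    mean_a = sum(wi * x for wi, x in zip(w, a)) / total
    mean_b = sum(wi * y for wi, y in zip(w, b)) / total
    cov = sum(wi * (x - mean_a) * (y - mean_b) for wi, x, y in zip(w, a, b))
    var_a = sum(wi * (x - mean_a) ** 2 for wi, x in zip(w, a))
    var_b = sum(wi * (y - mean_b) ** 2 for wi, y in zip(w, b))
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat curve has no defined correlation; treated as uncorrelated
    r = cov / math.sqrt(var_a * var_b)
    return max(-1.0, min(1.0, r))  # clamp floating-point overshoot

def weighted_pearson_distance(a, b, boost=2.0):
    """Distance in [0, 2]: 0 for identical shape, 2 for perfectly opposite shape."""
    return 1.0 - weighted_pearson(a, b, make_weights(a, b, boost))

def hierarchical_clusters(curves, n_clusters):
    """Average-linkage agglomerative clustering on the weighted Pearson distance."""
    n = len(curves)
    dist = {(i, j): weighted_pearson_distance(curves[i], curves[j])
            for i in range(n) for j in range(i + 1, n)}

    def linkage(c1, c2):
        pairs = [dist[(min(i, j), max(i, j))] for i in c1 for j in c2]
        return sum(pairs) / len(pairs)

    clusters = [[i] for i in range(n)]
    while len(clusters) > n_clusters:
        # Merge the two closest clusters.
        _, p, q = min((linkage(clusters[p], clusters[q]), p, q)
                      for p in range(len(clusters))
                      for q in range(p + 1, len(clusters)))
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]
    return clusters

# Toy load curves: two with a mid-period peak, two with an early peak.
curves = [
    [1, 1, 2, 5, 2, 1],
    [2, 2, 4, 10, 4, 2],
    [5, 2, 1, 1, 1, 2],
    [10, 4, 2, 2, 2, 4],
]
clusters = hierarchical_clusters(curves, 2)
print(clusters)
```

Because Pearson correlation is invariant to scale, curves with the same shape but different magnitudes end up close together, and the extra weight at the assumed trend alteration points makes agreement at peaks and valleys count for more than agreement on flat stretches.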
