Privacy-Preserving Distributed Clustering for Electrical Load Profiling

Electrical load profiling supports retailers and distribution network operators in having a better understanding of the consumption behavior of consumers. However, traditional clustering methods for load profiling are centralized and require access to all the smart meter data, thus causing privacy issues for consumers and retailers. To tackle this issue, we propose a privacy-preserving distributed clustering framework for load profiling by developing a privacy-preserving accelerated average consensus (PP-AAC) algorithm with proven convergence. Using the proposed framework, we modify several commonly used clustering methods, including k-means, fuzzy C-means, and Gaussian mixture model, to provide privacy-preserving distributed clustering methods. In this way, load profiling can be performed only by local calculations and information sharing between neighboring data owners without sacrificing privacy. Meanwhile, compared to traditional centralized clustering methods, the computational time consumed by each data owner is significantly reduced. The privacy and complexity of the proposed privacy-preserving distributed clustering framework are analyzed. The correctness, efficiency, effectiveness, and privacy-preserving feature of the proposed framework and the proposed PP-AAC algorithm are verified using a real-world Irish residential dataset.

[1]  Goran Strbac,et al.  C-Vine Copula Mixture Model for Clustering of Residential Electrical Load Pattern Data , 2017, IEEE Transactions on Power Systems.

[2]  Devesh C. Jinwala,et al.  An Efficient Approach for Privacy Preserving Distributed K-Means Clustering Based on Shamir's Secret Sharing Scheme , 2012, IFIPTM.

[3]  Jiajia Yang,et al.  A Model of Customizing Electricity Retail Prices Based on Load Profile Clustering Analysis , 2019, IEEE Transactions on Smart Grid.

[4]  G. Chicco,et al.  Comparisons among clustering techniques for electricity customer classification , 2006, IEEE Transactions on Power Systems.

[5]  Graeme Burt,et al.  Enhanced Load Profiling for Residential Network Customers , 2014, IEEE Transactions on Power Delivery.

[6]  Zhenjun Ma,et al.  Identification of typical building daily electricity usage profiles using Gaussian mixture model-based clustering and hierarchical clustering , 2018, Applied Energy.

[7]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[8]  Chris Clifton,et al.  Privacy-preserving clustering with distributed EM mixture modeling , 2004, Knowledge and Information Systems.

[9]  K. Srinathan,et al.  Efficient Privacy Preserving K-Means Clustering , 2010, PAISI.

[10]  Wojciech Kwedlo,et al.  A clustering method combining differential evolution with the K-means algorithm , 2011, Pattern Recognit. Lett..

[11]  Jiguo Yu,et al.  Mutual Privacy Preserving $k$ -Means Clustering in Social Participatory Sensing , 2017, IEEE Transactions on Industrial Informatics.

[12]  Johan Driesen,et al.  The impact of vehicle-to-grid on the distribution grid , 2011 .

[13]  N.D. Hatziargyriou,et al.  Two-Stage Pattern Recognition of Load Curves for Classification of Electricity Customers , 2007, IEEE Transactions on Power Systems.

[14]  Richard M. Murray,et al.  Privacy preserving average consensus , 2014, 53rd IEEE Conference on Decision and Control.

[15]  Bruno Sinopoli,et al.  Communication Complexity and Energy Efficient Consensus Algorithm , 2010 .

[16]  Gianfranco Chicco,et al.  Overview and performance assessment of the clustering methods for electrical load pattern grouping , 2012 .

[17]  Yacine Challal,et al.  Efficient and Privacy-Preserving k-Means Clustering for Big Data Mining , 2016, 2016 IEEE Trustcom/BigDataSE/ISPA.

[18]  Bruno Francois,et al.  Energy Management and Operational Planning of a Microgrid With a PV-Based Active Generator for Smart Grid Applications , 2011, IEEE Transactions on Industrial Electronics.

[19]  Luis Orozco-Barbosa,et al.  Privacy Preserving k-Means Clustering in Multi-Party Environment , 2007, SECRYPT.

[20]  Bikash Pal,et al.  Statistical Representation of Distribution System Loads Using Gaussian Mixture Model , 2010, IEEE Transactions on Power Systems.

[21]  Gabriela Hug,et al.  Forecasting of Smart Meter Time Series Based on Neural Networks , 2016, DARE@PKDD/ECML.

[22]  Ling Shi,et al.  Consensus-Based Data-Privacy Preserving Data Aggregation , 2019, IEEE Transactions on Automatic Control.

[23]  Richard Weber,et al.  Soft clustering - Fuzzy and rough approaches and their extensions and derivatives , 2013, Int. J. Approx. Reason..

[24]  Kuo-Lung Wu,et al.  Analysis of parameter selections for fuzzy c-means , 2012, Pattern Recognit..

[25]  Chris Clifton,et al.  Tools for privacy preserving distributed data mining , 2002, SKDD.

[26]  Stephen P. Boyd,et al.  A scheme for robust distributed sensor fusion based on average consensus , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[27]  Peter Grindrod,et al.  Analysis and Clustering of Residential Customers Energy Behavioral Demand Using Smart Meter Data , 2016, IEEE Transactions on Smart Grid.

[28]  D.Z. Marques,et al.  A comparative analysis of neural and fuzzy cluster techniques applied to the characterization of electric load in substations , 2004, 2004 IEEE/PES Transmision and Distribution Conference and Exposition: Latin America (IEEE Cat. No. 04EX956).

[29]  M. C. Ortiz,et al.  Selecting variables for k-means cluster analysis by using a genetic algorithm that optimises the silhouettes , 2004 .

[30]  Mikko Kolehmainen,et al.  Data-based method for creating electricity use load profiles using large amount of customer-specific hourly measured electricity use data , 2010 .

[31]  Safia Nait Bahloul,et al.  Privacy preserving k-means clustering: a survey research , 2012, Int. Arab J. Inf. Technol..

[32]  Purnima Bholowalia,et al.  EBK-Means: A Clustering Technique based on Elbow Method and K-Means in WSN , 2014 .

[33]  Geoffrey J. McLachlan,et al.  Corruption-Resistant Privacy Preserving Distributed EM Algorithm for Model-Based Clustering , 2017, 2017 IEEE Trustcom/BigDataSE/ICESS.

[34]  Yi Wang,et al.  Clustering of Electricity Consumption Behavior Dynamics Toward Big Data Applications , 2016, IEEE Transactions on Smart Grid.

[35]  Patrick D. McDaniel,et al.  Security and Privacy Challenges in the Smart Grid , 2009, IEEE Security & Privacy.

[36]  Furong Li,et al.  A novel time-of-use tariff design based on Gaussian Mixture Model , 2016 .

[37]  Chunhua Su,et al.  Privacy-Preserving Two-Party K-Means Clustering via Secure Approximation , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[38]  T. C. Aysal,et al.  Accelerated Distributed Average Consensus via Localized Node State Prediction , 2009, IEEE Transactions on Signal Processing.