Privacy-Preserving Distributed Joint Probability Modeling for Spatial-Correlated Wind Farms

Building the joint probability distribution (JPD) of multiple spatial-correlated wind farms (WFs) is critical for chance-constrained optimal decision-making. The vertical partitioning historical wind power data of WFs is the premise of training the JPD. However, to protect data privacy, WFs with different stakeholders will refuse to share raw data directly or send raw data to a third party as no one knows whether the third party can be trusted. Moreover, the centralized training way is also faced with costly high bandwidth communication, single-point failure and limited scalability. To solve the problems, distributed algorithm is an alternative. But to the best of our knowledge, rarely has literature proposed privacy-preserving distributed (PPD) algorithm to build the JPD of spatial-correlated WFs. Therefore, based on the additive homomorphic encryption and the average consensus algorithm, we first propose a PPD summation algorithm. Meanwhile, based on the binary hash function and the average consensus algorithm, we then present a PPD inner product algorithm. Thereafter, combining the PPD summation and inner product algorithms, a PPD expectation-maximization algorithm for training the Gaussian-mixture-model-based JPD of WFs is eventually developed. The correctness and the robustness to communicate failure of the proposed algorithm is empirically verified using historical data.

[1]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[2]  Bri-Mathias Hodge,et al.  Overview and Meteorological Validation of the Wind Integration National Dataset toolkit , 2015 .

[3]  Craig Gentry,et al.  Fully homomorphic encryption using ideal lattices , 2009, STOC '09.

[4]  Ümit Y. Ogras,et al.  Dimensionality reduction using magnitude and shape approximations , 2003, CIKM '03.

[5]  Yang Weng,et al.  Diffusion-Based EM Algorithm for Distributed Estimation of Gaussian Mixtures in Wireless Sensor Networks , 2011, Sensors.

[6]  Sergio Barbarossa,et al.  Consensus for distributed EM-based clustering in WSNs , 2010, 2010 IEEE Sensor Array and Multichannel Signal Processing Workshop.

[7]  Javier Contreras,et al.  A Chance-Constrained Unit Commitment With an $n-K$ Security Criterion and Significant Wind Generation , 2013, IEEE Transactions on Power Systems.

[8]  Gu Si-yang,et al.  Privacy preserving association rule mining in vertically partitioned data , 2006 .

[9]  Jin Lin,et al.  A Versatile Probability Distribution Model for Wind Power Forecast Errors and Its Application in Economic Dispatch , 2013, IEEE Transactions on Power Systems.

[10]  Geoffrey J. McLachlan,et al.  Corruption-Resistant Privacy Preserving Distributed EM Algorithm for Model-Based Clustering , 2017, 2017 IEEE Trustcom/BigDataSE/ICESS.

[11]  Yongpei Guan,et al.  A Chance-Constrained Two-Stage Stochastic Program for Unit Commitment With Uncertain Wind Power Output , 2012, IEEE Transactions on Power Systems.

[12]  Sanparith Marukatat,et al.  Fast nearest neighbor retrieval using randomized binary codes and approximate Euclidean distance , 2013, Pattern Recognit. Lett..

[13]  Hiroshi Nakagawa,et al.  Privacy-Preserving EM Algorithm for Clustering on Social Network , 2012, PAKDD.

[14]  Bertha Guijarro-Berdiñas,et al.  A survey of methods for distributed machine learning , 2012, Progress in Artificial Intelligence.

[15]  Ömer Egecioglu,et al.  Dimensionality reduction and similarity computation by inner-product approximations , 2000, IEEE Transactions on Knowledge and Data Engineering.

[16]  Pascal Paillier,et al.  Public-Key Cryptosystems Based on Composite Degree Residuosity Classes , 1999, EUROCRYPT.

[17]  Omer Egecioglu,et al.  How to Approximate the Inner-product: Fast Dynamic Algorithms for Euclidean Similarity , 1999 .

[18]  Chris Clifton,et al.  Privacy-preserving clustering with distributed EM mixture modeling , 2004, Knowledge and Information Systems.

[19]  Bri-Mathias Hodge,et al.  Wind power forecasting error distributions over multiple timescales , 2011, 2011 IEEE Power and Energy Society General Meeting.

[20]  Craig Gentry,et al.  A fully homomorphic encryption scheme , 2009 .

[21]  A. Fabbri,et al.  Assessment of the cost associated with wind generation prediction errors in a liberalized electricity market , 2005, IEEE Transactions on Power Systems.

[22]  R. G. Harley,et al.  Chance constrained unit commitment with wind generation and superconducting magnetic energy storages , 2012, 2012 IEEE Power and Energy Society General Meeting.

[23]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[24]  Tomoaki Ohtsuki,et al.  Distributed EM Algorithms for Acoustic Source Localization in Sensor Networks , 2006, IEEE Vehicular Technology Conference.

[25]  Feng Liu,et al.  Chance-Constrained Economic Dispatch With Non-Gaussian Correlated Wind Power Uncertainty , 2017, IEEE Transactions on Power Systems.

[26]  Yin Xu,et al.  Risk-Limiting Load Restoration for Resilience Enhancement With Intermittent Energy Resources , 2017, IEEE Transactions on Smart Grid.

[27]  Xiao-Ping Zhang,et al.  A Solution to the Chance-Constrained Two-Stage Stochastic Program for Unit Commitment With Wind Energy Integration , 2016, IEEE Transactions on Power Systems.

[28]  Diffusion scheme of distributed EM algorithm for Gaussian mixtures over random networks , 2009, 2009 IEEE International Conference on Control and Automation.

[29]  Ömer Egecioglu,et al.  Parametric Approximation Algorithms for High-Dimensional Euclidean Similarity , 2001, PKDD.

[30]  Chris Clifton,et al.  Tools for privacy preserving distributed data mining , 2002, SKDD.

[31]  Bikash Pal,et al.  Statistical Representation of Distribution System Loads Using Gaussian Mixture Model , 2010, IEEE Transactions on Power Systems.

[32]  Joydeep Ghosh,et al.  Privacy-preserving distributed clustering using generative models , 2003, Third IEEE International Conference on Data Mining.

[33]  F. Bouffard,et al.  Stochastic security for operations planning with significant wind power generation , 2008, 2008 IEEE Power and Energy Society General Meeting - Conversion and Delivery of Electrical Energy in the 21st Century.

[34]  Madeleine Gibescu,et al.  Gaussian Mixture Based Probabilistic Load Flow For LV-Network Planning , 2017, IEEE Transactions on Power Systems.

[35]  Ronald L. Rivest,et al.  ON DATA BANKS AND PRIVACY HOMOMORPHISMS , 1978 .

[36]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[37]  Feng Liu,et al.  A Conditional Model of Wind Power Forecast Errors and Its Application in Scenario Generation , 2017, 1708.06759.

[38]  Yucui Guo,et al.  Research on Secure Scalar Product Protocol and Its' Application , 2010, 2010 6th International Conference on Wireless Communications Networking and Mobile Computing (WiCOM).

[39]  Jian Xu,et al.  A Versatile Mixture Distribution and Its Application in Economic Dispatch with Multiple Wind Farms , 2017, IEEE Transactions on Sustainable Energy.

[40]  Dongbing Gu,et al.  Distributed EM Algorithm for Gaussian Mixtures in Sensor Networks , 2008, IEEE Transactions on Neural Networks.

[41]  Roberto López-Valcarce,et al.  A Diffusion-Based EM Algorithm for Distributed Estimation in Unreliable Sensor Networks , 2013, IEEE Signal Processing Letters.

[42]  Chen Shen,et al.  An Adjustable Chance-Constrained Approach for Flexible Ramping Capacity Allocation , 2018, IEEE Transactions on Sustainable Energy.

[43]  Chen Shen,et al.  A Distributed Probabilistic Modeling Algorithm for the Aggregated Power Forecast Error of Multiple Newly Built Wind Farms , 2019, IEEE Transactions on Sustainable Energy.

[44]  G. Schwarz Estimating the Dimension of a Model , 1978 .