Data-Driven Outage Restoration Time Prediction via Transfer Learning With Cluster Ensembles

This paper develops a data-driven approach to accurately predict the restoration time of outages under different scales and factors. To achieve the goal, the proposed method consists of three stages. First, given the unprecedented amount of data collected by utilities, a sparse dictionary-based ensemble spectral clustering (SDESC) method is proposed to decompose historical outage datasets, which enjoys good computational efficiency and scalability. Specifically, each outage sample is represented by a linear combination of a small number of selected dictionary samples using a density-based method. Then, the dictionary-based representation is utilized to perform the spectral analysis to group the data samples with similar features into the same subsets. In the second stage, a knowledge-transfer-added restoration time prediction model is trained for each subset by combining weather information and outage-related features. The transfer learning technology is introduced with the aim of dealing with the underestimation problem caused by data imbalance in different subsets, thus improving the model performance. Furthermore, to connect unseen outages with the learned outage subsets, a t-distributed stochastic neighbor embedding-based strategy is applied. The proposed method fully builds on and is also tested on a large real-world outage dataset from a utility provider with a time span of six consecutive years. The numerical results validate that our method has high prediction accuracy while showing good stability against real-world data limitations.

[1]  Hui Xiong,et al.  A Comprehensive Survey on Transfer Learning , 2019, Proceedings of the IEEE.

[2]  Haibin Liu,et al.  Statistical Forecasting of Electric Power Restoration Times in Hurricanes and Ice Storms , 2007, IEEE Transactions on Power Systems.

[3]  Xinlei Chen,et al.  Large Scale Spectral Clustering with Landmark-Based Representation , 2011, AAAI.

[4]  Josef Spidlen,et al.  Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets , 2019, Nature Communications.

[5]  Dongpu Cao,et al.  Levenberg–Marquardt Backpropagation Training of Multilayer Neural Networks for State Estimation of a Safety-Critical Cyber-Physical System , 2018, IEEE Transactions on Industrial Informatics.

[6]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Heinrich Jiang,et al.  DBSCAN++: Towards fast and scalable density clustering , 2018, ICML.

[8]  Anmar Arif,et al.  Distribution Network Outage Data Analysis and Repair Time Prediction Using Deep Learning , 2018, 2018 IEEE International Conference on Probabilistic Methods Applied to Power Systems (PMAPS).

[9]  Dorit S. Hochba,et al.  Approximation Algorithms for NP-Hard Problems , 1997, SIGA.

[10]  Alejandro Ribeiro,et al.  Predicting Power Outages Using Graph Neural Networks , 2018, 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[11]  Bruce G. Terrell,et al.  National Oceanic and Atmospheric Administration , 2020, Federal Regulatory Guide.

[12]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[13]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[14]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[15]  Yixin Yin,et al.  Blast furnace condition data clustering based on combination of T-distributed stochastic neighbor embedding and spectral clustering , 2019, 2019 IEEE 15th International Conference on Control and Automation (ICCA).

[16]  A. K. Jain,et al.  Data Clustering : A , 2007 .

[17]  Ritu yadav,et al.  Advanced Methods to Improve Performance of K-Means Algorithm: A Review , 2012 .

[18]  Hao Yu,et al.  Improved Computation for Levenberg–Marquardt Training , 2010, IEEE Transactions on Neural Networks.

[19]  Meng Yue,et al.  A Bayesian Approach-Based Outage Prediction in Electric Utility Systems Using Radar Measurement Data , 2018, IEEE Transactions on Smart Grid.

[20]  Mohammad Rastegar,et al.  Outage Cause Detection in Power Distribution Systems Based on Data Mining , 2021, IEEE Transactions on Industrial Informatics.

[21]  Mo-Yuen Chow,et al.  Time of outage restoration analysis in distribution systems , 1996 .

[22]  Mari Ostendorf,et al.  Real-Time Prediction of the Duration of Distribution System Outages , 2018, IEEE Transactions on Power Systems.

[23]  Mo-Yuen Chow,et al.  Power Distribution Fault Cause Identification With Imbalanced Data Using the Data Mining-Based Fuzzy Classification $E$-Algorithm , 2007, IEEE Transactions on Power Systems.

[24]  Emmanouil N. Anagnostou,et al.  Enhancing Weather-Related Power Outage Prediction by Event Severity Classification , 2020, IEEE Access.

[25]  Gavriel Salomon,et al.  T RANSFER OF LEARNING , 1992 .

[26]  Vipin Kumar,et al.  Chapman & Hall/CRC Data Mining and Knowledge Discovery Series , 2008 .

[27]  Anil Pahwa,et al.  AdaBoost$^{+}$: An Ensemble Learning Approach for Estimating Weather-Related Outages in Distribution Systems , 2014, IEEE Transactions on Power Systems.

[28]  M. Pravin Kumar,et al.  BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM , 2012 .

[29]  A. Pahwa,et al.  Modeling Weather-Related Failures of Overhead Distribution Lines , 2007, 2007 IEEE Power Engineering Society General Meeting.