Combining Unsupervised and Supervised Learning for Asset Class Failure Prediction in Power Systems

In power systems, an asset class is a group of power equipment that has the same function and shares similar electrical and/or mechanical characteristics. Predicting failures for different asset classes is critical for electric utilities in developing cost-effective asset management strategies. Previously, the Weibull distribution based on physical age has been widely used for failure prediction. However, this mathematical model cannot incorporate asset condition data, so its predictions are neither specific nor accurate for individual assets. To solve this problem, this paper proposes a novel and comprehensive data-driven approach based on asset condition data: K-means clustering, an unsupervised learning method, is used to analyze the inner structure of historical asset condition data and produce asset conditional ages; logistic regression, a supervised learning method, takes both asset physical ages and conditional ages as inputs to classify and predict asset operation statuses. Furthermore, an index called the average aging rate is defined to quantify, track, and estimate the relationship between asset physical age and conditional age. The approach was applied to a medium-voltage cable class in an urban distribution system in Western Canada. Case studies and a comparison with the standard Weibull distribution are provided. The proposed approach achieves a higher F1-score than the Weibull distribution method for asset class failure prediction.
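As a rough illustration of the baseline and the metric mentioned above, the sketch below scores a physical-age-only Weibull predictor with the F1-score. It is not the paper's implementation: the function names, the shape and scale parameters, the 0.5 decision threshold, and the six sample assets are all hypothetical values chosen for the example.

```python
import math


def weibull_cdf(age, shape, scale):
    """Probability of failure by `age` under a two-parameter Weibull model."""
    return 1.0 - math.exp(-((age / scale) ** shape))


def f1_score(actual, predicted):
    """Harmonic mean of precision and recall for binary labels (1 = failed)."""
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical assets: physical age in years and observed status (1 = failed).
ages = [5, 12, 20, 28, 35, 40]
failed = [0, 0, 1, 0, 1, 1]

# Assumed Weibull parameters and threshold, for illustration only.
predicted = [1 if weibull_cdf(a, shape=3.0, scale=30.0) >= 0.5 else 0 for a in ages]
score = f1_score(failed, predicted)
print(predicted, round(score, 3))
```

Because this baseline sees only physical age, it cannot flag the 20-year-old asset that failed early or clear the 28-year-old asset that is still in service; that is the kind of individual-asset error that condition data is meant to reduce.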
