Learning deep representation of imbalanced SCADA data for fault detection of wind turbines

Abstract Numerous intelligent fault diagnosis models have been developed on supervisory control and data acquisition (SCADA) systems of wind turbines, so as to process massive SCADA data effectively and accurately. However, there is a problem ignored among these studies. That is, SCADA data distribution is imbalanced and the anomalous data mining is not sufficient. The amount of normal data is much more than that of abnormal data, which makes these models tend to be biased toward majority class, i.e., normal data, while accuracy of diagnosing fault is poor. Aimed at overcoming this problem, a novel intelligent fault diagnosis methodology is proposed based on exquisitely designed deep neural networks. The between-classes imbalance problem is handled by learning deep representation that can preserve within-class information and between-classes information based on triplet loss. The proposed method encourages a pair of data belonging to same class to be projected onto points as close as possible in new embedding space. It tries to enforce a margin between different class data. The effectiveness and generalization of the proposed method are validated on the SCADA data of two wind turbines containing blades icing accretion fault. The result demonstrates the proposed method outperforms the traditional normal behavior modeling method.

[1]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Miguel A. Sanz-Bobi,et al.  SIMAP: Intelligent System for Predictive Maintenance: Application to the health condition monitoring of a windturbine gearbox , 2006, Comput. Ind..

[4]  Honglak Lee,et al.  Unsupervised feature learning for audio classification using convolutional deep belief networks , 2009, NIPS.

[5]  Shuhui Li,et al.  Using neural networks to estimate wind turbine power generation , 2001 .

[6]  Y. Zhang,et al.  Improved Wind Speed Prediction Using Empirical Mode Decomposition , 2018 .

[7]  Kai Ming Ting,et al.  An Instance-weighting Method to Induce Cost-sensitive Trees , 2001 .

[8]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Xin Zhou,et al.  Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data , 2016 .

[10]  Shuang Gao,et al.  Wind speed prediction with RBF neural network based on PCA and ICA , 2018 .

[11]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[12]  Simon J. Watson,et al.  Using SCADA data for wind turbine condition monitoring – a review , 2017 .

[13]  David Infield,et al.  Online wind turbine fault detection through automated SCADA data analysis , 2009 .

[14]  Hui Han,et al.  Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning , 2005, ICIC.

[15]  Tomasz Maciejewski,et al.  Local neighbourhood extension of SMOTE for mining imbalanced data , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[18]  Zhenyu Wu,et al.  An Integrated Ensemble Learning Model for Imbalanced Fault Diagnostics and Prognostics , 2018, IEEE Access.

[19]  Gaoliang Peng,et al.  A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load , 2018, Mechanical Systems and Signal Processing.

[20]  Sofiane Achiche,et al.  Wind turbine condition monitoring based on SCADA data using normal behavior models. Part 1: System description , 2013, Appl. Soft Comput..

[21]  Andrew Kusiak,et al.  The prediction and diagnosis of wind turbine faults , 2011 .

[22]  Taghi M. Khoshgoftaar,et al.  Improving Software-Quality Predictions With Data Sampling and Boosting , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[23]  Zhengding Qiu,et al.  The effect of imbalanced data sets on LDA: A theoretical and empirical analysis , 2007, Pattern Recognit..

[24]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[25]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[26]  Lina Bertling Tjernberg,et al.  An Artificial Neural Network Approach for Early Fault Detection of Gearbox Bearings , 2015, IEEE Transactions on Smart Grid.

[27]  Lihui Wang,et al.  Imbalanced data fault diagnosis of rotating machinery using synthetic oversampling and feature learning , 2018, Journal of Manufacturing Systems.

[28]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Chih-Fong Tsai,et al.  Clustering-based undersampling in class-imbalanced data , 2017, Inf. Sci..

[30]  M. Schlechtingen,et al.  Using Data-Mining Approaches for Wind Turbine Power Curve Monitoring: A Comparative Study , 2013, IEEE Transactions on Sustainable Energy.

[31]  N Chawla,et al.  SMOTEBoost:ブースティングにおけるマイノリティクラスの予測改善(原標題は英語) , 2003 .

[32]  Per Johan Nicklasson,et al.  Ice sensors for wind turbines , 2006 .

[33]  Fausto Pedro García Márquez,et al.  Ice detection using thermal infrared radiometry on wind turbine blades , 2016 .

[34]  Nitesh V. Chawla,et al.  SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[35]  Cong Wang,et al.  Construction of hierarchical diagnosis network based on deep learning and its application in the fault pattern recognition of rolling element bearings , 2016 .