A deep learning approach for anomaly detection based on SAE and LSTM in mechanical equipment

Anomaly in mechanical systems may cause equipment to break down with serious safety, environment, and economic impact. Since many mechanical equipment usually operates under tough working environments, which makes them vulnerable to types of faults, anomaly detection for mechanical equipment usually requires considerable domain knowledge. However, a common dilemma in many practical applications is that one may not be able to obtain the empirical knowledge about anomaly or the history data is completely unlabelled, which makes conventional fault identification methods not applicable. In order to fill the gap, this paper proposes a novel deep learning–based method for anomaly detection in mechanical equipment by combining two types of deep learning architectures, stacked autoencoders (SAE) and long short-term memory (LSTM) neural networks, to identify anomaly condition in a completely unsupervised manner. The proposed method focuses on the anomaly detection through multiple features sequence when the history data is unlabelled and the empirical knowledge about anomaly is absent. An experiment for anomaly detection in rotary machinery through wavelet packet decomposition (WPD) and data-driven models demonstrates the efficiency and stability of the proposed approach. The method can be divided into two stages: SAE-based multiple features sequence representation and LSTM-based anomaly identification. During the experiment, fivefold cross-validation has been applied to validate the performance and stability of the proposed approach. The results show that the proposed approach could detect anomaly working condition with 99% accuracy under a completely unsupervised learning environment and offer an alternative method to leverage and integrate features for anomaly detection without empirical knowledge.

[1]  Marc'Aurelio Ranzato,et al.  Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[2]  Y. Z. Ayele,et al.  Risk based inspection of offshore topsides static mechanical equipment in Arctic conditions , 2016, 2016 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM).

[3]  Radu-Emil Precup,et al.  An overview on fault diagnosis and nature-inspired optimal control of industrial process applications , 2015, Comput. Ind..

[4]  Mehmet Karaköse,et al.  Anomaly detection using a modified kernel-based tracking in the pantograph-catenary system , 2015, Expert Syst. Appl..

[5]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[6]  Leonidas J. Guibas,et al.  Taskonomy: Disentangling Task Transfer Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Young-Jin Kim,et al.  An architecture for emergency event prediction using LSTM recurrent neural networks , 2018, Expert Syst. Appl..

[8]  Kesheng Wang,et al.  A deep learning driven method for fault classification and degradation assessment in mechanical equipment , 2019, Comput. Ind..

[9]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[10]  Qiao Hu,et al.  Fault diagnosis of rotating machinery based on improved wavelet package transform and SVMs ensemble , 2007 .

[11]  Edwin Lughofer,et al.  Residual-based fault detection using soft computing techniques for condition monitoring at rolling mills , 2014, Inf. Sci..

[12]  Armin Rastbood,et al.  Prediction of structural forces of segmental tunnel lining using FEM based artificial neural network , 2017 .

[13]  K. Pressel,et al.  Condition-based maintenance of mechanical setup in aluminum wire bonding equipment by data mining , 2017 .

[14]  Chen Lu,et al.  Fault diagnosis of rotary machinery components using a stacked denoising autoencoder-based health state identification , 2017, Signal Process..

[15]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[16]  Yong Zhang,et al.  Classification of EEG Signals Based on Autoregressive Model and Wavelet Packet Decomposition , 2017, Neural Processing Letters.

[17]  Michael G. Pecht,et al.  Using cross-validation for model parameter selection of sequential probability ratio test , 2012, Expert Syst. Appl..

[18]  Oliver Niggemann,et al.  Self-Organizing Maps for Anomaly Localization and Predictive Maintenance in Cyber-Physical Production Systems , 2018 .

[19]  Yaguo Lei,et al.  A review on empirical mode decomposition in fault diagnosis of rotating machinery , 2013 .

[20]  Gang Qu,et al.  Modified genetic algorithm-based feature selection combined with pre-trained deep neural network for demand forecasting in outpatient department , 2017, Expert Syst. Appl..

[21]  Victoria M. Catterson,et al.  Diagnosis of tidal turbine vibration data through deep neural networks , 2016 .

[22]  Lina Bertling Tjernberg,et al.  An Artificial Neural Network Approach for Early Fault Detection of Gearbox Bearings , 2015, IEEE Transactions on Smart Grid.

[23]  Jose Antonino-Daviu,et al.  Application of Infrared Thermography to Failure Detection in Industrial Induction Motors: Case Stories , 2017, IEEE Transactions on Industry Applications.

[24]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[25]  Ping Yan,et al.  Research on a configurable method for fault diagnosis knowledge of machine tools and its application , 2018 .

[26]  Yaacob Sazali,et al.  Classification of human emotion from EEG using discrete wavelet transform , 2010 .

[27]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[29]  M. Landry,et al.  An Improved Vibration Analysis Algorithm as a Diagnostic Tool for Detecting Mechanical Anomalies on Power Circuit Breakers , 2008, IEEE Transactions on Power Delivery.

[30]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Arturo Garcia-Perez,et al.  Real-time SVD-based detection of multiple combined faults in induction motors , 2014, Comput. Electr. Eng..

[32]  Robert Babuska,et al.  Railway Track Circuit Fault Diagnosis Using Recurrent Neural Networks , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[33]  Díbio Leandro Borges,et al.  Analysis of mammogram classification using a wavelet transform decomposition , 2003, Pattern Recognit. Lett..

[34]  Nguyen Lu Dang Khoa,et al.  Kernel-based support vector machines for automated health status assessment in monitoring sensor data , 2018 .

[35]  Ruqiang Yan,et al.  Learning to Monitor Machine Health with Convolutional Bi-Directional LSTM Networks , 2017, Sensors.

[36]  James Griffin,et al.  Multiple classification of the force and acceleration signals extracted during multiple machine processes: part 1 intelligent classification from an anomaly perspective , 2017 .

[37]  Klaus-Dieter Thoben,et al.  Current trends on ICT technologies for enterprise information systems , 2016, Comput. Ind..

[38]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[39]  Steven X. Ding,et al.  A Survey of Fault Diagnosis and Fault-Tolerant Techniques—Part I: Fault Diagnosis With Model-Based and Signal-Based Approaches , 2015, IEEE Transactions on Industrial Electronics.

[40]  Carlos León,et al.  Rule-based system to detect energy efficiency anomalies in smart buildings, a data mining approach , 2016, Expert Syst. Appl..

[41]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[42]  Guoyu Meng,et al.  Vibration signal analysis using parameterized time–frequency method for features extraction of varying-speed rotary machinery , 2015 .

[43]  Qian Chen,et al.  A novel method for feature extraction using crossover characteristics of nonlinear data and its application to fault diagnosis of rotary machinery , 2014 .

[44]  Plamen P. Angelov,et al.  Fully unsupervised fault detection and identification based on recursive density estimation and self-evolving cloud-based classifier , 2015, Neurocomputing.

[45]  Tarun Gupta,et al.  A research study on unsupervised machine learning algorithms for early fault detection in predictive maintenance , 2018, 2018 5th International Conference on Industrial Engineering and Applications (ICIEA).

[46]  Yi Wang,et al.  A data-driven method based on deep belief networks for backlash error prediction in machining centers , 2020, J. Intell. Manuf..