Unsupervised deep representation learning for motor fault diagnosis by mutual information maximization

Data-driven deep learning technology has gained many achievements in the field of motor fault diagnosis and prognostics. However, the application objects of those previous studies are commonly limited to the faulty data sharing the similar distribution under unvarying stable working condition. Unfortunately, this limitation is nearly invalid in the real-world scenario, where the working condition is complicated and changes invariably, resulting in the unfavourable situation that the deep representation learning methods of the previous studies always fail in extracting the effective representations for fault diagnosis in real applications. To tackle this issue, inspired by f -divergence estimation, this work takes a different route and proposes an unsupervised deep representation learning approach, named Deep Mutual Information Maximization (DMIM), using variational divergence estimation approach to maximize mutual information (MI) between the input and output of a deep neural network. Meanwhile the representation distribution is automatically tuned by matching to a prior distribution with the same philosophy of Variational Autoencoder. Opposite to previous works which learn representations basically with supervised feedback regulation or unsupervised reconstruction, the proposed unsupervised MI maximization framework aims to make representational characteristics like independence play a bigger role to capture the most unique representations. To verify the effectiveness of our proposal, faulty motor data from the motor tests under European driving cycle for simulating the real working scenario, are collected for validation. It turns out that DMIM outperforms many popular unsupervised and fully-supervised learning methods. It opens new avenues for unsupervised learning of representations for motor fault diagnosis.

[1]  Gaoliang Peng,et al.  A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load , 2018, Mechanical Systems and Signal Processing.

[2]  Chao Liu,et al.  Deep Transfer Network with Joint Distribution Adaptation: A New Intelligent Fault Diagnosis Framework for Industry Application , 2018, ISA transactions.

[3]  Qiang Zhou,et al.  A hybrid fault diagnosis method for mechanical components based on ontology and signal analysis , 2019, J. Intell. Manuf..

[4]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Wei Zhang,et al.  Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation , 2018, Journal of Intelligent Manufacturing.

[6]  Jun He,et al.  Unsupervised Fault Diagnosis of a Gear Transmission Chain Using a Deep Belief Network , 2017, Sensors.

[7]  Martin J. Wainwright,et al.  Estimating Divergence Functionals and the Likelihood Ratio by Convex Risk Minimization , 2008, IEEE Transactions on Information Theory.

[8]  Liang Guo,et al.  A recurrent neural network based health indicator for remaining useful life prediction of bearings , 2017, Neurocomputing.

[9]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[10]  Chengjin Qin,et al.  Fault Diagnosis of Induction Motors Using Recurrence Quantification Analysis and LSTM with Weighted BN , 2019, Shock and Vibration.

[11]  Xiao Li,et al.  Composite Fault Diagnosis for Rotating Machinery of Large Units Based on Evidence Theory and Multi-Information Fusion , 2019, Shock and Vibration.

[12]  Zhibin Zhao,et al.  Sparse Deep Stacking Network for Fault Diagnosis of Motor , 2018, IEEE Transactions on Industrial Informatics.

[13]  S. M. Ali,et al.  A General Class of Coefficients of Divergence of One Distribution from Another , 1966 .

[14]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[15]  Xinyu Li,et al.  A zero-shot learning method for fault diagnosis under unknown working loads , 2019, Journal of Intelligent Manufacturing.

[16]  Pieter Abbeel,et al.  InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[17]  Wei Jiang,et al.  Unsupervised fault diagnosis of rolling bearings using a deep neural network based on generative adversarial networks , 2018, Neurocomputing.

[18]  Myeongsu Kang,et al.  Deep Residual Networks With Dynamically Weighted Wavelet Coefficients for Fault Diagnosis of Planetary Gearboxes , 2018, IEEE Transactions on Industrial Electronics.

[19]  Haidong Shao,et al.  Rolling bearing fault feature learning using improved convolutional deep belief network with compressed sensing , 2018 .

[20]  Sebastian Nowozin,et al.  f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization , 2016, NIPS.

[21]  Robert X. Gao,et al.  Deep learning and its applications to machine health monitoring , 2019, Mechanical Systems and Signal Processing.

[22]  Chengjin Qin,et al.  A pre-generated matrix-based method for real-time robotic drilling chatter monitoring , 2019 .

[23]  Li Chen,et al.  Design and Analysis of an Electrical Variable Transmission for a Series–Parallel Hybrid Electric Vehicle , 2011, IEEE Transactions on Vehicular Technology.

[24]  Cong Wang,et al.  A supervised sparsity-based wavelet feature for bearing fault diagnosis , 2019, J. Intell. Manuf..

[25]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[26]  Guy Marchal,et al.  Multimodality image registration by maximization of mutual information , 1997, IEEE Transactions on Medical Imaging.

[27]  Mohamed El Hachemi Benbouzid A review of induction motors signature analysis as a medium for faults detection , 2000, IEEE Trans. Ind. Electron..

[28]  Roger B. Grosse,et al.  Isolating Sources of Disentanglement in Variational Autoencoders , 2018, NeurIPS.

[29]  Huijun Gao,et al.  Data-Based Techniques Focused on Modern Industry: An Overview , 2015, IEEE Transactions on Industrial Electronics.

[30]  Liang Guo,et al.  Machinery health indicator construction based on convolutional neural networks considering trend burr , 2018, Neurocomputing.

[31]  Chengjin Qin,et al.  Timely chatter identification for robotic drilling using a local maximum synchrosqueezing-based method , 2019, J. Intell. Manuf..

[32]  Farid Najafi,et al.  Application of a new EWT-based denoising technique in bearing fault diagnosis , 2019, Measurement.