Metric-based meta-learning model for few-shot fault diagnosis under multiple limited data conditions

Abstract The real-world large industry has gradually become a data-rich environment with the development of information and sensor technology, making the technology of data-driven fault diagnosis acquire a thriving development and application. The success of these advanced methods depends on the assumption that enough labeled samples for each fault type are available. However, in some practical situations, it is extremely difficult to collect enough data, e.g., when the sudden catastrophic failure happens, only a few samples can be acquired before the system shuts down. This phenomenon leads to the few-shot fault diagnosis aiming at distinguishing the failure attribution accurately under very limited data conditions. In this paper, we propose a new approach, called Feature Space Metric-based Meta-learning Model (FSM3), to overcome the challenge of the few-shot fault diagnosis under multiple limited data conditions. Our method is a mixture of general supervised learning and episodic metric meta-learning, which will exploit both the attribute information from individual samples and the similarity information from sample groups. The experiment results demonstrate that our method outperforms a series of baseline methods on the 1-shot and 5-shot learning tasks of bearing and gearbox fault diagnosis across various limited data conditions. The time complexity and implementation difficulty have been analyzed to show that our method has relatively high feasibility. The feature embedding is visualized by t-SNE to investigate the effectiveness of our proposed model.

[1]  Yaguo Lei,et al.  Applications of machine learning to machine fault diagnosis: A review and roadmap , 2020 .

[2]  Tao Zhang,et al.  Deep Model Based Domain Adaptation for Fault Diagnosis , 2017, IEEE Transactions on Industrial Electronics.

[3]  Xiang Li,et al.  Deep Learning-Based Machinery Fault Diagnostics With Domain Adaptation Across Sensors at Different Places , 2020, IEEE Transactions on Industrial Electronics.

[4]  Hang Li,et al.  Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.

[5]  Xiang Li,et al.  Cross-Domain Fault Diagnosis of Rolling Element Bearings Using Deep Generative Neural Networks , 2019, IEEE Transactions on Industrial Electronics.

[6]  Feng Liu,et al.  Triplet Loss Guided Adversarial Domain Adaptation for Bearing Fault Diagnosis , 2020, Sensors.

[7]  Lei Wang,et al.  Revisiting Local Descriptor Based Image-To-Class Measure for Few-Shot Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Yaguo Lei,et al.  Deep Convolutional Transfer Learning Network: A New Method for Intelligent Fault Diagnosis of Machines With Unlabeled Data , 2019, IEEE Transactions on Industrial Electronics.

[9]  Alexandre Lacoste,et al.  TADAM: Task dependent adaptive metric for improved few-shot learning , 2018, NeurIPS.

[10]  Yaguo Lei,et al.  A Polynomial Kernel Induced Distance Metric to Improve Deep Transfer Learning for Fault Diagnosis of Machines , 2020, IEEE Transactions on Industrial Electronics.

[11]  Hugo Larochelle,et al.  Few-Shot Learning , 2020, Computer Vision.

[12]  Gregory R. Koch,et al.  Siamese Neural Networks for One-Shot Image Recognition , 2015 .

[13]  Hugo Larochelle,et al.  Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[14]  Wei Zhang,et al.  A robust intelligent fault diagnosis method for rolling element bearings based on deep distance metric learning , 2018, Neurocomputing.

[15]  Tao Xiang,et al.  Learning to Compare: Relation Network for Few-Shot Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Yannis Avrithis,et al.  Dense Classification and Implanting for Few-Shot Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[18]  Yongsheng Zhu,et al.  A novel model with the ability of few-shot learning and quick updating for intelligent fault diagnosis , 2020 .

[19]  Chao Liu,et al.  Deep Transfer Network with Joint Distribution Adaptation: A New Intelligent Fault Diagnosis Framework for Industry Application , 2018, ISA transactions.

[20]  Bernt Schiele,et al.  Meta-Transfer Learning for Few-Shot Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Xilin Chen,et al.  Cross Attention Network for Few-shot Classification , 2019, NeurIPS.

[22]  Tang Tang,et al.  A simple data augmentation algorithm and a self-adaptive convolutional architecture for few-shot fault diagnosis under different working conditions , 2020 .

[23]  Wei Zhang,et al.  Multi-Layer domain adaptation method for rolling bearing fault diagnosis , 2019, Signal Process..

[24]  Oriol Vinyals,et al.  Matching Networks for One Shot Learning , 2016, NIPS.

[25]  Zhiheng Li,et al.  A Deep Transfer Model With Wasserstein Distance Guided Multi-Adversarial Networks for Bearing Fault Diagnosis Under Different Working Conditions , 2019, IEEE Access.

[26]  Wei Zhang,et al.  A New Deep Learning Model for Fault Diagnosis with Good Anti-Noise and Domain Adaptation Ability on Raw Vibration Signals , 2017, Sensors.

[27]  Tao Zhang,et al.  A hybrid approach with optimization-based and metric-based meta-learner for few-shot learning , 2019, Neurocomputing.

[28]  Ming Zhang,et al.  Domain Adaptation with Multilayer Adversarial Learning for Fault Diagnosis of Gearbox under Multiple Operating Conditions , 2019, 2019 Prognostics and System Health Management Conference (PHM-Qingdao).

[29]  Jun Yan,et al.  Multiscale Convolutional Neural Networks for Fault Diagnosis of Wind Turbine Gearbox , 2019, IEEE Transactions on Industrial Electronics.

[30]  Yuxin Cui,et al.  Limited Data Rolling Bearing Fault Diagnosis With Few-Shot Learning , 2019, IEEE Access.

[31]  Liang Gao,et al.  A New Convolutional Neural Network-Based Data-Driven Fault Diagnosis Method , 2018, IEEE Transactions on Industrial Electronics.

[32]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[33]  Robert X. Gao,et al.  Deep learning and its applications to machine health monitoring , 2019, Mechanical Systems and Signal Processing.