Handling Imbalanced Datasets for Robust Deep Neural Network-Based Fault Detection in Manufacturing Systems

Over the recent years, Industry 4.0 (I4.0) technologies such as the Industrial Internet of Things (IIoT), Artificial Intelligence (AI), and the presence of Industrial Big Data (IBD) have helped achieve intelligent Fault Detection (FD) in manufacturing. Notably, data-driven approaches in FD apply Deep Learning (DL) techniques to help generate insights required for monitoring complex manufacturing processes. However, due to the ratio of instances where actual faults occur, FD datasets tend to be imbalanced, leading to training challenges that result in inefficient DL-based FD models. In this paper, we propose Dual Logits Weights Perturbation (DLWP) loss, a method featuring weight vectors for improved dataset generalization in FD systems. The weight vectors act as hyperparameters adjusted on a case-by-case basis to regulate focus accorded to individual minority classes during training. In particular, our proposed method is suitable for imbalanced datasets from safety-related FD tasks as it generates DL models that minimize false negatives. Subsequently, we integrate human experts into the workflow as a strategy to help safeguard the system. A subset of the results, model predictions with uncertainties exceeding a preset threshold, are considered a preliminary output subject to cross-checking by human experts. We demonstrate that DLWP achieves improved Recall, AUC, F1 scores.

[1]  Jialin Liu,et al.  Fault diagnosis using contribution plots without smearing effect on non-faulty variables , 2012 .

[2]  Bo Lu,et al.  Big Data Analytics in Chemical Engineering. , 2017, Annual review of chemical and biomolecular engineering.

[3]  Hong Wang,et al.  Data Driven Fault Diagnosis and Fault Tolerant Control: Some Advances and Possible New Directions: Data Driven Fault Diagnosis and Fault Tolerant Control: Some Advances and Possible New Directions , 2009 .

[4]  Paul M. Frank,et al.  Fuzzy logic and neural network applications to fault diagnosis , 1997, Int. J. Approx. Reason..

[5]  Alexandra Brintrup,et al.  Learning With Imbalanced Data in Smart Manufacturing: A Comparative Analysis , 2021, IEEE Access.

[6]  Mohd Azlan Hussain,et al.  Fault diagnosis of Tennessee Eastman process with multi- scale PCA and ANFIS , 2013 .

[7]  Tanzila Saba,et al.  Brain tumor segmentation in multi‐spectral MRI using convolutional neural networks (CNN) , 2018, Microscopy research and technique.

[8]  H. Scott Matthews,et al.  Smart Everything: Will Intelligent Systems Reduce Resource Use? , 2013 .

[9]  A. Kouadri,et al.  An improved plant‐wide fault detection scheme based on PCA and adaptive threshold for reliable process monitoring: Application on the new revised model of Tennessee Eastman process , 2018 .

[10]  Iqbal H. Sarker Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions , 2021, SN Comput. Sci..

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  M. Buscema MetaNet: the theory of independent judges. , 1998, Substance use & misuse.

[13]  José Manuel Benítez,et al.  Fault detection based on time series modeling and multivariate statistical process control , 2018, Chemometrics and Intelligent Laboratory Systems.

[14]  Muhammad Saqlain,et al.  A Deep Convolutional Neural Network for Wafer Defect Identification on an Imbalanced Dataset in Semiconductor Manufacturing Processes , 2020, IEEE Transactions on Semiconductor Manufacturing.

[15]  Seokgoo Kim,et al.  Transfer Learning-Based Fault Diagnosis under Data Deficiency , 2020 .

[16]  Charless C. Fowlkes,et al.  Do We Need More Training Data? , 2015, International Journal of Computer Vision.

[17]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[18]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[19]  Raghunathan Rengaswamy,et al.  A review of process fault detection and diagnosis: Part III: Process history based methods , 2003, Comput. Chem. Eng..

[20]  Raghunathan Rengaswamy,et al.  A review of process fault detection and diagnosis: Part II: Qualitative models and search strategies , 2003, Comput. Chem. Eng..

[21]  Ruimao Zhang,et al.  Cost-Effective Active Learning for Deep Image Classification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Chen Huang,et al.  Deep Imbalanced Learning for Face Recognition and Attribute Prediction , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Chang Ouk Kim,et al.  A Deep Learning Model for Robust Wafer Fault Monitoring With Sensor Measurement Noise , 2017, IEEE Transactions on Semiconductor Manufacturing.

[24]  Witold Pedrycz,et al.  Dual autoencoders features for imbalance classification problem , 2016, Pattern Recognit..

[25]  Hongbing Liu,et al.  Improving undersampling-based ensemble with rotation forest for imbalanced problem , 2019, TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES.

[26]  Chang Ouk Kim,et al.  A Convolutional Neural Network for Fault Classification and Diagnosis in Semiconductor Manufacturing Processes , 2017, IEEE Transactions on Semiconductor Manufacturing.

[27]  Shu-Kai S. Fan,et al.  A Review on Fault Detection and Process Diagnostics in Industrial Processes , 2020, Processes.

[28]  Huilan Jiang,et al.  Towards Robustness in Neural Network Based Fault Diagnosis , 2008, Int. J. Appl. Math. Comput. Sci..

[29]  Young Chul Lee,et al.  Fault detection based on one-class deep learning for manufacturing applications limited to an imbalanced database , 2020 .

[30]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[31]  Wooyeol Choi,et al.  Leveraging Uncertainties in Softmax Decision-Making Models for Low-Power IoT Devices , 2020, Sensors.

[32]  Eunseo Oh,et al.  An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network , 2020, Symmetry.

[33]  Klaus-Dieter Thoben,et al.  "Industrie 4.0" and Smart Manufacturing - A Review of Research Issues and Application Examples , 2017, Int. J. Autom. Technol..

[34]  Raghunathan Rengaswamy,et al.  A review of process fault detection and diagnosis: Part I: Quantitative model-based methods , 2003, Comput. Chem. Eng..

[35]  Donghua Zhou,et al.  Recursive transformed component statistical analysis for incipient fault detection , 2017, Autom..

[36]  Jianhua Ma,et al.  Variational LSTM Enhanced Anomaly Detection for Industrial Big Data , 2021, IEEE Transactions on Industrial Informatics.

[37]  Klaus-Dieter Thoben,et al.  Machine learning in manufacturing: advantages, challenges, and applications , 2016 .

[38]  Dawn M. Tilbury Cyber-Physical Manufacturing Systems , 2019 .

[39]  Si-Zhao Joe Qin,et al.  Survey on data-driven industrial process monitoring and diagnosis , 2012, Annu. Rev. Control..

[40]  K. Khorasani,et al.  Fault detection and isolation of gas turbine engines using a bank of neural networks , 2015 .

[41]  Atsuto Maki,et al.  A systematic study of the class imbalance problem in convolutional neural networks , 2017, Neural Networks.