Enhanced Random Forest With Concurrent Analysis of Static and Dynamic Nodes for Industrial Fault Classification

In recent years, machine learning algorithms have been successfully applied to industrial processes. However, the concurrent analysis of static and dynamic representations has not been comprehensively addressed for industrial process fault classification. In this paper, an enhanced random forest algorithm with a concurrent analysis of static and dynamic nodes is proposed to address this issue for fault classification. First, the standard slow feature analysis is modified by designing a new slowness index that is more suitable for a supervised fault classification problem. Second, a feature ranking process is conducted to determine the significant features. These features, which substitute the raw variables in the nodes, are used to build the enhanced random forest. Using this scheme, the significant static and dynamic nodes are selected to enhance the discriminative ability and interpretation. Additionally, the slow features that are uncorrelated are more suitable for training the forest than the initial correlated variables, and the dynamic characteristics of industrial processes are thus comprehensively addressed. The application of the proposed method to fault classification is evaluated by both the Tennessee Eastman benchmark and a real-world three-phase flow process. The experimental results show that the proposed method outperforms the traditional learning algorithms with remarkable accuracy and F1 score that both exceed 70% for the 16-class Tennessee Eastman process and exceed 99% for the 4-class three-phase flow process. The selected significant features reveal that both the static and dynamic information play important roles in fault classification.

[1]  In-So Kweon,et al.  Ambiguous Surface Defect Image Classification of AMOLED Displays in Smartphones , 2016, IEEE Transactions on Industrial Informatics.

[2]  Christos Georgakis,et al.  Disturbance detection and isolation by dynamic principal component analysis , 1995 .

[3]  Yi Cao,et al.  Nonlinear Dynamic Process Monitoring Using Canonical Variate Analysis and Kernel Density Estimations , 2009 .

[4]  Shen Yin,et al.  Recent Advances in Key-Performance-Indicator Oriented Prognosis and Diagnosis With a MATLAB Toolbox: DB-KIT , 2019, IEEE Transactions on Industrial Informatics.

[5]  Lei Huang,et al.  Bayesian Networks in Fault Diagnosis , 2017, IEEE Transactions on Industrial Informatics.

[6]  Chunhui Zhao,et al.  A full‐condition monitoring method for nonstationary dynamic chemical processes with cointegration and slow feature analysis , 2018 .

[7]  Qinghua Zhang,et al.  An Information Fusion Fault Diagnosis Method Based on Dimensionless Indicators With Static Discounting Factor and KNN , 2016, IEEE Sensors Journal.

[8]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .

[9]  Stephen V. Stehman,et al.  Selecting and interpreting measures of thematic classification accuracy , 1997 .

[10]  Chunhui Zhao,et al.  Fault Subspace Selection Approach Combined With Analysis of Relative Changes for Reconstruction Modeling and Multifault Diagnosis , 2016, IEEE Transactions on Control Systems Technology.

[11]  Bo-Suk Yang,et al.  Random forests classifier for machine fault diagnosis , 2008 .

[12]  Yi Cao,et al.  Canonical Variate Dissimilarity Analysis for Process Incipient Fault Detection , 2018, IEEE Transactions on Industrial Informatics.

[13]  Okyay Kaynak,et al.  Data-Driven Monitoring and Safety Control of Industrial Cyber-Physical Systems: Basics and Beyond , 2018, IEEE Access.

[14]  Chunhui Zhao,et al.  Sparse Exponential Discriminant Analysis and Its Application to Fault Diagnosis , 2018, IEEE Transactions on Industrial Electronics.

[15]  Junghui Chen,et al.  On-line batch process monitoring using dynamic PCA and dynamic PLS models , 2002 .

[16]  Fei Liu,et al.  Fault Detection and Diagnosis of Multiple-Model Systems With Mismodeled Transition Probabilities , 2015, IEEE Transactions on Industrial Electronics.

[17]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[18]  Jie Yu,et al.  A novel dynamic bayesian network‐based networked process monitoring approach for fault detection, propagation identification, and root cause diagnosis , 2013 .

[19]  Chunhui Zhao,et al.  Online Fault Diagnosis in Industrial Processes Using Multimodel Exponential Discriminant Analysis Algorithm , 2019, IEEE Transactions on Control Systems Technology.

[20]  Pieter J. Mosterman,et al.  Monitoring, Prediction, and Fault Isolation in Dynamic Physical Systems , 1997, AAAI/IAAI.

[21]  Hao Wu,et al.  Deep convolutional neural network model based chemical process fault diagnosis , 2018, Comput. Chem. Eng..

[22]  Shen Yin,et al.  Recursive Total Principle Component Regression Based Fault Detection and Its Application to Vehicular Cyber-Physical Systems , 2018, IEEE Transactions on Industrial Informatics.

[23]  Arthur K. Kordon,et al.  Fault diagnosis based on Fisher discriminant analysis and support vector machines , 2004, Comput. Chem. Eng..

[24]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[25]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[26]  Guang Wang,et al.  Quality-Related Fault Detection Approach Based on Orthogonal Signal Correction and Modified PLS , 2015, IEEE Transactions on Industrial Informatics.

[27]  Chunhui Zhao,et al.  Dynamic Distributed Monitoring Strategy for Large-Scale Nonstationary Processes Subject to Frequently Varying Conditions Under Closed-Loop Control , 2019, IEEE Transactions on Industrial Electronics.

[28]  Steven X. Ding,et al.  Model-based Fault Diagnosis Techniques: Design Schemes, Algorithms, and Tools , 2008 .

[29]  Chunhui Zhao,et al.  Simultaneous Static and Dynamic Analysis for Fine-Scale Identification of Process Operation Statuses , 2019, IEEE Transactions on Industrial Informatics.

[30]  Shane Strasser,et al.  Diagnostic alarm sequence maturation in timed failure propagation graphs , 2011, 2011 IEEE AUTOTESTCON.

[31]  Dexian Huang,et al.  Slow feature analysis for monitoring and diagnosis of control performance , 2016 .

[32]  Richard D. Braatz,et al.  Fault Detection and Diagnosis in Industrial Systems , 2001 .

[33]  Sheng Chen,et al.  Nonlinear Process Fault Diagnosis Based on Serial Principal Component Analysis , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Fang Deng,et al.  Sensor Multifault Diagnosis With Improved Support Vector Machines , 2017, IEEE Transactions on Automation Science and Engineering.

[35]  Cristobal Ruiz-Carcel,et al.  Statistical process monitoring of a multiphase flow facility , 2015 .

[36]  Qiang Yu,et al.  Random Forest Classifier for Zero-Shot Learning Based on Relative Attribute , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[38]  Chunhui Zhao,et al.  A nested-loop Fisher discriminant analysis algorithm , 2015 .

[39]  Hongbo Shi,et al.  Fault Detection and Classification Using Quality-Supervised Double-Layer Method , 2018, IEEE Transactions on Industrial Electronics.

[40]  Chunhui Zhao,et al.  Recursive Exponential Slow Feature Analysis for Fine-Scale Adaptive Processes Monitoring With Comprehensive Operation Status Identification , 2019, IEEE Transactions on Industrial Informatics.

[41]  Chunhui Zhao,et al.  Slow-Feature-Analysis-Based Batch Process Monitoring With Comprehensive Interpretation of Operation Condition Deviation and Dynamic Anomaly , 2019, IEEE Transactions on Industrial Electronics.

[42]  Sergey A. Shevchik,et al.  Prediction of Failure in Lubricated Surfaces Using Acoustic Time–Frequency Features and Random Forest Algorithm , 2017, IEEE Transactions on Industrial Informatics.

[43]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[44]  Johan A. K. Suykens,et al.  Concurrent monitoring of operating condition deviations and process dynamics anomalies with slow feature analysis , 2015 .

[45]  Diego Cabrera,et al.  Fault diagnosis in spur gears based on genetic algorithm and random forest , 2016 .

[46]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[47]  Hong Zhou,et al.  Decentralized Fault Diagnosis of Large-Scale Processes Using Multiblock Kernel Partial Least Squares , 2010, IEEE Transactions on Industrial Informatics.

[48]  Xin Gao,et al.  An improved SVM integrated GS-PCA fault diagnosis approach of Tennessee Eastman process , 2016, Neurocomputing.

[49]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[51]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[52]  Chunhui Zhao,et al.  Linearity Evaluation and Variable Subset Partition Based Hierarchical Process Modeling and Monitoring , 2018, IEEE Transactions on Industrial Electronics.

[53]  Chonghun Han,et al.  Fault Detection and Operation Mode Identification Based on Pattern Classification with Variable Selection , 2004 .

[54]  Gilles Louppe,et al.  Understanding Random Forests: From Theory to Practice , 2014, 1407.7502.