Generalized grouped contributions for hierarchical fault diagnosis with group Lasso

Abstract In process industries, it is necessary to conduct fault diagnosis after abnormality is found, with the aim to identify root cause variables and further provide instructive information for maintenance. Contribution plots along with multivariate statistical process monitoring are standard tools towards this goal, which, however, suffer from the smearing effect and high diagnostic complexity on large-scale processes. In fact, process variables tend to be naturally grouped, and in this work, a novel fault identification strategy based on group Lasso penalty along with a hierarchical fault diagnosis scheme is proposed by leveraging group information among variables. By introducing the group Lasso as a regularization approach, groups of irrelevant variables tend to yield exactly zero contributions collectively, which help find the exact root cause, alleviate the smearing effect, and furnish clear diagnostic information for process practitioners. For online computational convenience, an efficient numerical solution strategy is also presented. Besides, it turns out that the proposed approach also applies to dynamic monitoring models with lagged measurements augmented, thereby enjoying widespread generality. Its effectiveness is evaluated on both the Tennessee Eastman benchmark process and a pilot-scale experiment apparatus.

[1]  Jialin Liu,et al.  Fault diagnosis using contribution plots without smearing effect on non-faulty variables , 2012 .

[2]  S. Joe Qin,et al.  Process data analytics in the era of big data , 2014 .

[3]  Christos Georgakis,et al.  Disturbance detection and isolation by dynamic principal component analysis , 1995 .

[4]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[5]  Tao Chen,et al.  A branch and bound method for isolation of faulty variables through missing variable analysis , 2010 .

[6]  Tao Chen,et al.  Reconstruction-based multivariate contribution analysis for fault isolation: A branch and bound approach , 2012 .

[7]  Ping Zhang,et al.  A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process , 2012 .

[8]  John F. MacGregor,et al.  Process monitoring and diagnosis by multiblock PLS methods , 1994 .

[9]  Lei Xie,et al.  Structured Joint Sparse Principal Component Analysis for Fault Detection and Isolation , 2019, 2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS).

[10]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[11]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[12]  Biao Huang,et al.  Review and Perspectives of Data-Driven Distributed Monitoring for Industrial Plant-Wide Processes , 2019, Industrial & Engineering Chemistry Research.

[13]  Fan Yang,et al.  Recursive Slow Feature Analysis for Adaptive Monitoring of Industrial Processes , 2018, IEEE Transactions on Industrial Electronics.

[14]  Henrik Ohlsson,et al.  Scalable anomaly detection in large homogeneous populations , 2013, Autom..

[15]  Si-Zhao Joe Qin,et al.  Survey on data-driven industrial process monitoring and diagnosis , 2012, Annu. Rev. Control..

[16]  Qiang Yang,et al.  Identifying main effects and epistatic interactions from large-scale SNP data via adaptive group Lasso , 2010, BMC Bioinformatics.

[17]  P. Miller,et al.  Contribution plots: a missing link in multivariate quality control , 1998 .

[18]  Gitta Kutyniok,et al.  1 . 2 Sparsity : A Reasonable Assumption ? , 2012 .

[19]  Hong Zhou,et al.  Decentralized Fault Diagnosis of Large-Scale Processes Using Multiblock Kernel Partial Least Squares , 2010, IEEE Transactions on Industrial Informatics.

[20]  Zhibin Zhao,et al.  Enhanced Sparse Period-Group Lasso for Bearing Fault Diagnosis , 2019, IEEE Transactions on Industrial Electronics.

[21]  Okyay Kaynak,et al.  Big Data for Modern Industry: Challenges and Trends [Point of View] , 2015, Proc. IEEE.

[22]  Johan A. K. Suykens,et al.  Concurrent monitoring of operating condition deviations and process dynamics anomalies with slow feature analysis , 2015 .

[23]  Yi Yang,et al.  A fast unified algorithm for solving group-lasso penalize learning problems , 2014, Statistics and Computing.

[24]  S. Joe Qin,et al.  Analysis and generalization of fault diagnosis methods for process monitoring , 2011 .

[25]  Ming Liu,et al.  Bearings fault diagnosis based on adaptive local iterative filtering–multiscale permutation entropy and multinomial logistic model with group-lasso , 2019 .

[26]  Yuan Yao,et al.  Variable selection method for fault isolation using least absolute shrinkage and selection operator (LASSO) , 2015 .

[27]  Christos Georgakis,et al.  Plant-wide control of the Tennessee Eastman problem , 1995 .

[28]  ChangKyoo Yoo,et al.  Statistical monitoring of dynamic processes based on dynamic independent component analysis , 2004 .

[29]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .

[30]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[31]  Donghua Zhou,et al.  Exponential Smoothing Reconstruction Approach for Incipient Fault Isolation , 2018 .

[32]  Jialin Liu,et al.  Bayesian filtering of the smearing effect: Fault isolation in chemical process monitoring , 2014 .

[33]  Tianyou Chai,et al.  Fault diagnosis of continuous annealing processes using a reconstruction-based method , 2012 .

[34]  Haiyong Zheng,et al.  Extracting Dissimilarity of Slow Feature Analysis between Normal and Different Faults for Monitoring Process Status and Fault Diagnosis , 2019 .

[35]  Fan Yang,et al.  Capturing Connectivity and Causality in Complex Industrial Processes , 2014 .

[36]  Michael J. Piovoso,et al.  On unifying multiblock analysis with application to decentralized process monitoring , 2001 .

[37]  Dexian Huang,et al.  Probabilistic slow feature analysis‐based representation learning from massive process data for soft sensor modeling , 2015 .

[38]  Zhiqiang Ge,et al.  Data Mining and Analytics in the Process Industry: The Role of Machine Learning , 2017, IEEE Access.

[39]  Chunhui Zhao,et al.  Comprehensive process decomposition for closed-loop process monitoring with quality-relevant slow feature analysis , 2019, Journal of Process Control.

[40]  Xuefeng Yan,et al.  Multiblock Independent Component Analysis Integrated with Hellinger Distance and Bayesian Inference for Non-Gaussian Plant-Wide Process Monitoring , 2015 .

[41]  Leo H. Chiang,et al.  Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis , 2000 .

[42]  Chunhui Zhao,et al.  A Sparse Reconstruction Strategy for Online Fault Diagnosis in Nonstationary Processes with No a Priori Fault Information , 2017 .

[43]  ChangKyoo Yoo,et al.  Statistical process monitoring with independent component analysis , 2004 .

[44]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[45]  Age K. Smilde,et al.  Generalized contribution plots in multivariate statistical process monitoring , 2000 .

[46]  Ali Abur,et al.  Fault Location Using Wide-Area Measurements and Sparse Estimation , 2016, IEEE Transactions on Power Systems.

[47]  Hao Ye,et al.  Weighted Reconstruction-Based Contribution for Improved Fault Diagnosis , 2013 .

[48]  Biao Huang,et al.  Identification of robust probabilistic slow feature regression model for process data contaminated with outliers , 2018 .

[49]  Ali Cinar,et al.  Monitoring, fault diagnosis, fault-tolerant control and optimization: Data driven methods , 2012, Comput. Chem. Eng..

[50]  Dexian Huang,et al.  Detecting and isolating plant-wide oscillations via slow feature analysis , 2015, 2015 American Control Conference (ACC).

[51]  S. Joe Qin,et al.  Subspace approach to multidimensional fault identification and reconstruction , 1998 .

[52]  Chang-Soo Kim,et al.  Efficient Process Monitoring via the Integrated Use of Markov Random Fields Learning and the Graphical Lasso , 2018 .

[53]  Chao Shang Dynamic Modeling of Complex Industrial Processes: Data-driven Methods and Application Research , 2018 .

[54]  Dexian Huang,et al.  Slow feature analysis for monitoring and diagnosis of control performance , 2016 .

[55]  Ming Yan,et al.  Non-convex Penalties with Analytical Solutions for One-bit Compressive Sensing , 2017, Signal Process..

[56]  Xuefeng Yan,et al.  Nonlinear plant-wide process monitoring using MI-spectral clustering and Bayesian inference-based multiblock KPCA , 2015 .

[57]  Volker Roth,et al.  The Group-Lasso for generalized linear models: uniqueness of solutions and efficient algorithms , 2008, ICML '08.

[58]  Aditya Tulsyan,et al.  Advances in industrial biopharmaceutical batch process monitoring: Machine‐learning methods for small data problems , 2018, Biotechnology and bioengineering.

[59]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[60]  Michel Verhaegen,et al.  N2SID: Nuclear norm subspace identification of innovation models , 2016, Autom..

[61]  Donghua Zhou,et al.  On the use of reconstruction-based contribution for fault diagnosis , 2016 .

[62]  S. Joe Qin,et al.  Statistical process monitoring: basics and beyond , 2003 .

[63]  Biao Huang,et al.  A Bayesian sparse reconstruction method for fault detection and isolation , 2015 .

[64]  Leo H. Chiang,et al.  Advances and opportunities in machine learning for process data analytics , 2019, Comput. Chem. Eng..

[65]  Si-Zhao Joe Qin,et al.  Reconstruction-based contribution for process monitoring , 2009, Autom..

[66]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .