Sparse PARAFAC2 decomposition: Application to fault detection and diagnosis in batch processes

Abstract The PARAFAC2 decomposition is often used to modeling a set of matrices that have the same number of columns but different numbers of rows. However, the PARAFAC2 model lacks of interpretability because most of elements in factor vectors are nonzero. To overcome this deficiency, a sparse PARAFAC2 (SPARAFAC2) decomposition is developed. SPARAFAC2 yields sparse factor vectors (SFVs) with only a few nonzero elements. Because of the sparsity in factor vectors, the SPARAFAC2 model has much better interpretability than the ordinary PARAFAC2 model. SPARAFAC2 is attractive for the applications in batch processes, because it not only can directly handle the three-way structure of batch data and naturally solve the unequal batch length problem, but also can reveal meaningful connections between process variables. Therefore, based on the SPARAFAC2 decomposition, fault detection and diagnosis methods are proposed for batch processes. To improve the fault detection capability, a cumulative percent contribution criterion is used to adaptively select SFVs for each sample from the fault detection point of view. Two fault detection indices are then defined using the selected SFVs. A contribution-based fault diagnosis method is also proposed. This method identifies faulty variables by evaluating contributions of SFVs and active variables (with nonzero elements) in each SFV to the detection of faults. The effectiveness of the proposed methods is demonstrated with a case study in an industrial-scale fermentation process.

[1]  Furong Gao,et al.  A survey on multistage/multiphase statistical modeling methods for batch processes , 2009, Annu. Rev. Control..

[2]  Jianfeng Mao,et al.  Industrial Process Monitoring Based on Knowledge–Data Integrated Sparse Model and Two-Level Deviation Magnitude Plots , 2018 .

[3]  Han Liu,et al.  Provable sparse tensor decomposition , 2015, 1502.01425.

[4]  Chudong Tong,et al.  Sparse Robust Principal Component Analysis with Applications to Fault Detection and Diagnosis , 2019, Industrial & Engineering Chemistry Research.

[5]  Ali Cinar,et al.  Monitoring, fault diagnosis, fault-tolerant control and optimization: Data driven methods , 2012, Comput. Chem. Eng..

[6]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[7]  A. J. Morris,et al.  On‐line monitoring of batch processes using a PARAFAC representation , 2003 .

[8]  Hadi Fanaee-T,et al.  Tensor-based anomaly detection: An interdisciplinary survey , 2016, Knowl. Based Syst..

[9]  Si-Zhao Joe Qin,et al.  Survey on data-driven industrial process monitoring and diagnosis , 2012, Annu. Rev. Control..

[10]  A. J. Morris,et al.  Non-parametric confidence bounds for process performance monitoring charts☆ , 1996 .

[11]  Genevera I. Allen,et al.  Sparse Higher-Order Principal Components Analysis , 2012, AISTATS.

[12]  J. Kruskal Rank, decomposition, and uniqueness for 3-way and n -way arrays , 1989 .

[13]  Di Tang,et al.  Fault Detection and Diagnosis Based on Sparse PCA and Two-Level Contribution Plots , 2017 .

[14]  Jianfeng Mao,et al.  Adaptive Selection of Latent Variables for Process Monitoring , 2019 .

[15]  Pedro A. Valdes-Sosa,et al.  Penalized PARAFAC analysis of spontaneous EEG recordings , 2008 .

[16]  Barry M. Wise,et al.  Application of PARAFAC2 to fault detection and diagnosis in semiconductor etch , 2001 .

[17]  Henk A. L. Kiers,et al.  Hierarchical relations among three-way methods , 1991 .

[18]  Haiping Lu,et al.  A survey of multilinear subspace learning for tensor data , 2011, Pattern Recognit..

[19]  Lars Kai Hansen,et al.  Algorithms for Sparse Nonnegative Tucker Decompositions , 2008, Neural Computation.

[20]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[21]  Bülent Yener,et al.  Unsupervised Multiway Data Analysis: A Literature Survey , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[23]  Biao Huang,et al.  Review and Perspectives of Data-Driven Distributed Monitoring for Industrial Plant-Wide Processes , 2019, Industrial & Engineering Chemistry Research.

[24]  Barry Lennox,et al.  The development of an industrial-scale fed-batch fermentation simulation. , 2015, Journal of biotechnology.

[25]  Rasmus Bro,et al.  A tutorial on the Lasso approach to sparse modeling , 2012 .

[26]  A. Smilde,et al.  Multivariate statistical process control of batch processes based on three-way models , 2000 .

[27]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[28]  L. Luo,et al.  Phase Partition and Phase-Based Process Monitoring Methods for Multiphase Batch Processes with Uneven Durations , 2016 .

[29]  Xuefeng Yan,et al.  Multivariate Statistical Monitoring of Key Operation Units of Batch Processes Based on Time-Slice CCA , 2019, IEEE Transactions on Control Systems Technology.

[30]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[31]  R. Bro,et al.  PARAFAC2—Part I. A direct fitting algorithm for the PARAFAC2 model , 1999 .

[32]  Zengliang Gao,et al.  Batch Process Monitoring with GTucker2 Model , 2014 .

[33]  L. Luo,et al.  Monitoring Batch Processes Using Sparse Parallel Factor Decomposition , 2017 .