Similarity based robust probability latent variable regression model and its kernel extension for process monitoring

Abstract In most industries, process and quality measurements with outliers are often collected. The outliers would have negative influences on data-based modelling and process monitoring. In our previous work on probability latent variable regression (PLVR), the model is constructed under the assumption that the data quality of the process characteristics is good and the operation processes are linear. In this article, a robust PLVR (RPLVR) model is developed. Then it is extended to its nonlinear form, called robust probability kernel latent variable regression (RPKLVR). Both models can reduce the effects of outliers. RPLVR and RPKLVR are the weighted probability models. The similarity of each sample among all the collected data would be chosen as the weighting factor for each sample. Thus, the outliers for modelling are weakened. With the weighted training data, an expectation-maximization algorithm of training RPLVR and RPKLVR are derived. The corresponding statistics are also systematically constructed for the fault detection. Two case studies are presented to illustrate the effectiveness of the proposed methods.

[1]  G. Box Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification , 1954 .

[2]  Zhiqiang Ge,et al.  Probabilistic latent variable regression model for process-quality monitoring , 2014 .

[3]  Si-Zhao Joe Qin,et al.  Survey on data-driven industrial process monitoring and diagnosis , 2012, Annu. Rev. Control..

[4]  Tao Chen,et al.  Robust probabilistic PCA with missing data and contribution analysis for outlier detection , 2009, Comput. Stat. Data Anal..

[5]  S. Joe Qin,et al.  Quality‐relevant and process‐relevant fault monitoring with concurrent projection to latent structures , 2013 .

[6]  Junghui Chen,et al.  On-line batch process monitoring using dynamic PCA and dynamic PLS models , 2002 .

[7]  Steven X. Ding,et al.  A Review on Basic Data-Driven Approaches for Industrial Process Monitoring , 2014, IEEE Transactions on Industrial Electronics.

[8]  Christos Georgakis,et al.  Plant-wide control of the Tennessee Eastman problem , 1995 .

[9]  Zhiqiang Ge,et al.  Semisupervised Bayesian method for soft sensor modeling with unlabeled data samples , 2011 .

[10]  Steven X. Ding,et al.  Data-driven design of monitoring and diagnosis systems for dynamic processes: A review of subspace technique based schemes and some recent results , 2014 .

[11]  Richard D. Braatz,et al.  Fault detection in industrial processes using canonical variate analysis and dynamic principal component analysis , 2000 .

[12]  Zhiqiang Ge,et al.  Robust modeling of mixture probabilistic principal component analysis and process monitoring application , 2014 .

[13]  In-Beum Lee,et al.  Nonlinear dynamic process monitoring based on dynamic kernel PCA , 2004 .

[14]  Furong Gao,et al.  Review of Recent Research on Data-Based Process Monitoring , 2013 .

[15]  M Daszykowski,et al.  Dealing with missing values and outliers in principal component analysis. , 2007, Talanta.

[16]  Donghua Zhou,et al.  Total projection to latent structures for process monitoring , 2009 .

[17]  Donghua Zhou,et al.  Geometric properties of partial least squares for process monitoring , 2010, Autom..

[18]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .

[19]  Zhiqiang Ge,et al.  Semi-supervised PLVR models for process monitoring with unequal sample sizes of process variables and quality variables , 2015 .

[20]  C. Yoo,et al.  Nonlinear process monitoring using kernel principal component analysis , 2004 .