Monitoring Statistics and Tuning of Kernel Principal Component Analysis With Radial Basis Function Kernels

Kernel Principal Component Analysis (KPCA) using Radial Basis Function (RBF) kernels can capture data nonlinearity by projecting the original variable space to a high-dimensional kernel feature space and obtaining the kernel principal components. This article examines the tuning of the kernel width when using RBF kernels in KPCA, showing that inappropriate kernel widths result in RBF-KPCA being unable to capture nonlinearity present in data. The paper also considers the choice of monitoring statistics when RBF-KPCA is applied to anomaly detection. Linear PCA requires two monitoring statistics. The Hotelling’s $T^{2}$ monitoring statistic detects when a sample exceeds the healthy operating range, while the Squared Prediction Error (SPE) monitoring statistic detects the case when the sample does not follow the model of the training data. The analysis in this article shows that SPE for RBF-KPCA can detect both cases. Moreover, unlike the case of linear PCA, the $T^{2}$ monitoring statistic for RBF-KPCA is non-monotonic with respect to the magnitude of the anomaly, making it not optimal as a monitoring statistic. The paper presents examples to illustrate these points. The paper also provides a detailed mathematical analysis which explains the observations from a theoretical perspective. Tuning strategies are proposed for setting the kernel width and the detection threshold of the monitoring statistic. The performance of optimally tuned RBF-KPCA for anomaly detection is demonstrated via numerical simulation and a benchmark dataset from an industrial-scale facility.

[1]  In-Beum Lee,et al.  Fault Detection of Non-Linear Processes Using Kernel Independent Component Analysis , 2008 .

[2]  Mahmood Shafiee,et al.  Mixed kernel canonical variate dissimilarity analysis for incipient fault monitoring in nonlinear dynamic processes , 2019, Comput. Chem. Eng..

[3]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[4]  Nitin Kumar,et al.  Kernel Generalized-Gaussian Mixture Model for Robust Abnormality Detection , 2017, MICCAI.

[5]  Khaoula Ben Abdellafou,et al.  New online kernel method with the Tabu search algorithm for process monitoring , 2018, Trans. Inst. Meas. Control.

[6]  Sheng Chen,et al.  Nonlinear Process Fault Diagnosis Based on Serial Principal Component Analysis , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Nojun Kwak,et al.  Principal Component Analysis by $L_{p}$ -Norm Maximization , 2014, IEEE Transactions on Cybernetics.

[8]  Yan-Wei Pang,et al.  An Iterative Algorithm for Robust Kernel Principal Component Analysis , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[9]  Nan Li,et al.  Ensemble Kernel Principal Component Analysis for Improved Nonlinear Process Monitoring , 2015 .

[10]  James R. Ottewill,et al.  A heterogeneous benchmark dataset for data analytics: Multiphase flow facility case study , 2019, Journal of Process Control.

[11]  Carlos F. Alcala,et al.  Reconstruction-based contribution for process monitoring with kernel principal component analysis , 2010, Proceedings of the 2010 American Control Conference.

[12]  Tao Xu,et al.  Fault detection and diagnosis strategy based on a weighted and combined index in the residual subspace associated with PCA , 2018 .

[13]  Furong Gao,et al.  Review of Recent Research on Data-Based Process Monitoring , 2013 .

[14]  Nader Meskin,et al.  Sensor fault detection and isolation of an industrial gas turbine using partial adaptive KPCA , 2018 .

[15]  Jin Hyun Park,et al.  Fault detection and identification of nonlinear processes based on kernel PCA , 2005 .

[16]  In-Beum Lee,et al.  Nonlinear dynamic process monitoring based on dynamic kernel PCA , 2004 .

[17]  Xuefeng Yan,et al.  Parallel PCA–KPCA for nonlinear process monitoring , 2018, Control Engineering Practice.

[18]  Heiko Hoffmann,et al.  Kernel PCA for novelty detection , 2007, Pattern Recognit..

[19]  Nitin Kumar,et al.  Kernel generalized Gaussian and robust statistical learning for abnormality detection in medical images , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[20]  C. Yoo,et al.  Nonlinear process monitoring using kernel principal component analysis , 2004 .

[21]  Fei He,et al.  Nonlinear fault detection of batch processes based on functional kernel locality preserving projections , 2018, Chemometrics and Intelligent Laboratory Systems.

[22]  Abdelhafid Benyounes,et al.  Diagnosis of uncertain nonlinear systems using interval kernel principal components analysis: Application to a weather station. , 2018, ISA transactions.

[23]  Zhiqiang Ge,et al.  Improved kernel PCA-based monitoring approach for nonlinear processes , 2009 .

[24]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[25]  Fernando De la Torre,et al.  Robust Kernel Principal Component Analysis , 2008, NIPS.

[26]  Ning Wang,et al.  The optimization of the kind and parameters of kernel function in KPCA for process monitoring , 2012, Comput. Chem. Eng..

[27]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.

[28]  Christopher J. Taylor,et al.  The use of kernel principal component analysis to model data distributions , 2003, Pattern Recognit..