Two methods of selecting Gaussian kernel parameters for one-class SVM and their application to fault detection

As one of the methods to solve one-class classification problems (OCC), one-class support vector machines (OCSVM) have been applied to fault detection in recent years. Among all the kernels available for OCSVM, the Gaussian kernel is the most commonly used one. The selection of Gaussian kernel parameters influences greatly the performances of classifiers, which remains as an open problem. In this paper two methods are proposed to select Gaussian kernel parameters in OCSVM: according to the first one, the parameters are selected using the information of the farthest and the nearest neighbors of each sample; using the second one, the parameters are determined via detecting the ''tightness'' of the decision boundaries. The two proposed methods are tested on UCI data sets and Tennessee Eastman Process benchmark data sets. The results show that, the two proposed methods can be used to select suitable parameters for the Gaussian kernel, enabling the resulting OCSVM models to perform well on fault detection.

[1]  Jean-Pierre Martens,et al.  A Practical Approach to Model Selection for Support Vector Machines With a Gaussian Kernel , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[3]  Deyu Meng,et al.  Fast and Efficient Strategies for Model Selection of Gaussian Support Vector Machine , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  Manabu Kano,et al.  Comparison of statistical process monitoring methods: application to the Eastman challenge problem , 2000 .

[5]  Richard D. Braatz,et al.  Fault Detection and Diagnosis in Industrial Systems , 2001 .

[6]  Joachim Denzler,et al.  One-class classification with Gaussian processes , 2010, Pattern Recognit..

[7]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[8]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[9]  Boleslaw K. Szymanski,et al.  Some Properties of the Gaussian Kernel for One Class Learning , 2007, ICANN.

[10]  S. Joe Qin,et al.  Statistical process monitoring: basics and beyond , 2003 .

[11]  In-Beum Lee,et al.  Fault detection and diagnosis based on modified independent component analysis , 2006 .

[12]  Wenli Xu,et al.  An Approach to Choosing Gaussian Kernel Parameter for One-Class SVMs via Tightness Detecting , 2012, 2012 4th International Conference on Intelligent Human-Machine Systems and Cybernetics.

[13]  Yoram Baram,et al.  Learning by Kernel Polarization , 2005, Neural Computation.

[14]  Roger Xu,et al.  Model Selection for Anomaly Detection in Wireless Ad Hoc Networks , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[15]  Lei Wang,et al.  Two Criteria for Model Selection in Multiclass Support Vector Machines , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Miguel Lázaro-Gredilla,et al.  Low-cost model selection for SVMs using local features , 2012, Eng. Appl. Artif. Intell..

[17]  Tu Bao Ho,et al.  An efficient kernel matrix evaluation measure , 2008, Pattern Recognit..

[18]  Heiko Hoffmann,et al.  Kernel PCA for novelty detection , 2007, Pattern Recognit..

[19]  Robert P. W. Duin,et al.  Minimum spanning tree based one-class classifier , 2009, Neurocomputing.

[20]  Gyemin Lee,et al.  Nested support vector machines , 2010, IEEE Trans. Signal Process..

[21]  Manabu Kano,et al.  Comparison of multivariate statistical process monitoring methods with applications to the Eastman challenge problem , 2002 .

[22]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[23]  Saeid Homayouni,et al.  Anomaly Detection in Hyperspectral Images Based on an Adaptive Support Vector Method , 2011, IEEE Geoscience and Remote Sensing Letters.

[24]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .

[25]  Sirish L. Shah,et al.  Fault detection and diagnosis in process data using one-class support vector machines , 2009 .

[26]  Sheng-De Wang,et al.  Choosing the kernel parameters for support vector machines by the inter-cluster distance in the feature space , 2009, Pattern Recognit..

[27]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.