A Kernel Fuzzy c-Means Clustering-Based Fuzzy Support Vector Machine Algorithm for Classification Problems With Outliers or Noises

The support vector machine (SVM) has provided higher performance than traditional learning machines and has been widely applied in real-world classification problems and nonlinear function estimation problems. Unfortunately, the training process of the SVM is sensitive to the outliers or noises in the training set. In this paper, a common misunderstanding of Gaussian-function-based kernel fuzzy clustering is corrected, and a kernel fuzzy c-means clustering-based fuzzy SVM algorithm (KFCM-FSVM) is developed to deal with the classification problems with outliers or noises. In the KFCM-FSVM algorithm, we first use the FCM clustering to cluster each of two classes from the training set in the high-dimensional feature space. The farthest pair of clusters, where one cluster comes from the positive class and the other from the negative class, is then searched and forms one new training set with membership degrees. Finally, we adopt FSVM to induce the final classification results on this new training set. The computational complexity of the KFCM-FSVM algorithm is analyzed. A set of experiments is conducted on six benchmarking datasets and four artificial datasets for testing the generalization performance of the KFCM-FSVM algorithm. The results indicate that the KFCM-FSVM algorithm is robust for classification problems with outliers or noises.

[1]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[2]  Jin Hyung Kim,et al.  Face Recognition using Support Vector Machines with Local Correlation Kernels , 2002, Int. J. Pattern Recognit. Artif. Intell..

[3]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[4]  Smriti Srivastava,et al.  A New Kernel based Hybrid c-Means Clustering Model , 2007, 2007 IEEE International Fuzzy Systems Conference.

[5]  Dao-Qiang Zhang,et al.  A novel kernelized fuzzy C-means algorithm with application in medical image segmentation , 2004, Artif. Intell. Medicine.

[6]  Xindong Wu Knowledge Acquisition from Databases , 1995 .

[7]  Johan A. K. Suykens,et al.  Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.

[8]  Shaoning Pang,et al.  Membership authentication in the dynamic group by face classification using SVM ensemble , 2003, Pattern Recognit. Lett..

[9]  Jacek M. Leski Neuro-fuzzy system with learning tolerant to imprecision , 2003, Fuzzy Sets Syst..

[10]  Meng Joo Er,et al.  Robust Data Clustering in Mercer Kernel-Induced Feature Space , 2006, ISNN.

[11]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[13]  J.M. Leski,et al.  An /spl epsiv/-margin nonlinear classifier based on fuzzy if-then rules , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  David R. Musicant,et al.  Lagrangian Support Vector Machines , 2001, J. Mach. Learn. Res..

[15]  Frank Klawonn,et al.  A contribution to convergence theory of fuzzy c-means and derivatives , 2003, IEEE Trans. Fuzzy Syst..

[16]  Zhou Yong,et al.  Robust Fuzzy-Possibilistic C-Means Algorithm , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[17]  Dug Hun Hong,et al.  Interval regression analysis using quadratic loss support vector machine , 2005, IEEE Transactions on Fuzzy Systems.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  Chin-Teng Lin,et al.  Support-vector-based fuzzy neural network for pattern classification , 2006, IEEE Transactions on Fuzzy Systems.

[20]  Zhang Yi,et al.  Fuzzy SVM with a new fuzzy membership function , 2006, Neural Computing & Applications.

[21]  Javier Montero,et al.  Accuracy statistics for judging soft classification , 2008 .

[22]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[23]  Vincenzo Cutello,et al.  Fuzzy classification systems , 2004, Eur. J. Oper. Res..

[24]  Xiao-Hong Wu,et al.  Possibilistic Fuzzy c-Means Clustering Model Using Kernel Methods , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[25]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[26]  Reshma Khemchandani,et al.  Fast and robust learning through fuzzy linear proximal support vector machines , 2004, Neurocomputing.

[27]  Dug Hun Hong,et al.  Support vector fuzzy regression machines , 2003, Fuzzy Sets Syst..

[28]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[29]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[30]  Jens Jäkel,et al.  A New Convergence Proof of Fuzzy c-Means , 2005, IEEE Trans. Fuzzy Syst..

[31]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[32]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[33]  Kohei Inoue,et al.  Robust Kernel Fuzzy Clustering , 2005, FSKD.

[34]  Xuegong Zhang,et al.  Using class-center vectors to build support vector machines , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[35]  Boudewijn P. F. Lelieveldt,et al.  A new cluster validity index for the fuzzy c-mean , 1998, Pattern Recognit. Lett..

[36]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[37]  Modesto Castrillón,et al.  Face recognition using independent component analysis and support vector machines , 2003 .

[38]  Kin Keung Lai,et al.  A new fuzzy support vector machine to evaluate credit risk , 2005, IEEE Transactions on Fuzzy Systems.

[39]  V. Vapnik,et al.  Bounds on Error Expectation for Support Vector Machines , 2000, Neural Computation.

[40]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[41]  Jacek M Leski Epsilon-insensitive fuzzy c-regression models: introduction to epsilon-insensitive fuzzy modeling. , 2004, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society.

[42]  Miin-Shen Yang,et al.  A Gaussian kernel-based fuzzy c-means algorithm with a spatial bias correction , 2008, Pattern Recognit. Lett..

[43]  William-Chandra Tjhi,et al.  Dual Fuzzy-Possibilistic Coclustering for Categorization of Documents , 2009, IEEE Transactions on Fuzzy Systems.

[44]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[45]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[46]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[47]  Zhongdong Wu,et al.  Fuzzy C-means clustering algorithm based on kernel method , 2003, Proceedings Fifth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2003.

[48]  Jung-Hsien Chiang,et al.  In Silico Prediction of Human Protein Interactions Using Fuzzy–SVM Mixture Models and Its Application to Cancer Research , 2008, IEEE Transactions on Fuzzy Systems.

[49]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Jacek M. Leski,et al.  TSK-fuzzy modeling based on /spl epsiv/-insensitive learning , 2005, IEEE Transactions on Fuzzy Systems.

[51]  Wu Tie-jun Support vector machines for pattern recognition , 2003 .

[52]  H. P. Huang,et al.  Fuzzy Support Vector Machines for Pattern Recognition and Data Mining , 2002 .

[53]  Yixin Chen,et al.  Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[54]  Jacek M Leski An epsilon-margin nonlinear classifier based on fuzzy if-then rules. , 2004, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society.

[55]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[56]  Byung-In Choi,et al.  Kernel approach to possibilistic C -means clustering , 2009 .

[57]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[58]  Doheon Lee,et al.  On cluster validity index for estimation of the optimal number of fuzzy clusters , 2004, Pattern Recognit..

[59]  Doheon Lee,et al.  Evaluation of the performance of clustering algorithms in kernel-induced feature space , 2005, Pattern Recognit..

[60]  Liang Liao,et al.  MRI brain image segmentation and bias field correction based on fast spatially constrained kernel clustering approach , 2008, Pattern Recognit. Lett..

[61]  Jacek M. Leski Epsiv-insensitive Fuzzy C-regression Models: Introduction to Epsiv-insensitive Fuzzy Modeling , 2004, IEEE Trans. Syst. Man Cybern. Part B.

[62]  Javier Montero,et al.  Improvements to Remote Sensing Using Fuzzy Classification, Graphs and Accuracy Statistics , 2008 .

[63]  Jiri Matas,et al.  Support vector machines for face authentication , 2002, Image Vis. Comput..

[64]  Daoqiang Zhang,et al.  Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[65]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[66]  Noureddine Zahid,et al.  A new cluster-validity for fuzzy clustering , 1999, Pattern Recognit..

[67]  Jacek M. Łȩski,et al.  Neuro-fuzzy system with learning tolerant to imprecision , 2003 .

[68]  Sadaaki Miyamoto,et al.  Possibilistic Approach to Kernel-Based Fuzzy c-Means Clustering with Entropy Regularization , 2005, MDAI.