Constraint Projections for Discriminative Support Vector Machines

Abstract— The objection of the standard support vector machine (SVM) is to achieve the maximum margin and control the generalization capability of SVM classifier. It only considers the punishment constraints, but does not introduce the within-class and between-class constraint projections. A novel robust discriminative support vector machines algorithm with pairwise constraint projections called CPSVM is discussed for linearly separable and linearly non-separable datasets. CPSVM can preserve the structure of the original samples data as well as the pairwise constraints in the projective feature spaces. The central idea is to find a projective vector such that can assure the maximum margin of SVM hyperplane, and simultaneously considers improving the tightness among the distances between the similar patterns under the must-link constraints, while expanding the distances between those dissimilar pairs under the cannot-link constraints after the restricted data are projected onto the projection vector in the high-dimensional kernel spaces. The projection process can make the nonlinear data separable in the feature spaces. We demonstrate the practical usefulness and good performance of CPSVM algorithm in data visualization and classification tasks through extensive simulations on eleven UCI datasets. Experimental results show that CPSVM algorithm can almost always achieve the highest accuracies and has better robustness to the distant data noise.

[1]  Peng Xiyuan A New Support Vector Machine Method for Unbalanced Data Treatment , 2006 .

[2]  Li Hong,et al.  A Learning Strategy of SVM Used to Large Training Set , 2004 .

[3]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[4]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[5]  Ivor W. Tsang,et al.  Learning with Idealized Kernels , 2003, ICML.

[6]  Wu Tie-jun,et al.  Weighted Support Vector Machine Based Classification Algorithm for Uneven Class Size Problems , 2003 .

[7]  Xuegong Zhang,et al.  Editing support vector machines , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[8]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[9]  Robert A. Lordo,et al.  Learning from Data: Concepts, Theory, and Methods , 2001, Technometrics.

[10]  Daphna Weinshall,et al.  Enhancing image and video retrieval: learning via equivalence constraints , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[11]  Dao-Qiang Zhang,et al.  Clustering Incomplete Data Using Kernel-Based Fuzzy C-means Algorithm , 2003, Neural Processing Letters.

[12]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[13]  Rekha Govil,et al.  Neural Networks in Signal Processing , 2000 .

[14]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[15]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.