Reducing Kernel Matrix Diagonal Dominance Using Semi-definite Programming

Kernel-based learning methods revolve around the notion of a kernel or Gram matrix between data points. These square, symmetric, positive semi-definite matrices can informally be regarded as encoding the pairwise similarity between all of the objects in a data set. In this paper we propose an algorithm for manipulating the diagonal entries of a kernel matrix using semi-definite programming. Kernel matrix diagonal dominance reduction addresses the problem of learning with almost orthogonal features, a phenomenon commonplace in kernel matrices derived from string kernels or from Gaussian kernels with a small width parameter. We show how this task can be formulated as a semi-definite programming optimization problem that can be solved with readily available optimizers. On the theoretical side, we give an analysis based on Rademacher complexity bounds that provides an alternative motivation for the 1-norm SVM from the perspective of kernel diagonal reduction. We assess the performance of the algorithm on standard data sets, with encouraging results in terms of both approximation and prediction.
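The sketch below illustrates one way such a semi-definite program might be set up with an off-the-shelf optimizer (here CVXPY, which is an assumption; the paper does not prescribe a particular solver). The specific formulation shown, maximizing the total amount subtracted from the diagonal subject to the corrected matrix remaining positive semi-definite, is a plausible instance of diagonal dominance reduction rather than necessarily the exact program proposed in the paper.

```python
import numpy as np
import cvxpy as cp

# Toy data: a Gaussian kernel with a small width parameter, which tends to
# produce a strongly diagonally dominant kernel matrix (near-orthogonal features).
rng = np.random.default_rng(0)
X = rng.standard_normal((20, 5))
sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
sigma = 0.1  # small width parameter
K = np.exp(-sq_dists / (2 * sigma ** 2))
K = 0.5 * (K + K.T)  # enforce exact symmetry for the SDP constraint

n = K.shape[0]

# Hypothetical SDP (an illustrative formulation, not taken from the paper):
# subtract as much mass as possible from the diagonal while K - diag(d)
# stays positive semi-definite, so the result is still a valid kernel matrix.
d = cp.Variable(n, nonneg=True)
objective = cp.Maximize(cp.sum(d))
constraints = [K - cp.diag(d) >> 0]
cp.Problem(objective, constraints).solve()

K_reduced = K - np.diag(d.value)
print("mean diagonal before:", K.diagonal().mean())
print("mean diagonal after: ", K_reduced.diagonal().mean())
print("smallest eigenvalue after:", np.linalg.eigvalsh(K_reduced).min())
```

In a practical variant one might also bound each diagonal correction individually, trading off how much dominance is removed against how closely the reduced matrix preserves the original off-diagonal structure.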