Kernel Selection in Support Vector Machines Using Gram-Matrix Properties

We describe an approach to kernel selection in Support Vector Machines (SVMs) driven by the Gram matrix. Our study extracts properties from this matrix (e.g., Fisher's discriminant, Bregman divergence) under different kernel functions (linear, polynomial, Gaussian, Laplacian, Bessel, and ANOVA RBF), and incorporates these properties as meta-features within a meta-learning framework. The goal is to predict the best kernel for SVMs. Results show that introducing a new meta-feature, the Distance Ratio, which captures the relationship between inter-class and intra-class distances in the feature space, yields substantial improvements in kernel selection.
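To make the kind of meta-feature described above concrete, the minimal sketch below computes a Gram matrix for one candidate kernel and a ratio of mean inter-class to mean intra-class feature-space distances. The exact definition of the Distance Ratio used in the study is not reproduced here; the choice of squared feature-space distance d(x_i, x_j)^2 = K_ii + K_jj - 2K_ij and the ratio of class-wise means is an assumption made only for illustration.

```python
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel


def feature_space_distances(K):
    """Squared distances in the kernel-induced feature space:
    d(x_i, x_j)^2 = K_ii + K_jj - 2*K_ij."""
    diag = np.diag(K)
    return diag[:, None] + diag[None, :] - 2.0 * K


def distance_ratio(K, y):
    """Hypothetical Distance Ratio meta-feature: mean inter-class distance
    divided by mean intra-class distance, measured in feature space."""
    D = feature_space_distances(K)
    same_class = y[:, None] == y[None, :]
    not_self = ~np.eye(len(y), dtype=bool)   # exclude self-distances
    intra = D[same_class & not_self].mean()
    inter = D[~same_class].mean()
    return inter / intra


# Toy usage with a Gaussian (RBF) kernel as one candidate
rng = np.random.RandomState(0)
X = rng.randn(100, 5)
y = (X[:, 0] > 0).astype(int)
K = rbf_kernel(X, gamma=0.5)  # Gram matrix for this kernel
print(distance_ratio(K, y))
```

In a meta-learning setup, such quantities would be computed per dataset and per candidate kernel and then fed, together with other Gram-matrix properties, to a meta-model that recommends the kernel to use.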