Heuristics for Kernels Adaptation in Support Vector Machines

Support Vector Machines are an algorithm introduced by Vapnik and coworkers [9], [10]. They are based on the idea that if input points are mapped to a high dimensional feature space then, a separating hyperplane can be easily found. SVM and kernel methods have been applied to a wide class of problems including approximation and classification and they have proven a remarkable performance on real world problems. An important step in their design is the setting of the kernels parameters which defines the structure of the high dimensional feature space where a maximal margin hyperplane will be found. Too rich feature space, e.g. small kernel parameters, will over-fit the data and hence result in a poor generalisation error, whereas if the kernel parameter is too big, the model will not be able to separate the data. In this paper we firstly propose a heuristic that permits the individual control of the growth in each kernel, which results in more sparse models with higher prediction accuracy. Secondly, a heuristic resulting from the combination of SVM trained by linear programming (LP) and EC for the optimisation of the kernels width is proposed.

[1]  Thomas F. Coleman,et al.  Large-Scale Numerical Optimization , 1990 .

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  Terrence J. Sejnowski,et al.  Analysis of hidden units in a layered network trained to classify sonar targets , 1988, Neural Networks.

[4]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[5]  Nello Cristianini,et al.  The Kernel-Adatron Algorithm: A Fast and Simple Learning Procedure for Support Vector Machines , 1998, ICML.

[6]  V. Kecman,et al.  Support vector machines trained by linear programming: theory and application in image compression and data classification , 2000, Proceedings of the 5th Seminar on Neural Network Applications in Electrical Engineering. NEUREL 2000 (IEEE Cat. No.00EX287).

[7]  O. Mangasarian,et al.  Pattern Recognition Via Linear Programming: Theory and Application to Medical Diagnosis , 1989 .

[8]  Ingo Renners,et al.  Methodology for Optimizing Fuzzy Classifiers Based on Computational Intelligence , 2001, Fuzzy Days.

[9]  Ingo Renners,et al.  Optimizing fuzzy classifiers by evolutionary algorithms , 2000, KES'2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedings (Cat. No.00TH8516).

[10]  Nello Cristianini,et al.  Dynamically Adapting Kernels in Support Vector Machines , 1998, NIPS.