Evolutionary Optimisation of Kernel and Hyper-Parameters for SVM

Support Vector Machines (SVMs) concern a new generation learning systems based on recent advances in statistical learning theory. A key problem of these methods is how to choose an optimal kernel and how to optimise its parameters. A (multiple) kernel adapted to the problem to be solved could improve the SVM performance. Therefore, our goal is to develop a model able to automatically generate a complex kernel combination (linear or non-linear, weighted or un-weighted, according to the data) and to optimise both the kernel parameters and SVM parameters by evolutionary means in a unified framework. Furthermore we try to analyse the architecture of such kernel of kernels (KoK). Numerical experiments show that the SVM algorithm, involving the evolutionary KoK performs statistically better than some well-known classic kernels and its architecture is adapted to each problem.

[1]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[2]  Jens Gottlieb,et al.  Evolutionary Computation in Combinatorial Optimization , 2006, Lecture Notes in Computer Science.

[3]  Laura Diosan,et al.  Optimising Multiple Kernels for SVM by Genetic Programming , 2008, EvoCOP.

[4]  Gilbert Syswerda,et al.  A Study of Reproduction in Generational and Steady State Genetic Algorithms , 1990, FOGA.

[5]  Sayan Mukherjee,et al.  Choosing Multiple Parameters for Support Vector Machines , 2002, Machine Learning.

[6]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[7]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[8]  S. Sathiya Keerthi,et al.  An Efficient Method for Gradient-Based Adaptation of Hyperparameters in SVM Models , 2006, NIPS.

[9]  Peter J. Angeline,et al.  Two self-adaptive crossover operators for genetic programming , 1996 .

[10]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[11]  John R. Koza,et al.  Genetic Programming II , 1992 .

[12]  Jing Hu,et al.  Model Selection via Bilevel Optimization , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[13]  Riccardo Poli,et al.  Introduction to genetic programming , 2009, GECCO '09.

[14]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[15]  Christian Igel,et al.  Evolutionary tuning of multiple SVM parameters , 2005, ESANN.

[16]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[17]  Michael I. Jordan,et al.  Computing regularization paths for learning multiple kernels , 2004, NIPS.