Multi-task feature and kernel selection for SVMs

We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is advantageous when multiple classification tasks with differently labeled datasets exist over a common input space, because the different datasets can then mutually reinforce a common choice of representation or relevant features for their various classifiers. We derive a multi-task representation learning approach using the maximum entropy discrimination formalism. The resulting algorithms are convex and maintain the global solution properties of support vector machines. However, in addition to the multiple SVM classification/regression parameters, they also jointly estimate an optimal subset of features or an optimal combination of kernels. Experiments are shown on standardized datasets.
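The core idea of the abstract — several tasks over a common input space pooling evidence for one shared representation — can be illustrated with a toy sketch. This is not the paper's maximum entropy discrimination algorithm; it is a hypothetical stand-in in which each task scores features by absolute correlation with its labels, the scores are summed across tasks, and the top-k features become the common subset on which every task's SVM would then be trained.

```python
import numpy as np

# Illustrative sketch (not the paper's MED algorithm): multiple tasks over a
# common input space vote for a shared subset of relevant features.

rng = np.random.default_rng(0)

def make_task(n=200, d=10, informative=(0, 1, 2)):
    """Synthetic binary task whose labels depend only on `informative` features."""
    X = rng.standard_normal((n, d))
    w = np.zeros(d)
    w[list(informative)] = [2.0, -1.5, 1.0]  # relevant directions shared by all tasks
    y = np.sign(X @ w + 0.1 * rng.standard_normal(n))
    return X, y

def feature_scores(X, y):
    """Per-task relevance: absolute correlation of each feature with the label."""
    Xc = (X - X.mean(axis=0)) / X.std(axis=0)
    return np.abs(Xc.T @ y) / len(y)

tasks = [make_task() for _ in range(3)]

# Pool evidence across tasks: the datasets mutually reinforce a common choice.
total = sum(feature_scores(X, y) for X, y in tasks)
k = 3
shared = np.sort(np.argsort(total)[-k:])
print("shared features:", shared.tolist())  # the jointly informative features [0, 1, 2]
```

In the paper's formulation this joint selection is solved inside a convex MED problem together with the SVM parameters, rather than by the greedy scoring heuristic used here for illustration.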