We develop a fully-automated pattern search methodology for model selection of support vector machines (SVMs) for regression and classification. Pattern search (PS) is a derivative-free optimization method suitable for low-dimensional optimization problems for which it is difficult or impossible to calculate derivatives. This methodology was motivated by an application in drug design in which regression models are constructed based on a few high-dimensional exemplars. Automatic model selection in such underdetermined problems is essential to avoid overfitting and overestimates of generalization capability caused by selecting parameters based on testing results. We focus on SVM model selection for regression based on leave-one-out (LOO) and cross-validated estimates of mean squared error, but the search strategy is applicable to any model criterion. Because the resulting error surface produces an extremely noisy map of the model quality with many local minima, the resulting generalization capacity of any single local optimal model illustrates high variance. Thus several locally optimal SVMmodels are generated and then bagged or averaged to produce the final SVM. This strategy of pattern search combined with model averaging has proven to be very effective on benchmark tests and in high-variance drug design domains with high potential of overfitting. ∗This work is supported by the NSF Grants IIS-9979860 and IRI-97092306. †Dept. of Decision Sciences and Engineering Systems, Rensselaer Polytechnic Institute, Troy, NY 12180, mommam@rpi.edu ‡Dept. of Mathematical Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, bennek@rpi.edu
[1]
Samy Bengio,et al.
SVMTorch: Support Vector Machines for Large-Scale Regression Problems
,
2001,
J. Mach. Learn. Res..
[2]
Robert Michael Lewis,et al.
Pattern Search Algorithms for Bound Constrained Minimization
,
1999,
SIAM J. Optim..
[3]
J. Weston,et al.
Support vector regression with ANOVA decomposition kernels
,
1999
.
[4]
Bernhard Schölkopf,et al.
Shrinking the Tube: A New Support Vector Regression Algorithm
,
1998,
NIPS.
[5]
Vladimir N. Vapnik,et al.
The Nature of Statistical Learning Theory
,
2000,
Statistics for Engineering and Information Science.
[6]
S. Sathiya Keerthi,et al.
Efficient tuning of SVM hyperparameters using radius/margin bound and iterative algorithms
,
2002,
IEEE Trans. Neural Networks.
[7]
F. Lombardo,et al.
Computation of brain-blood partitioning of organic solutes via free energy calculations.
,
1996,
Journal of medicinal chemistry.
[8]
Alexander J. Smola,et al.
Support Vector Regression Machines
,
1996,
NIPS.
[9]
Sayan Mukherjee,et al.
Choosing Multiple Parameters for Support Vector Machines
,
2002,
Machine Learning.
[10]
Lawrence Davis,et al.
Genetic Algorithms and Simulated Annealing
,
1987
.
[11]
Virginia Torczon,et al.
DERIVATIVE-FREE PATTERN SEARCH METHODS FOR MULTIDISCIPLINARY DESIGN PROBLEMS
,
1994
.
[12]
Olivier Chapelle,et al.
Model Selection for Support Vector Machines
,
1999,
NIPS.