Sequential Support Vector Classifiers and Regression

Support Vector Machines (SVMs) map the input training data into a high-dimensional feature space and find a maximal margin hyperplane separating the data in that feature space. Extensions of this approach account for non-separable or noisy training data (soft classifiers) as well as support vector based regression. The optimal hyperplane is usually found by solving a quadratic programming problem, which is complex, time consuming and prone to numerical instabilities. In this work, we introduce a sequential gradient ascent based algorithm for fast and simple implementation of the SVM for classification with soft classifiers. The fundamental idea is similar to applying the Adatron algorithm to the SVM, as developed independently in the Kernel-Adatron [7], although the details differ in many respects. We modify the formulation of the bias and consider a modified dual optimization problem. This formulation makes it possible to extend the framework to SVM regression in an online setting. This paper presents theoretical justifications for the algorithm, which is shown to converge robustly to the optimal solution in very few iterations, to be orders of magnitude faster than conventional SVM solvers, and to be extremely simple to implement even for large problems. Experimental evaluations on benchmark classification problems, the sonar data set and the USPS and MNIST databases, substantiate the speed and robustness of the learning procedure.
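To make the flavor of such a sequential gradient ascent scheme concrete, the following is a minimal sketch of a kernel-Adatron-style update for a soft-margin SVM classifier. It is an illustrative approximation, not the exact formulation of this paper: the bias is simply absorbed by adding a constant to the kernel, and the hyperparameters (learning rate eta, box constraint C, RBF width gamma) are assumed values chosen for illustration.

```python
# Illustrative sketch: sequential gradient ascent (kernel-Adatron style) for a
# soft-margin SVM classifier. Not the paper's exact algorithm; the bias is
# absorbed by adding 1 to the kernel, and eta, C, gamma are assumed values.
import numpy as np

def rbf_kernel(X, gamma=0.5):
    # Gram matrix of the Gaussian (RBF) kernel; the +1 absorbs the bias term.
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * d2) + 1.0

def sequential_svm_fit(X, y, C=10.0, eta=0.1, epochs=100, tol=1e-4):
    """Labels y in {-1, +1}; returns the dual coefficients alpha."""
    K = rbf_kernel(X)
    n = len(y)
    alpha = np.zeros(n)
    for _ in range(epochs):
        max_change = 0.0
        for i in range(n):
            # Margin of pattern i under the current dual variables.
            z_i = y[i] * np.sum(alpha * y * K[:, i])
            # Gradient of the dual objective w.r.t. alpha_i is (1 - z_i);
            # take a gradient ascent step and clip to the box [0, C].
            new_alpha = np.clip(alpha[i] + eta * (1.0 - z_i), 0.0, C)
            max_change = max(max_change, abs(new_alpha - alpha[i]))
            alpha[i] = new_alpha
        if max_change < tol:  # stop once updates become negligible
            break
    return alpha

def svm_predict(X_train, y_train, alpha, X_test, gamma=0.5):
    # Decision function f(x) = sum_j alpha_j y_j K(x, x_j), thresholded at 0.
    sq_tr = np.sum(X_train ** 2, axis=1)
    sq_te = np.sum(X_test ** 2, axis=1)
    d2 = sq_te[:, None] + sq_tr[None, :] - 2.0 * X_test @ X_train.T
    K = np.exp(-gamma * d2) + 1.0
    return np.sign(K @ (alpha * y_train))
```

Each pass sweeps the training patterns one at a time, so the memory footprint is a single row of the Gram matrix per update, which is what makes this style of solver simple to implement and amenable to online operation.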