论文信息 - A fast algorithm for training support vector regression via smoothed primal function minimization

A fast algorithm for training support vector regression via smoothed primal function minimization

The support vector regression (SVR) model is usually fitted by solving a quadratic programming problem, which is computationally expensive. To improve the computational efficiency, we propose to directly minimize the objective function in the primal form. However, the loss function used by SVR is not differentiable, which prevents the well-developed gradient based optimization methods from being applicable. As such, we introduce a smooth function to approximate the original loss function in the primal form of SVR, which transforms the original quadratic programming into a convex unconstrained minimization problem. The properties of the proposed smoothed objective function are discussed and we prove that the solution of the smoothly approximated model converges to the original SVR solution. A conjugate gradient algorithm is designed for minimizing the proposed smoothly approximated objective function in a sequential minimization manner. Extensive experiments on real-world datasets show that, compared to the quadratic programming based SVR, the proposed approach can achieve similar prediction accuracy with significantly improved computational efficiency, specifically, it is hundreds of times faster for linear SVR model and multiple times faster for nonlinear SVR model.

Songfeng Zheng

[1] Federico Girosi,et al. An improved training algorithm for support vector machines , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[2] Chih-Jen Lin,et al. Coordinate Descent Method for Large-scale L2-loss Linear Support Vector Machines , 2008, J. Mach. Learn. Res..

[3] Olvi L. Mangasarian,et al. A class of smoothing functions for nonlinear and mixed complementarity problems , 1996, Comput. Optim. Appl..

[4] Alexander J. Smola,et al. Learning with kernels , 1998 .

[5] G. Wahba,et al. Some results on Tchebycheffian spline functions , 1971 .

[6] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[7] Yuh-Jye Lee,et al. SSVM: A Smooth Support Vector Machine for Classification , 2001, Comput. Optim. Appl..

[8] Thorsten Joachims,et al. Making large scale SVM learning practical , 1998 .

[9] I-Cheng Yeh,et al. Modeling of strength of high-performance concrete using artificial neural networks , 1998 .

[10] Chih-Jen Lin,et al. A dual coordinate descent method for large-scale linear SVM , 2008, ICML '08.

[11] Songfeng Zheng,et al. Gradient descent algorithms for quantile regression with smooth approximation , 2011, Int. J. Mach. Learn. Cybern..

[12] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[13] Vladimir Vapnik,et al. Statistical learning theory , 1998 .