An approximation of the Gaussian RBF kernel for efficient classification with SVMs

Gaussian RBF kernels are approximated to speed up SVM classifications.An upper bound for the relative approximation error is given.Error decreases with factorial growth if approximation quality is linearly increased.Experiments showed an average 18-fold speed-up without losing accuracy. In theory, kernel support vector machines (SVMs) can be reformulated to linear SVMs. This reformulation can speed up SVM classifications considerably, in particular, if the number of support vectors is high. For the widely-used Gaussian radial basis function (RBF) kernel, however, this theoretical fact is impracticable because the reproducing kernel Hilbert space (RKHS) of this kernel has infinite dimensionality. Therefore, we derive a finite-dimensional approximative feature map, based on an orthonormal basis of the kernels RKHS, to enable the reformulation of Gaussian RBF SVMs to linear SVMs. We show that the error of this approximative feature map decreases with factorial growth if the approximation quality is linearly increased. Experimental evaluations demonstrated that the approximative feature map achieves considerable speed-ups (about 18-fold on average), mostly without losing classification accuracy. Therefore, the proposed approximative feature map provides an efficient SVM evaluation method with minimal loss of precision.

[1]  S. Sathiya Keerthi,et al.  Building Support Vector Machines with Reduced Classifier Complexity , 2006, J. Mach. Learn. Res..

[2]  Davide Anguita,et al.  Support vector machines and strictly positive definite kernel: The regularization hyperparameter is more important than the kernel hyperparameters , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[3]  H. Minh,et al.  Some Properties of Gaussian Reproducing Kernel Hilbert Spaces and Their Implications for Function Approximation and Learning Theory , 2010 .

[4]  Davide Anguita,et al.  A FPGA Core Generator for Embedded Classification Systems , 2011, J. Circuits Syst. Comput..

[5]  Johan A. K. Suykens,et al.  Fast Prediction with SVM Models Containing RBF Kernels , 2014, ArXiv.

[6]  Fuxin Li,et al.  Chebyshev Approximations to the Histogram $\chi^2$ Kernel , 2012 .

[7]  Florent Perronnin,et al.  Large-scale image categorization with explicit data embedding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[9]  José Carlos Príncipe,et al.  An Explicit Construction Of A Reproducing Gaussian Kernel Hilbert Space , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Andrea Boni,et al.  Low-Power and Low-Cost Implementation of SVMs for Smart Sensors , 2007, IEEE Transactions on Instrumentation and Measurement.

[11]  Cristian Sminchisescu,et al.  Chebyshev approximations to the histogram χ2 kernel , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Ingo Steinwart,et al.  Sparseness of Support Vector Machines---Some Asymptotically Sharp Bounds , 2003, NIPS.

[13]  Subhransu Maji,et al.  Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jocelyn Chanussot,et al.  Speeding up Support Vector Machine (SVM) image classification by a kernel series expansion , 2008, 2008 15th IEEE International Conference on Image Processing.

[15]  Subhransu Maji,et al.  Max-margin additive classifiers for detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[16]  Ming-Syan Chen,et al.  Efficient Kernel Approximation for Large-Scale Support Vector Machine Classification , 2011, SDM.

[17]  Alexander J. Smola,et al.  Fastfood - Computing Hilbert Space Expansions in loglinear time , 2013, ICML.

[18]  S. Saitoh Integral Transforms, Reproducing Kernels and Their Applications , 1997 .

[19]  Harish Karnick,et al.  Random Feature Maps for Dot Product Kernels , 2012, AISTATS.

[20]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[21]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[22]  Hui Cao,et al.  Approximate RBF Kernel SVM and Its Applications in Pedestrian Classification , 2008 .

[23]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[24]  Nathan Srebro,et al.  Explicit Approximations of the Gaussian Kernel , 2011, ArXiv.

[25]  Ondrej Chum Low Dimensional Explicit Feature Maps , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Shiliang Sun,et al.  A review of Nyström methods for large-scale machine learning , 2015, Inf. Fusion.

[27]  Andrew Zisserman,et al.  Efficient Additive Kernels via Explicit Feature Maps , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Don R. Hush,et al.  An Explicit Description of the Reproducing Kernel Hilbert Spaces of Gaussian RBF Kernels , 2006, IEEE Transactions on Information Theory.

[29]  Christopher J. C. Burges,et al.  Simplified Support Vector Decision Rules , 1996, ICML.

[30]  Subhransu Maji,et al.  Efficient Classification for Additive Kernel SVMs , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[32]  Cristian Sminchisescu,et al.  Random Fourier Approximations for Skewed Multiplicative Histogram Kernels , 2010, DAGM-Symposium.

[33]  C. V. Jawahar,et al.  Generalized RBF feature maps for Efficient Detection , 2010, BMVC.

[34]  Ingo Steinwart,et al.  Sparseness of Support Vector Machines , 2003, J. Mach. Learn. Res..

[35]  Tom Downs,et al.  Exact Simplification of Support Vector Solutions , 2002, J. Mach. Learn. Res..

[36]  Matthias W. Seeger,et al.  Using the Nyström Method to Speed Up Kernel Machines , 2000, NIPS.

[37]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[38]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[39]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.