Trust Region Levenberg-Marquardt Method for Linear SVM

Support Vector Machine is an optimal margin based classification technique in Machine Learning. In this paper, we have proposed Trust Region Levenberg-Marquardt (TRLM) method as a novel problem solver for L2 regularized L2 loss (L2RL2) primal SVM classification problem. Levenberg-Marquardt (LM) method is an extension of Gauss-Newton method for solving least squares non-linear optimization problems and Trust-Region (TR) method is used to find the step size. In LM method, LM parameter $A$ is changed by an arbitrary factor but in TRLM instead of changing A, the trust region radius Δ is changed. The proposed solver for L2RL2 primal SVM, performs well with medium sized problems. Experimental results establish TRLM as a solver for linear SVM as it performs at par and better in selective cases than existing state of the art solvers in terms of test accuracy.

[1]  Li Zhang,et al.  Fisher-regularized support vector machine , 2016, Inf. Sci..

[2]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[3]  Chih-Jen Lin,et al.  A Study on L2-Loss (Squared Hinge-Loss) Multiclass SVM , 2013, Neural Computation.

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[7]  D. Sorensen Newton's method with a model trust region modification , 1982 .

[8]  Chih-Jen Lin,et al.  Decomposition Methods for Linear Support Vector Machines , 2004, Neural Computation.

[9]  Chih-Jen Lin,et al.  Trust Region Newton Method for Logistic Regression , 2008, J. Mach. Learn. Res..

[10]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[11]  S. Sathiya Keerthi,et al.  A Modified Finite Newton Method for Fast Solution of Large Scale Linear SVMs , 2005, J. Mach. Learn. Res..

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Koby Crammer,et al.  On the Learnability and Design of Output Codes for Multiclass Problems , 2002, Machine Learning.

[14]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[15]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[16]  Ananth Ranganathan,et al.  The Levenberg-Marquardt Algorithm , 2004 .

[17]  Nathan L. Gibson Gradient-based Methods for Optimization. Part I. , 2011 .

[18]  Alexander J. Smola,et al.  Bundle Methods for Machine Learning , 2007, NIPS.

[19]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..