Budget Online Learning Algorithm for Least Squares SVM

Batch-mode least squares support vector machine (LSSVM) is often associated with unbounded number of support vectors (SVs’), making it unsuitable for applications involving large-scale streaming data. Limited-scale LSSVM, which allows efficient updating, seems to be a good solution to tackle this issue. In this paper, to train the limited-scale LSSVM dynamically, we present a budget online LSSVM (BOLSSVM) algorithm. Methodologically, by setting a fixed budget for SVs’, we are able to update the LSSVM model according to the updated SVs’ set dynamically without retraining from scratch. In particular, when a new small chunk of SVs’ substitute for the old ones, the proposed algorithm employs a low rank correction technology and the Sherman–Morrison–Woodbury formula to compute the inverse of saddle point matrix derived from the LSSVM’s Karush-Kuhn-Tucker (KKT) system, which, in turn, updates the LSSVM model efficiently. In this way, the proposed BOLSSVM algorithm is especially useful for online prediction tasks. Another merit of the proposed BOLSSVM is that it can be used for <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula>-fold cross validation. Specifically, compared with batch-mode learning methods, the computational complexity of the proposed BOLSSVM method is significantly reduced from <inline-formula> <tex-math notation="LaTeX">$\mathcal {O}(n^{4})$ </tex-math></inline-formula> to <inline-formula> <tex-math notation="LaTeX">$\mathcal {O}(n^{3})$ </tex-math></inline-formula> for leave-one-out cross validation with <inline-formula> <tex-math notation="LaTeX">$n$ </tex-math></inline-formula> training samples. The experimental results of classification and regression on benchmark data sets and real-world applications show the validity and effectiveness of the proposed BOLSSVM algorithm.

[1]  Yoram Singer,et al.  The Forgetron: A Kernel-Based Perceptron on a Budget , 2008, SIAM J. Comput..

[2]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[3]  Zhengguo Li,et al.  Model-Based Online Learning With Kernels , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[5]  Jie Chen,et al.  Online Dictionary Learning for Kernel LMS , 2014, IEEE Transactions on Signal Processing.

[6]  Mark Girolami,et al.  Orthogonal Series Density Estimation and the Kernel Eigenvalue Problem , 2002, Neural Computation.

[7]  Zhengrong Xiang,et al.  Online training algorithm research based on improved weighted LSSVM , 2010, Proceedings of the 29th Chinese Control Conference.

[8]  Shie Mannor,et al.  The kernel recursive least-squares algorithm , 2004, IEEE Transactions on Signal Processing.

[9]  JianghuaLiu,et al.  Online LS-SVM for function estimation and classification , 2003 .

[10]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[11]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[12]  Koby Crammer,et al.  Breaking the curse of kernelization: budgeted stochastic gradient descent for large-scale SVM training , 2012, J. Mach. Learn. Res..

[13]  Okan K. Ersoy,et al.  Recursive Update Algorithm for Least Squares Support Vector Machines , 2004, Neural Processing Letters.

[14]  William W. Hager,et al.  Updating the Inverse of a Matrix , 1989, SIAM Rev..

[15]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[16]  Weifeng Liu,et al.  The Kernel Least-Mean-Square Algorithm , 2008, IEEE Transactions on Signal Processing.

[17]  Johan A. K. Suykens,et al.  Sparse approximation using least squares support vector machines , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[18]  Slobodan Vucetic,et al.  Online training on a budget of support vector machines using twin prototypes , 2010 .

[19]  Yuichi Motai,et al.  Kernel Association for Classification and Prediction: A Survey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Gavin C. Cawley,et al.  Fast exact leave-one-out cross-validation of sparse least-squares support vector machines , 2004, Neural Networks.

[21]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[22]  Xiao-hu Zhao,et al.  Study of a new online Least Squares Support Vector Machine algorithm in gas prediction , 2008, 2008 6th IEEE International Conference on Industrial Informatics.

[23]  Hsuan-Tien Lin A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods , 2005 .

[24]  Chuanhou Gao,et al.  Data-Driven Time Discrete Models for Dynamic Prediction of the Hot Metal Silicon Content in the Blast Furnace—A Review , 2013, IEEE Transactions on Industrial Informatics.

[25]  Slobodan Vucetic,et al.  BudgetedSVM: a toolbox for scalable SVM approximations , 2013, J. Mach. Learn. Res..

[26]  Alexander J. Smola,et al.  Online learning with kernels , 2001, IEEE Transactions on Signal Processing.

[27]  Steven C. H. Hoi,et al.  Fast Bounded Online Gradient Descent Algorithms for Scalable Kernel-Based Online Learning , 2012, ICML.

[28]  Claudio Gentile,et al.  Tracking the best hyperplane with a simple budget Perceptron , 2006, Machine Learning.

[29]  Terrance E. Boult,et al.  Reliable Posterior Probability Estimation for Streaming Face Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[30]  Chuanhou Gao,et al.  Binary Coding SVMs for the Multiclass Problem of Blast Furnace System , 2013, IEEE Transactions on Industrial Electronics.

[31]  Barbara Caputo,et al.  The projectron: a bounded kernel-based Perceptron , 2008, ICML '08.

[32]  Badong Chen,et al.  Online efficient learning with quantized KLMS and L1 regularization , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[33]  S. Sathiya Keerthi,et al.  Efficient tuning of SVM hyperparameters using radius/margin bound and iterative algorithms , 2002, IEEE Trans. Neural Networks.

[34]  Dale Schuurmans,et al.  implicit Online Learning with Kernels , 2006, NIPS.

[35]  Wei Chu,et al.  An improved conjugate gradient scheme to the solution of least squares SVM , 2005, IEEE Transactions on Neural Networks.

[36]  Koby Crammer,et al.  Online Classification on a Budget , 2003, NIPS.