A P-ADMM for sparse quadratic kernel-free least squares semi-supervised support vector machine

Abstract In this paper, we propose a sparse quadratic kernel-free least squares semi-supervised support vector machine model by adding an L1 norm regularization term to the objective function and using the least squares method, which results in a nonconvex and nonsmooth quadratic programming problem. For computational considerations, we use the smoothing technique and consensus technique. Then we adopt the proximal alternating direction method of multipliers (P-ADMM) to solve it, as well as propose a strategy of parameter selection. Then we not only derive the convergence analysis of algorithm, but also estimate the convergence rate as o ( 1 / k ) , where k is the number of iteration. This gives the best bound of P-ADMM known so far for nonconvex consensus problem. To demonstrate the efficiency of our model, we compare the proposed method with several state-of-the-art methods. The numerical results show that our model can achieve both better accuracy and sparsity.

[1]  Jason Weston,et al.  Large Scale Transductive SVMs , 2006, J. Mach. Learn. Res..

[2]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[3]  Issam Dagher,et al.  Quadratic kernel-free non-linear support vector machine , 2008, J. Glob. Optim..

[4]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[5]  Yuan-Hai Shao,et al.  Least squares recursive projection twin support vector machine for classification , 2012, Pattern Recognit..

[6]  Xin Yan,et al.  Conic Relaxations for Semi-supervised Support Vector Machines , 2016, J. Optim. Theory Appl..

[7]  Andreas Fischer,et al.  Efficient optimization of hyper-parameters for least squares support vector regression , 2015, Optim. Methods Softw..

[8]  Shu-Cherng Fang,et al.  A kernel-free quadratic surface support vector machine for semi-supervised learning , 2016, J. Oper. Res. Soc..

[9]  Zhi-Quan Luo,et al.  Convergence analysis of alternating direction method of multipliers for a family of nonconvex problems , 2014, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Wotao Yin,et al.  Parallel Multi-Block ADMM with o(1 / k) Convergence , 2013, Journal of Scientific Computing.

[11]  S. Sathiya Keerthi,et al.  Branch and Bound for Semi-Supervised Support Vector Machines , 2006, NIPS.

[12]  Ya-Xiang Yuan,et al.  A parallel decomposition algorithm for training multiclass kernel-based vector machines , 2011, Optim. Methods Softw..

[13]  Shu-Cherng Fang,et al.  FUZZY QUADRATIC SURFACE SUPPORT VECTOR MACHINE BASED ON FISHER DISCRIMINANT ANALYSIS , 2015 .

[14]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[15]  Alexander Zien,et al.  Semi-Supervised Classification by Low Density Separation , 2005, AISTATS.

[16]  Fei Wang,et al.  Cuts3vm: a fast semi-supervised svm algorithm , 2008, KDD.

[17]  S. Sathiya Keerthi,et al.  Deterministic annealing for semi-supervised kernel machines , 2006, ICML.

[19]  O. Burdakov,et al.  Global Search Strategies for Solving Multilinear Least-squares Problems , 2012 .

[20]  Hua Yu,et al.  Quadratic kernel-free least squares support vector machine for target diseases classification , 2015, Journal of Combinatorial Optimization.