D.C. programming for sparse proximal support vector machines

Abstract Proximal support vector machine (PSVM), as a variant of support vector machine (SVM), is to generate a pair of non-parallel hyperplanes for classification. Although PSVM is one of the powerful classification tools, its ability on feature selection is still weak. To overcome this defect, we introduce l 0 -norm regularization in PSVM which enables PSVM to select important features and remove redundant features simultaneously for classification. This PSVM is called as a sparse proximal support vector machine (SPSVM). Due to the presence of l 0 -norm, the resulting optimization problem of SPSVM is neither convex nor smooth and thus, is difficult to solve. In this paper, we introduce a continuous nonconvex function to approximate l 0 -norm, and propose a novel difference of convex functions algorithms (DCA) to solve SPSVM. The main merit of the proposed method is that all subproblems are smooth and admit closed form solutions. The effectiveness of the proposed method is illustrated by theoretical analysis as well as some numerical experiments on both simulation datasets and real world datasets.

[1]  Romain Briandet,et al.  Discrimination of Arabica and Robusta in Instant Coffee by Fourier Transform Infrared Spectroscopy and Chemometrics , 1996 .

[2]  Panos M. Pardalos,et al.  A classification method based on generalized eigenvalue problems , 2007, Optim. Methods Softw..

[3]  Julio López,et al.  Double regularization methods for robust feature selection and SVM classification via DC programming , 2018, Inf. Sci..

[4]  Zeshui Xu,et al.  Bibliometric analysis of support vector machines research trend: a case study in China , 2019, International Journal of Machine Learning and Cybernetics.

[5]  Hoai An Le Thi,et al.  DC programming and DCA: thirty years of developments , 2018, Mathematical Programming.

[6]  Yuan-Hai Shao,et al.  Generalized elastic net Lp-norm nonparallel support vector machine , 2020, Eng. Appl. Artif. Intell..

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  Haofeng Zhang,et al.  Distance difference and linear programming nonparallel plane classifier , 2011, Expert Syst. Appl..

[9]  Olvi L. Mangasarian,et al.  Exact 1-Norm Support Vector Machines Via Unconstrained Convex Differentiable Minimization , 2006, J. Mach. Learn. Res..

[10]  Reshma Khemchandani,et al.  Twin Support Vector Machines for Pattern Classification , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Amir Ali Ahmadi,et al.  DC decomposition of nonconvex polynomials with algebraic techniques , 2015, Math. Program..

[12]  Korris Fu-Lai Chung,et al.  Extreme vector machine for fast training on large data , 2020, Int. J. Mach. Learn. Cybern..

[13]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[14]  Zhihui Lai,et al.  Structured optimal graph based sparse feature extraction for semi-supervised learning , 2020, Signal Process..

[15]  Bernhard Schölkopf,et al.  Use of the Zero-Norm with Linear Models and Kernel Methods , 2003, J. Mach. Learn. Res..

[16]  Hiroshi Mamitsuka,et al.  Selecting features in microarray classification using ROC curves , 2006, Pattern Recognit..

[17]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[18]  Xinjun Peng,et al.  TPMSVM: A novel twin parametric-margin support vector machine for pattern recognition , 2011, Pattern Recognit..

[19]  T. P. Dinh,et al.  Convex analysis approach to d.c. programming: Theory, Algorithm and Applications , 1997 .

[20]  María José del Jesús,et al.  KEEL: a software tool to assess evolutionary algorithms for data mining problems , 2008, Soft Comput..

[21]  Jun Zhang,et al.  Sparse and heuristic support vector machine for binary classifier and regressor fusion , 2019, International Journal of Machine Learning and Cybernetics.

[22]  Yuan-Hai Shao,et al.  Improvements on Twin Support Vector Machines , 2011, IEEE Transactions on Neural Networks.

[23]  Pei-Yi Hao,et al.  New support vector algorithms with parametric insensitive/margin model , 2010, Neural Networks.

[24]  Xi-Zhao Wang,et al.  Intuitionistic Fuzzy Twin Support Vector Machines , 2019, IEEE Transactions on Fuzzy Systems.

[25]  Panos M. Pardalos,et al.  Sparse Proximal Support Vector Machines for feature selection in high dimensional datasets , 2015, Expert Syst. Appl..

[26]  Julio López,et al.  Alternative second-order cone programming formulations for support vector classification , 2014, Inf. Sci..

[27]  Yuan-Hai Shao,et al.  Robust L1-norm non-parallel proximal support vector machine , 2016 .

[28]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Changzhi Wu,et al.  A DC PROGRAMMING APPROACH FOR SENSOR NETWORK LOCALIZATION WITH UNCERTAINTIES IN ANCHOR POSITIONS , 2013 .

[30]  Madan Gopal,et al.  Application of smoothing technique on twin support vector machines , 2008, Pattern Recognit. Lett..

[31]  Olvi L. Mangasarian,et al.  Multisurface proximal support vector machine classification via generalized eigenvalues , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Xinjun Peng,et al.  An $$l_1$$l1-norm loss based twin support vector regression and its geometric extension , 2018, Int. J. Mach. Learn. Cybern..

[33]  Kevin Baker,et al.  Classification of radar returns from the ionosphere using neural networks , 1989 .

[34]  Zhiquan Qi,et al.  Efficient sparse nonparallel support vector machines for classification , 2014, Neural Computing and Applications.

[35]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[36]  Yuan-Hai Shao,et al.  Improved Generalized Eigenvalue Proximal Support Vector Machine , 2013, IEEE Signal Processing Letters.

[37]  Yuan-Hai Shao,et al.  Sparse Lq-norm least squares support vector machine with feature selection , 2018, Pattern Recognit..

[38]  Ionut Emil Iacob,et al.  DCSVM: fast multi-class classification using support vector machines , 2018, International Journal of Machine Learning and Cybernetics.

[39]  Lidong Wang,et al.  Wavelet transform-based weighted $$\nu$$ν-twin support vector regression , 2019, Int. J. Mach. Learn. Cybern..

[40]  Yanqin Bai,et al.  New SDP models for protein homology detection with semi-supervised SVM , 2013 .