论文信息 - Optimizing ψ-learning via mixed integer programming

Optimizing ψ-learning via mixed integer programming

As a new margin-based classier, -learning shows great potential for high accuracy. However, the optimization of -learning involves non-convex min- imization and is very challenging to implement. In this article, we convert the optimization of -learning into a mixed integer programming (MIP) problem. This enables us to utilize the state-of-art algorithm of MIP to solve -learning. More- over, the new algorithm can solve -learning with a general piecewise linear loss and does not require continuity of the loss function. We also examine the variable selection property of 1-norm -learning and make comparisons with the SVM.

Yufeng Liu | Yichao Wu | Yufeng Liu | Yichao Wu

[1] Robert Tibshirani,et al. 1-norm Support Vector Machines , 2003, NIPS.

[2] Yufeng Liu,et al. Multicategory ψ-Learning , 2006 .

[3] Yufeng Liu,et al. Multicategory ψ-Learning and Support Vector Machine: Computational Tools , 2005 .

[4] Olvi L. Mangasarian,et al. Nuclear feature extraction for breast tumor diagnosis , 1993, Electronic Imaging.

[5] W. Wong,et al. On ψ-Learning , 2003 .

[6] G. Wahba,et al. Some results on Tchebycheffian spline functions , 1971 .

[7] Charles E. Blair,et al. Integer and Combinatorial Optimization (George L. Nemhauser and Laurence A. Wolsey) , 1990, SIAM Rev..

[8] Laurence A. Wolsey,et al. Integer and Combinatorial Optimization , 1988 .

[9] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.

[10] G. Nemhauser,et al. Integer Programming , 2020 .

[11] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[12] Xiaotong Shen,et al. Computational Developments of ψ-learning , 2005, SDM.

[13] Brian W. Kernighan,et al. AMPL: A Modeling Language for Mathematical Programming , 1993 .

[14] Takashi Takenouchi,et al. Statistical Learning Theory by Boosting Method , 2004 .

[15] Grace Wahba,et al. Spline Models for Observational Data , 1990 .

[16] Paul S. Bradley,et al. Feature Selection via Concave Minimization and Support Vector Machines , 1998, ICML.

[17] G. Wahba. Support vector machines, reproducing kernel Hilbert spaces, and randomized GACV , 1999 .

[18] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .