Robust Support Vector Regression with Generalized Loss Function and Applications

The classical support vector machine (SVM) is sensitive to outliers. This paper proposes a robust support vector regression based on a generalized non-convex loss function with flexible slope and margin. The robust model is more flexible for regression estimation. Meanwhile, it has strong ability of suppressing the impact of outliers. The generalized loss function is neither convex nor differentiable. We approximate it by combining two differentiable Huber functions, and the resultant optimization problem is a difference of convex functions (d.c.) program. We develop a Newton algorithm to solve the robust model. The numerical experiments on benchmark datasets, financial time series datasets and document retrieval dataset confirm the robustness and effectiveness of the proposed method. It also reduces the downside risk in financial time series prediction, and significantly outperforms ranking SVM for performing real information retrieval tasks.

[1]  Licheng Jiao,et al.  Recursive Finite Newton Algorithm for Support Vector Regression in the Primal , 2007, Neural Computation.

[2]  Koby Crammer,et al.  Robust Support Vector Machine Training via Convex Outlier Ablation , 2006, AAAI.

[3]  Lai-Wan Chan,et al.  Support Vector Machine Regression for Volatile Stock Market Prediction , 2002, IDEAL.

[4]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[5]  Meng Wang,et al.  Semisupervised Multiview Distance Metric Learning for Cartoon Synthesis , 2012, IEEE Transactions on Image Processing.

[6]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[7]  Le Thi Hoai An,et al.  A D.C. Optimization Algorithm for Solving the Trust-Region Subproblem , 1998, SIAM J. Optim..

[8]  Le Thi Hoai An,et al.  The DC (Difference of Convex Functions) Programming and DCA Revisited with DC Models of Real World Nonconvex Optimization Problems , 2005, Ann. Oper. Res..

[9]  Jianguo Sun,et al.  Robust support vector regression in the primal , 2008, Neural Networks.

[10]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[11]  Bo Geng,et al.  Manifold Regularized Multi-task Learning for Semi-supervised Multi-label Image Classification , 2013 .

[12]  Gert R. G. Lanckriet,et al.  On the Convergence of the Concave-Convex Procedure , 2009, NIPS.

[13]  Jun Yu,et al.  Modern Machine Learning Techniques and Their Applications in Cartoon Animation Research , 2013 .

[14]  Cheng Wu,et al.  Robust Support Vector Regression for Uncertain Input and Output Data , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Ping Zhong,et al.  Training robust support vector regression with smooth non-convex loss function , 2012, Optim. Methods Softw..

[16]  Jason Weston,et al.  Trading convexity for scalability , 2006, ICML.

[17]  Xinjun Peng,et al.  TSVR: An efficient Twin Support Vector Machine for regression , 2010, Neural Networks.

[18]  Jun Yu,et al.  Complex Object Correspondence Construction in Two-Dimensional Animation , 2011, IEEE Transactions on Image Processing.

[19]  Yufeng Liu,et al.  Robust Truncated Hinge Loss Support Vector Machines , 2007 .

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[22]  Tao Qin,et al.  LETOR: A benchmark collection for research on learning to rank for information retrieval , 2010, Information Retrieval.

[23]  W. Gander,et al.  A D.C. OPTIMIZATION ALGORITHM FOR SOLVING THE TRUST-REGION SUBPROBLEM∗ , 1998 .

[24]  Ping Zhong,et al.  Training Robust Support Vector Regression via D. C. Program , 2010 .

[25]  Olivier Chapelle,et al.  Training a Support Vector Machine in the Primal , 2007, Neural Computation.

[26]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[27]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[28]  Johan A. K. Suykens,et al.  Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.

[29]  Meng Wang,et al.  Adaptive Hypergraph Learning and its Application in Image Classification , 2012, IEEE Transactions on Image Processing.

[30]  Yitian Xu,et al.  A weighted twin support vector regression , 2012, Knowl. Based Syst..

[31]  Yong Luo,et al.  Multiview Vector-Valued Manifold Regularization for Multilabel Image Classification , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[32]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[33]  H. P. Huang,et al.  Fuzzy Support Vector Machines for Pattern Recognition and Data Mining , 2002 .

[34]  Masao Fukushima,et al.  Second-Order Cone Programming Formulations for Robust Multiclass Classification , 2007, Neural Computation.

[35]  Federico Girosi,et al.  An improved training algorithm for support vector machines , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[36]  Alan L. Yuille,et al.  The Concave-Convex Procedure , 2003, Neural Computation.

[37]  Jie Li,et al.  Training robust support vector machine with smooth Ramp loss in the primal space , 2008, Neurocomputing.

[38]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[39]  Weifeng Liu,et al.  Multiview Hessian Regularization for Image Annotation , 2013, IEEE Transactions on Image Processing.

[40]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[41]  Cheng Soon Ong,et al.  Learning sparse classifiers with difference of convex functions algorithms , 2013, Optim. Methods Softw..

[42]  Theodore B. Trafalis,et al.  Robust classification and regression using support vector machines , 2006, Eur. J. Oper. Res..

[43]  Xian-Sheng Hua,et al.  Content-aware Ranking for visual search , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[44]  Feng-Li Lian,et al.  Robust classifier learning with fuzzy class labels for large-margin support vector machines , 2013, Neurocomputing.