An efficient method for learning nonlinear ranking SVM functions

The problem of learning ranking (or preference) functions has become important in recent years as various applications have been found in information retrieval. Among the rank learning methods, RankSVM has been favorably used in various applications, e.g., optimizing search engines and improving data retrieval quality. Fast learning methods for linear RankSVM (RankSVM with a linear kernel) have been extensively developed, whereas methods for nonlinear RankSVM (RankSVM with nonlinear kernels) are lacking. This paper proposes an efficient method for learning with nonlinear kernels, called Ranking Vector SVM (RV-SVM). RV-SVM utilizes training vectors rather than pairwise difference vectors to determine the support vectors, and is thus faster to train than conventional RankSVMs. Experimental comparisons with the state-of-the-art RankSVM implementation provided in SVM-light show that RV-SVM is substantially faster for nonlinear kernels, although our method is slower for linear kernels. RV-SVM also uses far fewer support vectors, and thus the trained models are much simpler than those built by RankSVMs while maintaining comparable accuracy. Our implementation of RV-SVM is accessible at http://dm.hwanjoyu.org/rv-svm.

[1]  Xizhao Wang,et al.  Fast Fuzzy Multicategory SVM Based on Support Vector Domain Description , 2008, Int. J. Pattern Recognit. Artif. Intell..

[2]  Xi-Zhao Wang,et al.  Improving Generalization of Fuzzy IF--THEN Rules by Maximizing Fuzzy Entropy , 2009, IEEE Transactions on Fuzzy Systems.

[3]  Olvi L. Mangasarian,et al.  Generalized Support Vector Machines , 1998 .

[4]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[5]  Richard Weber,et al.  Simultaneous feature selection and classification using kernel-penalized support vector machines , 2011, Inf. Sci..

[6]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[7]  Qiang Yang,et al.  Feature selection in a kernel space , 2007, ICML '07.

[8]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[9]  Juan Manuel Górriz,et al.  Computer-aided diagnosis of Alzheimer's type dementia combining support vector machines and discriminant set of features , 2013, Inf. Sci..

[10]  Seung-won Hwang,et al.  Exact indexing for support vector machines , 2011, SIGMOD '11.

[11]  Tao Qin,et al.  Ranking with multiple hyperplanes , 2007, SIGIR.

[12]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[13]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[14]  Congxin Wu,et al.  Separating theorem of samples in Banach space for support vector machine learning , 2011, Int. J. Mach. Learn. Cybern..

[15]  Ralf Herbrich,et al.  Large margin rank boundaries for ordinal regression , 2000 .

[16]  H. Yu,et al.  Discovering compact and highly discriminative features or combinations of drug activities using support vector machines , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[17]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[18]  Glenn Fung,et al.  A Feature Selection Newton Method for Support Vector Machine Classification , 2004, Comput. Optim. Appl..

[19]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[20]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[21]  Gang Kou,et al.  Feature Selection for Nonlinear Kernel Support Vector Machines , 2007 .

[22]  Matthias Baumgarten,et al.  Optimal model selection for posture recognition in home-based healthcare , 2011, Int. J. Mach. Learn. Cybern..

[23]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[24]  Seung-won Hwang,et al.  RV-SVM: An Efficient Method for Learning Ranking SVM , 2009, PAKDD.

[25]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[26]  Francisco Sepulveda,et al.  Classifying mental tasks based on features of higher-order statistics from EEG signals in brain-computer interface , 2008, Inf. Sci..

[27]  Robert Tibshirani,et al.  1-norm Support Vector Machines , 2003, NIPS.

[28]  Sun I. Kim,et al.  Nonlinear Support Vector Machine Visualization for Risk Factor Analysis Using Nomograms and Localized Radial Basis Function Kernels , 2008, IEEE Transactions on Information Technology in Biomedicine.

[29]  C. L. Philip Chen,et al.  Adaptive least squares support vector machines filter for hand tremor canceling in microsurgery , 2011, Int. J. Mach. Learn. Cybern..

[30]  Xizhao Wang,et al.  Induction of multiple fuzzy decision trees based on rough set technique , 2008, Inf. Sci..

[31]  Juan Manuel Górriz,et al.  18F-FDG PET imaging analysis for computer aided Alzheimer's diagnosis , 2011, Inf. Sci..

[32]  Olvi L. Mangasarian,et al.  Exact 1-Norm Support Vector Machines Via Unconstrained Convex Differentiable Minimization , 2006, J. Mach. Learn. Res..

[33]  Hang Li,et al.  AdaRank: a boosting algorithm for information retrieval , 2007, SIGIR.

[34]  Xizhao Wang,et al.  Maximum Ambiguity-Based Sample Selection in Fuzzy Decision Tree Induction , 2012, IEEE Transactions on Knowledge and Data Engineering.

[35]  Seung-won Hwang,et al.  Enabling soft queries for data retrieval , 2007, Inf. Syst..

[36]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[37]  Yuh-Jye Lee,et al.  RSVM: Reduced Support Vector Machines , 2001, SDM.

[38]  Bernhard Schölkopf,et al.  A Generalized Representer Theorem , 2001, COLT/EuroCOLT.

[39]  Hwanjo Yu,et al.  SVM selective sampling for ranking with application to data retrieval , 2005, KDD '05.

[40]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[41]  Yuanyuan Wang,et al.  A rough margin based support vector machine , 2008, Inf. Sci..