An Alternative Ranking Problem for Search Engines

This paper examines in detail an alternative ranking problem for search engines, movie recommendation, and other similar ranking systems motivated by the requirement to not just accurately predict pairwise ordering but also preserve the magnitude of the preferences or the difference between ratings. We describe and analyze several cost functions for this learning problem and give stability bounds for their generalization error, extending previously known stability results to nonbipartite ranking and magnitude of preference-preserving algorithms. We present algorithms optimizing these cost functions, and, in one instance, detail both a batch and an on-line version. For this algorithm, we also show how the leave-one-out error can be computed and approximated efficiently, which can be used to determine the optimal values of the trade-off parameter in the cost function. We report the results of experiments comparing these algorithms on several datasets and contrast them with those obtained using an AUC-maximization algorithm. We also compare training times and performance results for the on-line and batch versions, demonstrating that our on-line algorithm scales to relatively large datasets with no significant loss in accuracy.

[1]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[2]  André Elisseeff,et al.  Algorithmic Stability and Generalization Performance , 2000, NIPS.

[3]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[4]  Grace Wahba,et al.  Spline Models for Observational Data , 1990 .

[5]  Mehryar Mohri,et al.  Magnitude-preserving ranking algorithms , 2007, ICML '07.

[6]  Ralf Herbrich,et al.  Large margin rank boundaries for ordinal regression , 2000 .

[7]  Cynthia Rudin,et al.  Margin-Based Ranking Meets Boosting in the Middle , 2005, COLT.

[8]  Amnon Shashua,et al.  Ranking with Large Margin Principle: Two Approaches , 2002, NIPS.

[9]  M. Habib Probabilistic methods for algorithmic discrete mathematics , 1998 .

[10]  James Bennett,et al.  The Netflix Prize , 2007 .

[11]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[12]  Koby Crammer,et al.  Pranking with Ranking , 2001, NIPS.

[13]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[14]  Shivani Agarwal,et al.  Stability and Generalization of Bipartite Ranking Algorithms , 2005, COLT.

[15]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Thorsten Joachims,et al.  Evaluating Retrieval Performance Using Clickthrough Data , 2003, Text Mining.

[18]  André Elisseeff,et al.  Stability and Generalization , 2002, J. Mach. Learn. Res..

[19]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[20]  Wei Chu,et al.  New approaches to support vector ordinal regression , 2005, ICML.