Reconciling Real Scores with Binary Comparisons: A New Logistic Based Model for Ranking

The problem of ranking arises ubiquitously in almost every aspect of life, and in particular in Machine Learning/Information Retrieval. A statistical model for ranking predicts how humans rank subsets V of some universe U . In this work we define a statistical model for ranking that satisfies certain desirable properties. The model automatically gives rise to a logistic regression based approach to learning how to rank, for which the score and comparison based approaches are dual views. This offers a new generative approach to ranking which can be used for IR. There are two main contexts for this work. The first is the theory of econometrics and study of statistical models explaining human choice of alternatives. In this context, we will compare our model with other well known models. The second context is the problem of ranking in machine learning, usually arising in the context of information retrieval. Here, much work has been done in the discriminative setting, where different heuristics are used to define ranking risk functions. Our model is built rigorously and axiomatically based on very simple desirable properties defined locally for comparisons, and automatically implies the existence of a global score function serving as a natural model parameter which can be efficiently fitted to pairwise comparison judgment data by solving a convex optimization problem.

[1]  Mehryar Mohri,et al.  Magnitude-preserving ranking algorithms , 2007, ICML '07.

[2]  K. Arrow A Difficulty in the Concept of Social Welfare , 1950, Journal of Political Economy.

[3]  J. Marden Analyzing and Modeling Rank Data , 1996 .

[4]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[5]  R. Plackett The Analysis of Permutations , 1975 .

[6]  C. L. Mallows,et al.  Individual Choice Behaviour. , 1961 .

[7]  Tong Zhang,et al.  Subset Ranking Using Regression , 2006, COLT.

[8]  Mehryar Mohri,et al.  An Alternative Ranking Problem for Search Engines , 2007, WEA.

[9]  Koby Crammer,et al.  Pranking with Ranking , 2001, NIPS.

[10]  John Langford,et al.  Robust Reductions from Ranking to Classification , 2007, COLT.

[11]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[12]  David P. Williamson,et al.  Deterministic Algorithms for Rank Aggregation and Other Ranking and Clustering Problems , 2007, WAOA.

[13]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[14]  E. Lehmann,et al.  Nonparametrics: Statistical Methods Based on Ranks , 1976 .

[15]  Jerry A. Hausman,et al.  Assessing the potential demand for electric cars , 1981 .

[16]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[17]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[18]  D. Ariely,et al.  “Coherent Arbitrariness”: Stable Demand Curves Without Stable Preferences , 2003 .

[19]  Claire Mathieu,et al.  Electronic Colloquium on Computational Complexity, Report No. 144 (2006) How to rank with few errors A PTAS for Weighted Feedback Arc Set on Tournaments , 2006 .

[20]  Ralf Herbrich,et al.  Large margin rank boundaries for ordinal regression , 2000 .

[21]  Shivani Agarwal,et al.  Stability and Generalization of Bipartite Ranking Algorithms , 2005, COLT.

[22]  C. L. Mallows NON-NULL RANKING MODELS. I , 1957 .

[23]  Mehryar Mohri,et al.  An Efficient Reduction of Ranking to Classification , 2007, COLT.

[24]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS , 1952 .

[25]  Ronald Fagin,et al.  Comparing and aggregating rankings with ties , 2004, PODS '04.

[26]  Peter McCullagh,et al.  Permutations and Regression Models , 1993 .

[27]  Cynthia Rudin,et al.  Margin-Based Ranking Meets Boosting in the Middle , 2005, COLT.

[28]  Nir Ailon,et al.  A Simple Linear Ranking Algorithm Using Query Dependent Intercept Variables , 2008, ECIR.

[29]  Yoram Singer,et al.  Learning to Order Things , 1997, NIPS.

[30]  Nir Ailon,et al.  Aggregation of Partial Rankings, p-Ratings and Top-m Lists , 2007, SODA '07.

[31]  John D. Lafferty,et al.  Cranking: Combining Rankings Using Conditional Probability Models on Permutations , 2002, ICML.

[32]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..