MRA-based Statistical Learning from Incomplete Rankings

Statistical analysis of rank data describing preferences over small and variable subsets of a potentially large ensemble of items {1,..., n} is a very challenging problem. It is motivated by a wide variety of modern applications, such as recommender systems or search engines. However, very few inference methods have been documented in the literature to learn a ranking model from such incomplete rank data. The goal of this paper is twofold: it develops a rigorous mathematical framework for the problem of learning a ranking model from incomplete rankings and introduces a novel general statistical method to address it. Based on an original concept of multiresolution analysis (MRA) of incomplete rankings, it finely adapts to any observation setting, leading to a statistical accuracy and an algorithmic complexity that depend directly on the complexity of the observed data. Beyond theoretical guarantees, we also provide experimental results that show its statistical performance.

[1]  Eyke Hüllermeier,et al.  Preference-Based Rank Elicitation using Statistical Models: The Case of Mallows , 2014, ICML.

[2]  Craig Boutilier,et al.  Learning Mallows Models with Pairwise Preferences , 2011, ICML.

[3]  Mingxuan Sun,et al.  Estimating probabilities in recommendation systems , 2010, AISTATS.

[4]  R. Duncan Luce,et al.  Individual Choice Behavior , 1959 .

[5]  Carlos Guestrin,et al.  Riffled Independence for Ranked Data , 2009, NIPS.

[6]  Ramakrishna Kakarala A Signal Processing Approach to Fourier Analysis of Ranking Data: The Importance of Phase , 2011, IEEE Transactions on Signal Processing.

[7]  Arun Rajkumar,et al.  A Statistical Convergence Perspective of Algorithms for Rank Aggregation from Pairwise Data , 2014, ICML.

[8]  P. Diaconis Group representations in probability and statistics , 1988 .

[9]  D. Critchlow Metric Methods for Analyzing Partially Ranked Data , 1986 .

[10]  D. Hunter MM algorithms for generalized Bradley-Terry models , 2003 .

[11]  Yi Mao,et al.  Non-parametric Modeling of Partially Ranked Data , 2007, NIPS.

[12]  Risi Kondor,et al.  Ranking with Kernels in Fourier space. , 2010, COLT 2010.

[13]  Leonidas J. Guibas,et al.  Fourier Theoretic Probabilistic Inference over Permutations , 2009, J. Mach. Learn. Res..

[14]  K. Pearson,et al.  Biometrika , 1902, The American Naturalist.

[15]  David C. Parkes,et al.  Generalized Method-of-Moments for Rank Aggregation , 2013, NIPS.

[16]  Marina Meila,et al.  Recursive Inversion Models for Permutations , 2014, NIPS.

[17]  Walter Dempsey,et al.  Multiresolution analysis on the symmetric group , 2012, NIPS.

[18]  Ramakrishna Kakarala Interpreting the Phase Spectrum in Fourier Analysis of Partial Ranking Data , 2012, Adv. Numer. Anal..

[19]  Nebojsa Jojic,et al.  Efficient Ranking from Pairwise Comparisons , 2013, ICML.

[20]  Devavrat Shah,et al.  Inferring Rankings Using Constrained Sensing , 2009, IEEE Transactions on Information Theory.

[21]  Ronald Fagin,et al.  Comparing Partial Rankings , 2006, SIAM J. Discret. Math..

[22]  Ashish Kapoor,et al.  Riffled Independence for Efficient Inference with Partial Rankings , 2012, J. Artif. Intell. Res..

[23]  C. L. Mallows NON-NULL RANKING MODELS. I , 1957 .

[24]  Xu Liqun,et al.  A multistage ranking model , 2000 .

[25]  John D. Lafferty,et al.  Cranking: Combining Rankings Using Conditional Probability Models on Permutations , 2002, ICML.

[26]  St'ephan Cl'emenccon,et al.  Multiresolution Analysis of Incomplete Rankings , 2014, 1403.1994.

[27]  John Guiver,et al.  Bayesian inference for Plackett-Luce ranking models , 2009, ICML '09.

[28]  José Antonio Lozano,et al.  Learning Probability Distributions over Permutations by Means of Fourier Coefficients , 2011, Canadian Conference on AI.

[29]  P. Diaconis A Generalization of Spectral Analysis with Application to Ranked Data , 1989 .

[30]  M. Fligner,et al.  Distance Based Ranking Models , 1986 .

[31]  Sergey Kitaev,et al.  Patterns in Permutations and Words , 2011, Monographs in Theoretical Computer Science. An EATCS Series.

[32]  R. Plackett The Analysis of Permutations , 1975 .

[33]  D. A. Bell,et al.  Applied Statistics , 1953, Nature.

[34]  Eyke Hüllermeier,et al.  Label ranking by learning pairwise preferences , 2008, Artif. Intell..