Ranking Inferences Based on the Top Choice of Multiway Comparisons

This paper considers ranking inference of n items based on the observed data on the top choice among M randomly selected items at each trial. This is a useful modification of the Plackett-Luce model for M -way ranking with only the top choice observed and is an extension of the celebrated Bradley-Terry-Luce model that corresponds to M “ 2 . Under a uniform sampling scheme in which any M distinguished items are selected for comparisons with probability p and the selected M items are compared L times with multinomial outcomes, we establish the statistical rates of convergence for underlying n preference scores using both (cid:96) 2 -norm and (cid:96) 8 -norm, with the min-imum sampling complexity. In addition, we establish the asymptotic normality of the maximum likelihood estimator that allows us to construct confidence intervals for the underlying scores. Furthermore, we propose a novel inference framework for ranking items through a sophisticated maximum pairwise difference statistic whose distribution is estimated via a valid Gaussian multiplier bootstrap. The estimated distribution is then used to construct simultaneous confidence intervals for the differences in the preference scores and the ranks of individual items. They also enable us to address various inference questions on the ranks of these items. Extensive simulation studies lend further support to our theoretical results. A real data application illustrates the usefulness of the proposed methods convincingly.

[1]  Jianqing Fan,et al.  Are Latent Factor Regression and Sparse Regression Adequate? , 2022, Journal of the American Statistical Association.

[2]  Ethan X. Fang,et al.  Lagrangian Inference for Ranking Problems , 2021, Oper. Res..

[3]  Chao Gao,et al.  Partial recovery for top-k ranking: Optimality of MLE and SubOptimality of the spectral method , 2020, The Annals of Statistics.

[4]  Kengo Kato,et al.  Improved central limit theorem and bootstrap approximations in high dimensions , 2019, The Annals of Statistics.

[5]  Chao Gao,et al.  Uncertainty quantification in the Bradley–Terry–Luce model , 2021, Information and Inference: A Journal of the IMA.

[6]  Kani Chen,et al.  Asymptotic theory of sparse Bradley–Terry model , 2020 .

[7]  David Simchi-Levi,et al.  Estimating and Exploiting the Impact of Photo Layout: A Structural Approach , 2019, Manag. Sci..

[8]  Yuxin Chen,et al.  Spectral Method and Regularized MLE Are Both Optimal for Top-$K$ Ranking , 2017, Annals of statistics.

[9]  Sunghyun Kim,et al.  Top-K Rank Aggregation From M-Wise Comparisons , 2018, IEEE J. Sel. Top. Signal Process..

[10]  Devavrat Shah,et al.  Rank Centrality: Ranking from Pairwise Comparisons , 2012, Oper. Res..

[11]  Mihyun Kang,et al.  Threshold and Hitting Time for High-Order Connectedness in Random Hypergraphs , 2016, Electron. J. Comb..

[12]  Jianqing Fan,et al.  Conditional Sure Independence Screening , 2012, Journal of the American Statistical Association.

[13]  Matthias Grossglauser,et al.  Fast and Accurate Inference of Plackett-Luce Models , 2015, NIPS.

[14]  Chenlei Leng,et al.  High dimensional ordinary least squares projection for screening variables , 2015, 1506.01782.

[15]  Kengo Kato,et al.  Central limit theorems and bootstrap in high dimensions , 2014, 1412.3661.

[16]  A. Culyer Thurstone’s Law of Comparative Judgment , 2014 .

[17]  David C. Parkes,et al.  Generalized Method-of-Moments for Rank Aggregation , 2013, NIPS.

[18]  Toby Walsh,et al.  PrefLib: A Library for Preferences http://www.preflib.org , 2013, ADT.

[19]  T. Walsh,et al.  PREFLIB: A Library for Preferences , 2013 .

[20]  Yee Whye Teh,et al.  Bayesian nonparametric Plackett-Luce models for the analysis of preferences for college degree programmes , 2015 .

[21]  Devavrat Shah,et al.  Iterative ranking from pair-wise comparisons , 2012, NIPS.

[22]  David Firth,et al.  Bradley-Terry Models in R: The BradleyTerry2 Package , 2012 .

[23]  Jun Zhang,et al.  Robust rank correlation based screening , 2010, 1012.4255.

[24]  Runze Li,et al.  Model-Free Feature Screening for Ultrahigh-Dimensional Data , 2011, Journal of the American Statistical Association.

[25]  Francesco Ricci,et al.  Group recommendations with rank aggregation and collaborative filtering , 2010, RecSys '10.

[26]  Jianqing Fan,et al.  Sure independence screening in generalized linear models with NP-dimensionality , 2009, The Annals of Statistics.

[27]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[28]  A. Starnes,et al.  Statistical Models Applied to the Rating of Sports Teams , 2005 .

[29]  Kenneth Y. Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.

[30]  Eyke Hüllermeier,et al.  Pairwise Preference Learning and Ranking , 2003, ECML.

[31]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[32]  D. Curtis,et al.  An extended transmission/disequilibrium test (TDT) for multi‐allele marker loci , 1995, Annals of human genetics.

[33]  S. Stigler Citation Patterns in the Journals of Statistics and Probability , 1994 .

[34]  L. Thurstone A law of comparative judgment. , 1994 .

[35]  R. Duncan Luce,et al.  Individual Choice Behavior: A Theoretical Analysis , 1979 .

[36]  R. Plackett The Analysis of Permutations , 1975 .

[37]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[38]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[39]  L. Thurstone The method of paired comparisons for social values , 1927 .

[40]  Ken Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm (cid:3) , 2022 .