Paper information - Generalized Rank-Breaking: Computational and Statistical Tradeoffs

Generalized Rank-Breaking: Computational and Statistical Tradeoffs

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited. In the application of rank aggregation, for the Plackett-Luce model, we provide a hierarchy of rank-breaking mechanisms ordered by the complexity of the sketch of the data that each one generates. This allows the number of data points collected to be gracefully traded off against the computational resources available, while guaranteeing the desired level of accuracy. Theoretical guarantees on the proposed generalized rank-breaking implicitly provide such trade-offs, which can be explicitly characterized under certain canonical scenarios on the structure of the data. Further, the proposed generalized rank-breaking algorithm involves set-wise comparisons as opposed to traditional pairwise comparisons. The maximum likelihood estimate of pairwise comparisons is computed efficiently using the celebrated minorization maximization algorithm (Hunter, 2004). To compute the pseudo-maximum likelihood estimate of the set-wise comparisons, we provide a generalization of the minorization maximization algorithm and give guarantees on its convergence.
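To make the pairwise baseline mentioned in the abstract concrete, the following is a minimal sketch of traditional full pairwise rank-breaking followed by the standard minorization maximization (MM) update for the Bradley-Terry model. It is not the paper's generalized set-wise algorithm. The function names `pairwise_break` and `bradley_terry_mm`, the input format (each ranking is a list of item indices, best first), and the stopping rule are assumptions made for illustration. The sketch also assumes every item wins at least one comparison, so that the update is well defined.

```python
import numpy as np


def pairwise_break(rankings, n_items):
    """Full pairwise breaking: each ranking (best first) yields one
    comparison for every ordered pair of items it contains.
    Returns wins[i, j] = number of times item i was ranked above item j."""
    wins = np.zeros((n_items, n_items))
    for ranking in rankings:
        for pos, winner in enumerate(ranking):
            for loser in ranking[pos + 1:]:
                wins[winner, loser] += 1
    return wins


def bradley_terry_mm(wins, n_iter=500, tol=1e-10):
    """MM iteration for Bradley-Terry weights (assumed input: win-count matrix).
    Update: gamma_i <- W_i / sum_{j != i} N_ij / (gamma_i + gamma_j),
    where W_i is the total number of wins of item i and N_ij is the number
    of comparisons between i and j. Weights are renormalized to sum to 1."""
    wins = np.asarray(wins, dtype=float)
    n = wins.shape[0]
    pair_counts = wins + wins.T       # N_ij
    total_wins = wins.sum(axis=1)     # W_i
    gamma = np.full(n, 1.0 / n)       # uniform starting point
    for _ in range(n_iter):
        denom = pair_counts / (gamma[:, None] + gamma[None, :])
        np.fill_diagonal(denom, 0.0)  # an item is never compared with itself
        new_gamma = total_wins / denom.sum(axis=1)
        new_gamma /= new_gamma.sum()
        converged = np.max(np.abs(new_gamma - gamma)) < tol
        gamma = new_gamma
        if converged:
            break
    return gamma


# Usage on three hypothetical rankings over items 0, 1, 2.
rankings = [[0, 1, 2], [0, 2, 1], [1, 0, 2]]
wins = pairwise_break(rankings, n_items=3)
gamma = bradley_terry_mm(wins)
```

Treating all broken pairs as independent comparisons is a simplification; the set-wise comparisons described in the abstract are intended to keep more of the information in each ranking, at a higher computational cost.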

Ashish Khetan | Sewoong Oh

[1] Ashish Khetan, et al. Data-driven Rank Breaking for Efficient Rank Aggregation, 2016, J. Mach. Learn. Res.

[2] Devavrat Shah, et al. Rank Centrality: Ranking from Pairwise Comparisons, 2012, Oper. Res.

[3] Yi-Ching Yao, et al. Asymptotics when the number of parameters tends to infinity in the Bradley-Terry model for paired comparisons, 1999.

[4] Nathan Srebro, et al. SVM optimization: inverse dependence on training set size, 2008, ICML '08.

[5] D. Hunter. MM algorithms for generalized Bradley-Terry models, 2003.

[6] Andreas Krause, et al. Tradeoffs for Space, Time, Data and Risk in Unsupervised Learning, 2015, AISTATS.

[7] David C. Parkes, et al. Computing Parametric Ranking Models via Rank-Breaking, 2014, ICML.

[8] Léon Bottou, et al. The Tradeoffs of Large Scale Learning, 2007, NIPS.

[9] Andrzej Lingas, et al. Faster algorithms for finding lowest common ancestors in directed acyclic graphs, 2007, Theor. Comput. Sci.

[10] Martin J. Wainwright, et al. Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues, 2015, IEEE Transactions on Information Theory.

[11] Thomas P. Hayes. A large-deviation inequality for vector-valued martingales, 2003.