Angle-based models for ranking data

A new class of general exponential ranking models is introduced which we label angle-based models for ranking data. A consensus score vector is assumed, which assigns scores to a set of items, where the scores reflect a consensus view of the relative preference of the items. The probability of observing a ranking is modeled to be proportional to its cosine of the angle from the consensus vector. Bayesian variational inference is employed to determine the corresponding predictive density. It can be seen from simulation experiments that the Bayesian variational inference approach not only has great computational advantage compared to the traditional MCMC, but also avoids the problem of overfitting inherent when using maximum likelihood methods. The model also works when a large number of items are ranked which is usually an NP-hard problem to find the estimate of parameters for other classes of ranking models. Model extensions to incomplete rankings and mixture models are also developed. Real data applications demonstrate that the model and extensions can handle different tasks for the analysis of ranking data.

[1]  Therese Sørlie,et al.  Presence of bone marrow micrometastasis is associated with different recurrence risk within molecular subtypes of breast cancer , 2007, Molecular oncology.

[2]  Toshihiro Kamishima,et al.  Nantonac collaborative filtering: recommendation based on order responses , 2003, KDD '03.

[3]  Inderjit S. Dhillon,et al.  Clustering on the Unit Hypersphere using von Mises-Fisher Distributions , 2005, J. Mach. Learn. Res..

[4]  Mayer Alvo,et al.  Statistical Methods for Ranking Data , 2014 .

[5]  Philip L. H. Yu,et al.  Author's Personal Copy Computational Statistics and Data Analysis Mixtures of Weighted Distance-based Models for Ranking Data with Applications in Political Studies , 2022 .

[6]  Kurt Hornik,et al.  movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions , 2014 .

[7]  Philip L. H. Yu,et al.  Factor analysis for ranked data with application to a job selection attitude survey , 2005 .

[8]  C. L. Mallows NON-NULL RANKING MODELS. I , 1957 .

[9]  E. Gutiérrez-Peña,et al.  A Bayesian Analysis of Directional Data Using the von Mises–Fisher Distribution , 2005 .

[10]  Joseph S. Verducci,et al.  Probability models on rankings. , 1991 .

[11]  Suvrit Sra,et al.  A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of Is(x) , 2012, Comput. Stat..

[12]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[13]  Philip L. H. Yu,et al.  Bayesian analysis of order-statistics models for ranking data , 2000 .

[14]  K. Mardia,et al.  A fast algorithm for sampling from the posterior of a von Mises distribution , 2014, 1402.3569.

[15]  P. Diaconis Group representations in probability and statistics , 1988 .

[16]  L. Thurstone A law of comparative judgment. , 1994 .

[17]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[18]  M. Fligner,et al.  Multistage Ranking Models , 1988 .

[19]  Guy Lebanon,et al.  Visualizing Incomplete and Partially Ranked Data , 2008, IEEE Transactions on Visualization and Computer Graphics.

[20]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[21]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[22]  L. Thurstone,et al.  A low of comparative judgement , 1927 .

[23]  Jalil Taghia,et al.  Bayesian Estimation of the von-Mises Fisher Mixture Model with Variational Inference , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Mayer Alvo,et al.  On the Balanced Incomplete Block Design for Rankings , 1991 .