Neural Choice by Elimination via Highway Networks

We introduce Neural Choice by Elimination, a new framework that integrates deep neural networks into probabilistic sequential choice models for learning to rank. Given a set of items to choose from, the elimination strategy starts with the full item set and iteratively removes the least worthy item from the remaining subset. We prove that choice by elimination is equivalent to marginalizing out random Gompertz latent utilities. The choice model is coupled with the recently introduced Highway Networks, which can approximate arbitrarily complex rank functions. We evaluate the framework on a large-scale public dataset with over 425K items drawn from the Yahoo! Learning to Rank Challenge, and show that the proposed method is competitive with state-of-the-art learning-to-rank methods.
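To make the two ingredients of the abstract concrete, here is a minimal NumPy sketch (not the authors' code): a single highway layer that maps item features to a scalar worth, and an elimination-style log-likelihood in which the least worthy remaining item is removed at each step with probability proportional to exp(-worth). The function names, the summed readout of the highway layer, and the exp(-worth) parameterisation are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def highway_layer(x, W_h, b_h, W_t, b_t):
    """One highway layer: a gated mix of a nonlinear transform and the input."""
    h = np.tanh(x @ W_h + b_h)                   # candidate transform
    t = 1.0 / (1.0 + np.exp(-(x @ W_t + b_t)))   # transform gate in (0, 1)
    return t * h + (1.0 - t) * x                 # carry the rest of x through

def elimination_log_likelihood(scores, elimination_order):
    """Log-probability of removing items in `elimination_order` (worst first),
    where item i is eliminated from the remaining pool with probability
    exp(-scores[i]) / sum_j exp(-scores[j])."""
    remaining = list(range(len(scores)))
    log_lik = 0.0
    for i in elimination_order:
        neg = -scores[remaining]
        m = neg.max()
        log_norm = m + np.log(np.exp(neg - m).sum())   # stable log-sum-exp
        log_lik += -scores[i] - log_norm
        remaining.remove(i)
    return log_lik

# Toy usage: score 5 items with 8 features each, then eliminate worst first.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))
W_h, W_t = rng.normal(size=(8, 8)), rng.normal(size=(8, 8))
b_h, b_t = np.zeros(8), np.zeros(8)
worth = highway_layer(X, W_h, b_h, W_t, b_t).sum(axis=1)  # scalar worth per item
order = list(np.argsort(worth))                           # least worthy first
print(elimination_log_likelihood(worth, order))
```

Reading the elimination order bottom-up recovers a full ranking, which is why the per-step elimination probabilities compose into a likelihood over permutations.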
