Paper information - Online Learning to Rank: Absolute vs. Relative

Online learning to rank holds great promise for learning personalized search result rankings. The first algorithms proposed fall into two families: absolute feedback approaches, based on contextual bandit learning, and relative feedback approaches, based on gradient methods and inferred preferences between complete result rankings. Both types of approach have shown promise, but they have not previously been compared with each other. It is therefore unclear which type is most suitable for which online learning to rank problems. In this work we present the first empirical comparison of absolute and relative online learning to rank approaches.

Katja Hofmann | Yiwei Chen
