A Query Dependent Approach to Learning to Rank for Information Retrieval

This paper proposes a new ranking approach for information retrieval, where the diversity among queries are taken into consideration. In information retrieval, the users' queries often vary a lot from one to another, so that the documents retrieved from different queries are also distributed differently. Due to this diversity, it is not appropriate to assume all the documents to be ranked are generated i.i.d. (independently and identically distributed) according to a fixed but unknown probability distribution. However, most of the existing learning to rank approaches are proposed on the basis of the conventional i.i.d. assumption. In this paper, the conventional i.i.d. assumption is relaxed to fit the real situations of information retrieval better, and then a new ranking approach, referred to as 'query dependent ranking', is proposed. In our approach, the ranking models for different queries have generality while each of them has its own speciality. The experimental results on both synthetic and real-world datasets show the advantage of our approach to conventional ranking approaches.

[1]  Amnon Shashua,et al.  Ranking with Large Margin Principle: Two Approaches , 2002, NIPS.

[2]  Wei Chu,et al.  New approaches to support vector ordinal regression , 2005, ICML.

[3]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[4]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[5]  Tao Qin,et al.  LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval , 2007 .

[6]  Volker Tresp Proceedings of the NIPS 2005 Workshop on Learning to Rank , 2005 .

[7]  Hang Li,et al.  AdaRank: a boosting algorithm for information retrieval , 2007, SIGIR.

[8]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[9]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[10]  Andrei Z. Broder,et al.  Robust classification of rare queries using web knowledge , 2007, SIGIR.

[11]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[12]  Qiang Yang,et al.  Building bridges for web query classification , 2006, SIGIR.

[13]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[14]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[15]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[16]  Ramesh Nallapati,et al.  Discriminative models for information retrieval , 2004, SIGIR '04.