An immune programming-based ranking function discovery approach for effective information retrieval

In this paper, we propose RankIP, the first immune programming (IP) based ranking function discovery approach. IP is a novel evolution based machine learning algorithm with the principles of immune systems, which is verified to be superior to Genetic Programming (GP) on the convergence of algorithm according to their experimental results in Musilek et al. (2006). However, such superiority of IP is mainly demonstrated for optimization problems. RankIP adapts IP to the learning to rank problem, a typical classification problem. In doing this, the solution representation, affinity function, and high-affinity antibody selection require completely different treatments. Besides, two formulae focusing on selecting best antibody for test are designed for learning to rank. Experimental results demonstrate that the proposed RankIP outperforms the state-of-the-art learning-based ranking methods significantly in terms of [email protected],MAP and [email protected]

[1]  Weiguo Fan,et al.  A generic ranking function discovery framework by genetic programming for information retrieval , 2004, Inf. Process. Manag..

[2]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[3]  Zhou Ji,et al.  Artificial immune system (AIS) research in the last five years , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[4]  Tie-Yan Liu,et al.  Learning to Rank for Information Retrieval , 2011 .

[5]  Jonathan Timmis,et al.  Artificial immune systems as a novel soft computing paradigm , 2003, Soft Comput..

[6]  Pável Calado,et al.  A combined component approach for finding collection-adapted ranking functions based on genetic programming , 2007, SIGIR.

[7]  Weiguo Fan,et al.  Learning to advertise , 2006, SIGIR.

[8]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[9]  Fernando José Von Zuben,et al.  Learning and optimization using the clonal selection principle , 2002, IEEE Trans. Evol. Comput..

[10]  Tao Qin,et al.  FRank: a ranking method with fidelity loss , 2007, SIGIR.

[11]  Andrew Trotman,et al.  Learning to Rank , 2005, Information Retrieval.

[12]  Azadeh Shakery,et al.  Relevance Propagation for Topic Distillation UIUC TREC 2003 Web Track Experiments , 2003, TREC.

[13]  Tao Qin,et al.  A study of relevance propagation for web search , 2005, SIGIR '05.

[14]  Harris Wu,et al.  The effects of fitness functions on genetic programming-based ranking discovery for Web search: Research Articles , 2004 .

[15]  Stephen E. Robertson,et al.  Overview of the Okapi projects , 1997, J. Documentation.

[16]  Weiguo Fan,et al.  Discovery of context-specific ranking functions for effective information retrieval using genetic programming , 2004, IEEE Transactions on Knowledge and Data Engineering.

[17]  Weiguo Fan,et al.  Genetic Programming-Based Discovery of Ranking Functions for Effective Web Search , 2005, J. Manag. Inf. Syst..

[18]  G. Nossal,et al.  The molecular and cellular basis of affinity maturation in the antibody response , 1992, Cell.

[19]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[20]  Tie-Yan Liu,et al.  Learning to rank for information retrieval (LR4IR 2007) , 2007, SIGF.

[21]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[22]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[23]  Amit Singhal,et al.  Pivoted document length normalization , 1996, SIGIR 1996.

[24]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[25]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[26]  R. Collins Studies in artificial evolution , 1992 .

[27]  Weiguo Fan,et al.  Personalization of search engine services for effective retrieval and knowledge management , 2000, ICIS.

[28]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[29]  Petr Musílek,et al.  Immune programming , 2006, Inf. Sci..

[30]  Tao Qin,et al.  LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval , 2007 .

[31]  Hang Li,et al.  AdaRank: a boosting algorithm for information retrieval , 2007, SIGIR.