Ranking functions are instrumental for the success of an information retrieval (search engine) system. However nearly all existing ranking functions are manually designed based on experience, observations and probabilistic theories. This paper tested a novel ranking function discovery technique proposed in [Fan 2003a, Fan2003b] – ARRANGER (Automatic geneRation of RANking functions by GEnetic pRogramming), which uses Genetic Programming (GP) to automatically learn the “best” ranking function, for the robust retrieval task. Ranking function discovery is essentially an optimization problem. As the search space here is not a coordinate system, most of the traditional optimization algorithms could not work. However, this ranking discovery problem could be easily tackled by ARRANGER. In our evaluations on 150 queries from the ad-hoc track of TREC 6, 7, and 8, the performance of our system (in average precision) was improved by nearly 16%, after replacing Okapi BM25 function with a function automatically discovered by ARRANGER. By applying pseudo-relevance feedback and ranking fusion on newly discovered functions, we improved the retrieval performance by up to 30%. The results of our experiments showed that our ranking function discovery technique – ARRANGER – is very effective in discovering high-performing ranking functions.
[1]
Weiguo Fan,et al.
A generic ranking function discovery framework by genetic programming for information retrieval
,
2004,
Inf. Process. Manag..
[2]
Stephen E. Robertson,et al.
Okapi at TREC-3
,
1994,
TREC.
[3]
Gerard Salton,et al.
Term-Weighting Approaches in Automatic Text Retrieval
,
1988,
Inf. Process. Manag..
[4]
Stephen E. Robertson,et al.
Okapi at TREC-4
,
1995,
TREC.
[5]
Alistair Moffat,et al.
Exploring the similarity space
,
1998,
SIGF.
[6]
Weiguo Fan,et al.
Discovery of context-specific ranking functions for effective information retrieval using genetic programming
,
2004,
IEEE Transactions on Knowledge and Data Engineering.
[7]
John H. Holland,et al.
Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence
,
1992
.