Multiple query evaluation based on an enhanced genetic algorithm

Recent studies suggest that significant improvement in information retrieval performance can be achieved by combining multiple representations of an information need. The paper presents a genetic approach that combines the results from multiple query evaluations. The genetic algorithm aims to optimise the overall relevance estimate by exploring different directions of the document space. We investigate ways to improve the effectiveness of the genetic exploration by combining appropriate techniques and heuristics known in genetic theory or in the IR field. Indeed, the approach uses a niching technique to solve the relevance multimodality problem, a relevance feedback technique to perform genetic transformations on query formulations and evolution heuristics in order to improve the convergence conditions of the genetic process. The effectiveness of the global approach is demonstrated by comparing the retrieval results obtained by both genetic multiple query evaluation and classical single query evaluation performed on a subset of TREC-4 using the Mercure IRS. Moreover, experimental results show the positive effect of the various techniques integrated to our genetic algorithm model.

[1]  Alain Pétrowski,et al.  A New Selection Operator Dedicated to Speciation , 1997, ICGA.

[2]  Gerard Salton,et al.  Improving Retrieval Performance by Relevance Feedback , 1997 .

[3]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[4]  Thomas Bäck,et al.  Intelligent Mutation Rate Control in Canonical Genetic Algorithms , 1996, ISMIS.

[5]  Lynda Tamine Optimisation de requetes dans un Systeme de recherche d'informationapproche basee sur L'exploitation de Techniques Avancees de L'algorithmique Genetique , 2000 .

[6]  Donna K. Harman,et al.  Relevance feedback revisited , 1992, SIGIR '92.

[7]  Mohand Boughanem,et al.  Query modification based on relevance backpropagation , 1997, RIAO.

[8]  Jong-Hak Lee,et al.  Analyses of multiple evidence combination , 1997, SIGIR '97.

[9]  David B. Fogel,et al.  Evolutionary algorithms in theory and practice , 1997, Complex.

[10]  Michael D. Gordon Probabilistic and genetic algorithms in document retrieval , 1988, CACM.

[11]  Chris Buckley,et al.  The TREC-8 Query Track , 1999, TREC.

[12]  S. Robertson The probability ranking principle in IR , 1997 .

[13]  Jeffrey Katzer,et al.  A study of the overlap among document representations , 1983, SIGIR '83.

[14]  Mohand Boughanem,et al.  Query optimisation using an improved genetic algorithm , 2000, CIKM '00.

[15]  Donald H. Kraft,et al.  Applying Genetic Algorithms to Information Retrieval Systems Via Relevance Feedback , 1995 .

[16]  Mohamed Slimane,et al.  On Using Interactive Genetic Algorithms for Knowledge Discovery in Databases , 1997, ICGA.

[17]  Nicholas J. Belkin,et al.  The effect multiple query representations on information retrieval system performance , 1993, SIGIR.

[18]  Demichelis,et al.  Evaluation of the , 1992, Physical review. B, Condensed matter.

[19]  Mohand Boughanem,et al.  Genetic Approach to Query Space Exploration , 2004, Information Retrieval.

[20]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[21]  Kui-Lam Kwok,et al.  A network approach to probabilistic information retrieval , 1995, TOIS.

[22]  Jorng-Tzong Horng,et al.  Applying genetic algorithms to query optimization in document retrieval , 2000, Inf. Process. Manag..

[23]  Hsinchun Chen Machine learning for information retrieval: neural networks, symbolic learning, and genetic algorithms , 1995 .

[24]  G. Cottrell,et al.  Optimizing Similarity Using Multi-Query Relevance Feedback , 1998, J. Am. Soc. Inf. Sci..

[25]  Robert R. Korfhage,et al.  Query Optimization in Information Retrieval Using Genetic Algorithms , 1993, ICGA.