Bees Swarm Optimization Based Approach for Web Information Retrieval

This paper deals with large scale information retrieval aiming at contributing to web searching. The collections of documents considered are huge and not obvious to tackle with classical approaches. The greater the number of documents belonging to the collection, the more powerful approach required. A Bees Swarm Optimization algorithm called BSO-IR is designed to explore the prohibitive number of documents to find the information needed by the user. Extensive experiments were performed on CACM and RCV1 collections and more large corpuses in order to show the benefit gained from using such approach instead of the classic one. Performances in terms of solutions quality and runtime are compared between BSO and exact algorithms. Numerical results exhibit the superiority of BSO-IR on previous works in terms of scalability while yielding comparable quality.

[1]  Marco Dorigo,et al.  From Natural to Artificial Swarm Intelligence , 1999 .

[2]  Bahgat A. Abdel Latef,et al.  Using Genetic Algorithm to Improve Information Retrieval Systems , 2008 .

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  Habiba Drias,et al.  Cooperative Bees Swarm for Solving the Maximum Weighted Satisfiability Problem , 2005, IWANN.

[5]  Habiba Drias,et al.  Personalizing the Source Selection and the Result Merging Process , 2009, Int. J. Artif. Intell. Tools.

[6]  Marco Dorigo,et al.  Swarm intelligence: from natural to artificial systems , 1999 .

[7]  Václav Snásel,et al.  Implicit User Modelling Using Hybrid Meta-Heuristics , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[8]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[9]  Weiguo Fan,et al.  Effective information retrieval using genetic algorithms based matching functions adaptation , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[10]  Kechid Samir,et al.  Mutli-agent System for Personalizing Information Source Selection , 2009 .

[11]  Dana Vrajitoru,et al.  Crossover Improvement for the Genetic Algorithm in Information Retrieval , 1998, Information Processing & Management.

[12]  Hsinchun Chen Machine learning for information retrieval: neural networks, symbolic learning, and genetic algorithms , 1995 .

[13]  Hsinchun Chen,et al.  Machine Learning for Information Retrieval: Neural Networks, Symbolic Learning, and Genetic Algorithms , 1995, J. Am. Soc. Inf. Sci..