Query reformulation using WordNet and genetic algorithm

Search on the web is a delay process and it can be hard task especially for beginners when they attempt to use a keyword query language. Beginner (inexpert) searchers commonly attempt to find information with ambiguous queries. These ambiguous queries make the search engine returns irrelevant results. This work aims to get more relevant pages to query through query reformulation and expanding search space. The proposed system has three basic parts WordNet, Google search engine and Genetic Algorithm. Every part has a special task. The system uses WordNet to remove ambiguity from queries by displaying the meaning of every keyword in user query and selecting the proper meaning for keywords. The system obtains synonym for every keyword from WordNet and generates query list. Genetic algorithm is used to create generation for every query in query list. Every query in system is navigated using Google search engine to obtain results from group of documents on the Web. The system has been tested on number of ambiguous queries and it has obtained more relevant URL to user query especially when the query has one keyword. The results are promising and therefore open further research directions.

[1]  Ana Gabriela Maguitman,et al.  Genetic algorithm for information retrieval , 2009, 2009 International Conference on Intelligent Agent & Multi-Agent Systems.

[2]  José R. Pérez-Agüera Using genetic algorithms for query reformulation , 2007 .

[3]  A. A. Aly APPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM , 2007 .

[4]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[5]  Nitin Chopde,et al.  Hybrid Approach for Optimizing the Search Engine Result , 2014 .

[6]  Dana Vrajitoru,et al.  Crossover Improvement for the Genetic Algorithm in Information Retrieval , 1998, Information Processing & Management.

[7]  Sudipto Guha,et al.  ROCK: a robust clustering algorithm for categorical attributes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[8]  Lothar M. Schmitt,et al.  Theory of genetic algorithms , 2001, Theor. Comput. Sci..

[9]  S. V. Raghavan,et al.  Intelligent Search Engine: Simulation to Implementation , 2004, iiWAS.

[10]  Václav Snásel,et al.  Using Genetic Algorithms for Boolean Queries Optimization , 2005, IMSA.

[11]  Abdellatif Rahmoun,et al.  Using WordNet for Text Categorization , 2008, Int. Arab J. Inf. Technol..

[12]  Hsinchun Chen,et al.  A Machine Learning Approach to Inductive Query by Examples: An Experiment Using Relevance Feedback, ID3, Genetic Algorithms, and Simulated Annealing , 1998, J. Am. Soc. Inf. Sci..

[13]  Weiguo Fan,et al.  Effective information retrieval using genetic algorithms based matching functions adaptation , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[14]  Bahgat A. Abdel Latef,et al.  Using Genetic Algorithm to Improve Information Retrieval Systems , 2008 .

[15]  A. Govardhan,et al.  Intelligent Semantic Web Search Engines: A Brief Survey , 2011, ArXiv.

[16]  Michael D. Gordon Probabilistic and genetic algorithms in document retrieval , 1988, CACM.

[17]  V Rajaraman,et al.  As we may think , 1988 .

[18]  Belal Al Khateeb,et al.  Investigating Crossover Techniques for the 8-Queens Problem , 2014 .

[19]  Jan Nemrava Using WordNet Glosses to Refine Google Queries , 2006, DATESO.

[20]  Vannevar Bush,et al.  As we may think , 1945, INTR.

[21]  Mandar Mitra,et al.  Information Retrieval from Documents: A Survey , 2000, Information Retrieval.