A Memetic-Based Approach for Web-Based Question Answering

In this paper we proposed an evolutionary approach for answering open-domain factoid questions, which include searching among sentences that are candidate for the final answer with Memetic Algorithm (MA), and using lexical and syntactic features for calculating fitness of the sentences. Our main purpose is making a search engine with accurate answering ability, or a web-based Question Answering (QA) system. The Text Retrieval Conference (TREC) QA Tracks data are used to develop and evaluate the approach. The answering process begins with retrieving related documents from a search engine. Then, MA searches among all the sentences of these documents and finds the best one. Finally, one or more words will be extracted based on our hand-made patterns. The results of different approaches for local search, mutation, and crossover, and also different values for number of reproduction and retrieved documents are investigated in the empirical study section. The results are promising with sufficient retrieved documents, and we have obtained a threshold value for this variable. Using MA instead of examining all the sentences is a trade-off between lowering the process time and sacrificing the accuracy, but the results show that the Mametic-based approach is more efficient.

[1]  Enrique Alba,et al.  Natural language tagging with genetic algorithms , 2006, Inf. Process. Lett..

[2]  Chris Fox,et al.  The Handbook of Computational Linguistics and Natural Language Processing , 2010 .

[3]  Carlos Cotta,et al.  Memetic algorithms and memetic computing optimization: A literature review , 2012, Swarm Evol. Comput..

[4]  Jimmy J. Lin,et al.  Overview of the TREC 2007 Question Answering Track , 2008, TREC.

[5]  Marie-Francine Moens,et al.  A survey on question answering technology from an information retrieval perspective , 2011, Inf. Sci..

[6]  Pablo Moscato,et al.  On Evolution, Search, Optimization, Genetic Algorithms and Martial Arts : Towards Memetic Algorithms , 1989 .

[7]  Jimmy J. Lin,et al.  The START Multimedia Information System: Current Technology and Future Directions , 2002, Multimedia Information Systems.

[8]  María-Dolores Olvera-Lobo,et al.  Question Answering Track Evaluation in TREC, CLEF and NTCIR , 2015, WorldCIST.

[9]  Sadaoki Furui,et al.  Question answering using statistical language modelling , 2012, Comput. Speech Lang..

[10]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[11]  Chih-Fong Tsai,et al.  Evolutionary instance selection for text classification , 2014, J. Syst. Softw..

[12]  Terence C. Fogarty,et al.  Varying the Probability of Mutation in the Genetic Algorithm , 1989, ICGA.

[13]  John Atkinson,et al.  Evolutionary optimization for ranking how-to questions based on user-generated contents , 2013, Expert Syst. Appl..

[14]  Mohammad Saniee Abadeh,et al.  AISQA - An Artificial Immune Question Answering System , 2012 .

[15]  Julian Kupiec,et al.  MURAX: a robust linguistic approach for question answering using an on-line encyclopedia , 1993, SIGIR.