Approaching Question Answering by means of Paragraph Validation

In this paper we describe the QA system developed for taking part in Res-PubliQA 2009. Our system was composed by an IR phase focused on improving QA results, a validation step for removing paragraphs that are not promising and a module based on ngrams overlapping for selecting the final answer. Furthermore, a selection module that uses lexical entailment in combination with ngrams overlapping was developed in English. The IR module achieved very promising results that were improved by the ngram ranking. Moreover, the ranking was slightly improved when lexical entailment was used.

[1]  I Levenshtein Vladimir BINARY CODES CAPABLE OF CORRECTING DELETIONS, INSERTIONS, AND REVERSALS , 1966 .

[2]  Günter Neumann,et al.  Information Synthesis for Answer Validation , 2008, CLEF.

[3]  Brigitte Grau,et al.  Justification of Answers by Verification of Dependency Relations - The French AVE Task , 2008, CLEF.

[4]  M. Felisa Verdejo,et al.  Overview of the Answer Validation Exercise 2006 , 2006, CLEF.

[5]  Anselmo Peñas,et al.  Overview of ResPubliQA 2009: Question Answering Evaluation over European Legislation , 2009, CLEF.

[6]  Alberto Téllez-Valero,et al.  Analyzing the Use of Non-overlap Features for Supervised Answer Validation , 2008, CLEF.

[7]  M. Felisa Verdejo,et al.  UNED at Answer Validation Exercise 2007 , 2007, CLEF.

[8]  Eduard H. Hovy,et al.  Question Answering in Webclopedia , 2000, TREC.

[9]  Sanda M. Harabagiu,et al.  The Structure and Performance of an Open-Domain Question Answering System , 2000, ACL.

[10]  Anselmo Peñas,et al.  UNED at PASCAL RTE-2 challenge , 2006 .

[11]  M. Felisa Verdejo,et al.  The Effect of Entity Recognition in Answer Validation , 2006, CLEF.

[12]  J. Williams Challenge! , 1978, British journal of sports medicine.

[13]  Fredric C. Gey,et al.  ENSM-SE at CLEF 2006 : Fuzzy Proximity Method with an Adhoc Influence Function in Evaluation of Multilingual and Multi-modal Information Retrieval 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, Alicante, Spain , 2007 .

[14]  Carol Peters,et al.  Evaluating Systems for Multilingual and Multimodal Information Access, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers , 2009, CLEF.

[15]  Sven Hartrumpf,et al.  University of Hagen at QA@CLEF 2008: Efficient Question Answering with Question Decomposition and Multiple Answer Streams , 2008, CLEF.

[16]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[17]  Adrian Iftene,et al.  Answer Validation on English and Romanian Languages , 2008, CLEF.

[18]  Dragomir R. Radev,et al.  Question-answering by predictive annotation , 2000, SIGIR '00.

[19]  M. Felisa Verdejo,et al.  Overview of the Answer Validation Exercise 2007 , 2006, CLEF.

[20]  Xavier Carreras,et al.  FreeLing: An Open-Source Suite of Language Analyzers , 2004, LREC.

[21]  Alberto Téllez-Valero,et al.  INAOE at QA@CLEF 2008: Evaluating Answer Validation in Spanish Question Answering , 2008, CLEF.

[22]  M. Felisa Verdejo,et al.  Overview of the Answer Validation Exercise 2007 , 2007, CLEF.

[23]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[24]  Lourdes Araujo,et al.  Information Retrieval Baselines for the ResPubliQA Task , 2009, CLEF.

[25]  Carol Peters,et al.  Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers , 2008, CLEF.

[26]  Dan Roth,et al.  Learning Question Classifiers , 2002, COLING.

[27]  M. Felisa Verdejo,et al.  Textual Entailment Recognition Based on Dependency Analysis and WordNet , 2005, MLCW.