Question Answering on Web Data: The QA Evaluation in Quæro

In the QA and information retrieval domains, progress has been assessed via evaluation campaigns (CLEF, NTCIR, EQueR, TREC). In these evaluations, the systems handle independent questions and should provide one answer to each question, extracted from textual data, for both open and restricted domains. Quaero is a program promoting research and industrial innovation on technologies for automatic analysis and classification of multimedia and multilingual documents. Among the many research areas covered by Quaero, the project organized a series of evaluations of question-answering systems on Web data in 2008 and 2009. For each language, English and French, the full corpus is around 20 GB for 2.5M documents. We describe the task and corpora, and especially the methodology used in 2008 to construct the test set of questions and the new one used in the 2009 campaign. Six types of questions were addressed: factual, non-factual (How, Why, What), list, and Boolean. A description of the participating systems and the results obtained is provided. We show how difficult it is for a question-answering system to work with complex data and questions.
