Paper information - Overview of the CLEF 2005 Multilingual Question Answering Track

The general aim of the third CLEF Multilingual Question Answering Track was to set up a common and replicable evaluation framework for testing both monolingual and cross-language Question Answering (QA) systems that process queries and documents in several European languages. Nine target languages and ten source languages were used to set up 8 monolingual and 73 cross-language tasks. Twenty-four groups participated in the exercise. Overall, results showed a general increase in performance compared with the previous year. The best-performing monolingual system, irrespective of target language, answered 64.5% of the questions correctly (in the monolingual Portuguese task), while the average of the best performances for each target language was 42.6%. The cross-language step, by contrast, entailed a considerable drop in performance. In addition to accuracy, the organisers also measured the relation between the correctness of an answer and a system’s stated confidence in it, showing that the best systems did not always provide the most reliable confidence scores. We provide an overview of the 2005 QA track, detail the procedure followed to build the test sets, and present a general analysis of the results.
