Bridging Languages for Question Answering: DIOGENE at CLEF 2003

This paper presents the extension of the ITC-irst Diogene Question Answering system towards multilinguality. Diogene relies on a well tested three-components architecture built in the framework of our participation in the QA track at the Text Retrieval Conference (TREC 2002). The novelty factors are represented by the enhancement of the system with language-specific tools targeted to the Italian language (e.g. a module in charge of the answer-type extraction, and a named entities recognizer) and the introduction of a module for the translation of Italian queries into English queries. The overall architecture of the extended system, as well as the results obtained in the CLEF 2003 Monolingual Italian and Bilingual Italian/English QA tracks will be presented and discussed throughout the paper.

[1]  Marcello Federico,et al.  ITC-irst at CLEF 2002: Using N-best Query Translations for CLIR , 2002, CLEF.

[2]  Sanda M. Harabagiu,et al.  The Role of Lexico-Semantic Feedback in Open-Domain Textual Question-Answering , 2001, ACL.

[3]  Bernardo Magnini,et al.  Mining Knowledge from Repeated Co-Occurrences: DIOGENE at TREC 2002 , 2002, TREC.

[4]  Bernardo Magnini,et al.  Multilingual Question/Answering: the DIOGENE System , 2001, TREC.

[5]  William R. Hersh,et al.  Managing Gigabytes—Compressing and Indexing Documents and Images (Second Edition) , 2001, Information Retrieval.

[6]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[7]  Bernardo Magnini,et al.  Is It the Right Answer? Exploiting Web Redundancy for Answer Validation , 2002, ACL.

[8]  Bernardo Magnini,et al.  A WordNet-Based Approach to Named Entites Recognition , 2022 .

[9]  Bernardo Magnini,et al.  Comparing Statistical and Content-Based Techniques for Answer Validation on the Web , 2002 .

[10]  Gregory Grefenstette,et al.  Querying across languages: a dictionary-based approach to multilingual information retrieval , 1996, SIGIR '96.

[11]  Ian H. Witten,et al.  Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .

[12]  Zhiping Zheng,et al.  AnswerBus question answering system , 2002 .

[13]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[14]  Changning Huang,et al.  Improving query translation for cross-language information retrieval using statistical models , 2001, SIGIR '01.