An Experimental Assessment of Direct Versus Interlingual Translation for Cross-Language Information Retrieval

We introduce an interlingua-based approach to cross-language information retrieval in which queries and documents are both mapped onto a language-independent concept layer, and retrieval is performed at the level of that interlingua. We contrast this approach with a direct-translation alternative that operates without such an intermediate concept level: non-English queries (German ones, in our experiments) are translated directly into English queries, which are then run against English documents. We provide an empirical evaluation of both alternatives on a large medical document collection.
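
The difference between the two retrieval paths can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the tiny lexicons, the concept identifiers (`C_HEART`, `C_INFLAMMATION`), the function names, and the overlap-count scoring are hypothetical and are not taken from the system or the evaluation described here.

```python
# Toy lexicons; every entry is an illustrative assumption, not data from the paper.
CONCEPTS_DE = {"herz": "C_HEART", "entzündung": "C_INFLAMMATION"}
CONCEPTS_EN = {
    "heart": "C_HEART",
    "cardiac": "C_HEART",
    "inflammation": "C_INFLAMMATION",
}
DE_TO_EN = {"herz": "heart", "entzündung": "inflammation"}


def to_concepts(text, lexicon):
    """Map each known token to its language-independent concept identifier."""
    return {lexicon[tok] for tok in text.lower().split() if tok in lexicon}


def translate_query(text, dictionary):
    """Replace each known source-language token with one target-language word."""
    return {dictionary[tok] for tok in text.lower().split() if tok in dictionary}


def interlingual_score(query_de, doc_en):
    """Interlingual path: match query and document on shared concept identifiers."""
    query_concepts = to_concepts(query_de, CONCEPTS_DE)
    doc_concepts = to_concepts(doc_en, CONCEPTS_EN)
    return len(query_concepts & doc_concepts)


def direct_score(query_de, doc_en):
    """Direct path: translate the query, then match on English surface words."""
    query_words = translate_query(query_de, DE_TO_EN)
    doc_words = set(doc_en.lower().split())
    return len(query_words & doc_words)


if __name__ == "__main__":
    query = "Herz Entzündung"
    document = "cardiac inflammation"
    print("interlingual:", interlingual_score(query, document))
    print("direct:", direct_score(query, document))
```

The example is deliberately constructed so that a synonym ("cardiac" for "heart") is matched only at the concept level. It shows the mechanics of the two paths and says nothing about which alternative performs better in the evaluation.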
