Transitive probabilistic CLIR models Wessel Kraaij

Transitive translation could be a useful technique to enlarge the number of supported language pairs for a cross-language information retrieval (CLIR) system in a cost-effective manner. The paper describes several setups for transitive translation based on probabilistic translation models. The transitive CLIR models were evaluated on the CLEF test collection and yielded a retrieval effectiveness up to 83% of monolingual performance, which is significantly better than a baseline using the synonym operator.

[1]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[2]  Lisa Ballesteros,et al.  Cross-Language Retrieval via Transitive Translation , 2002 .

[3]  W. Bruce Croft,et al.  Proceedings of the 21th annual international acm sigir conference on research and development in inf , 1998 .

[4]  Michel Simard,et al.  Using cognates to align sentences in bilingual corpora , 1993, TMI.

[5]  Douglas W. Oard,et al.  A comparative study of query and document translation for cross-language information retrieval , 1998, AMTA.

[6]  Wessel Kraaij,et al.  Language Models for Topic Tracking , 2003 .

[7]  Wessel Kraaij,et al.  Different approaches to Cross Language Information Retrieval , 2000, CLIN.

[8]  George F. Foster A Maximum Entropy/Minimum Divergence Translation Model , 2000, ACL.

[9]  Salim Roukos,et al.  Ad hoc and Multilingual Information Retrieval at IBM , 1998, TREC.

[10]  Wessel Kraaij,et al.  Variations on language modeling for information retrieval , 2005, SIGF.

[11]  Jian-Yun Nie,et al.  Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web , 1999, SIGIR '99.

[12]  Djoerd Hiemstra,et al.  Cross Language Retrieval with the Twenty-One system , 1997, TREC.

[13]  Ari Pirkola,et al.  The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval , 1998, SIGIR '98.

[14]  Mark Sanderson,et al.  Addressing the lack of direct translation resources for cross-language retrieval , 2003, CIKM '03.

[15]  Philip Resnik,et al.  Parallel strands: a preliminary investigation into mining the Web for bilingual text , 1998, AMTA.

[16]  Douglas W. Oard,et al.  Probabilistic structured query methods , 2003, SIGIR.

[17]  John D. Lafferty,et al.  Information retrieval as statistical translation , 1999, SIGIR '99.

[18]  Marcello Federico,et al.  Statistical cross-language information retrieval using n-best query translations , 2002, SIGIR '02.

[19]  Mark Sanderson,et al.  Improving cross language retrieval with triangulated translation , 2001, SIGIR '01.

[20]  W. Bruce Croft,et al.  Relevance Models in Information Retrieval , 2003 .

[21]  Wessel Kraaij,et al.  Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval , 2003, CL.