English-Chinese Cross-Language Retrieval based on a Translation Package

An inexpensive COTS translation package, augmented with a downloadable bilingual dictionary, was employed for a study of English-Chinese cross-language information retrieval (CLIR) using the query translation approach. The experimental setting involved the 170 MB Chinese collections and 54 queries of TREC and their relevance judgment, and our PIRCS bi-lingual retrieval system. With some standard retrieval techniques such as pretranslation query expansion and combination of retrieval lists, we were able to achieve over 70% of monolingual results for both long and short queries. Insufficient context of short queries appears not a problem for machine translation for English-Chinese CLIR.