Detection and translation of OOV terms prior to query time
暂无分享,去创建一个
Accurate cross-language information retrieval requires that query terms be correctly translated. Several new techniques to improve the translation of out of vocabulary terms in English-Chinese cross-language information retrieval have been developed. However, these require queries and a document collection to enable translation disambiguation. Although effective, they involve much processing and searching of the Web at query time, and may not be practical in a production web search engine. In this work, we consider what tasks maybe carried out beforehand, the goal being to reduce the processing required at query time. We have successfully developed new techniques to extract and translate out of vocabulary terms using the Web and add them into a translation dictionary prior to query time.
[1] Ying Zhang,et al. Using the web for automated translation extraction in cross-language information retrieval , 2004, SIGIR '04.
[2] Iadh Ounis,et al. Building Bilingual Dictionaries from Parallel Web Documents , 2002, ECIR.
[3] Hsi-Jian Lee,et al. Translation of web queries using anchor text mining , 2002, TALIP.
[4] Jianqiang Wang,et al. Mandarin-English Information (MEI): investigating translingual speech retrieval , 2004, Comput. Speech Lang..