Chinese-English Cross-Lingual Information Retrieval based on Domain Ontology Knowledge

To improve the effectiveness of cross-lingual information retrieval (CLIR), we present a method based on domain ontology knowledge and apply it to Chinese-English (C-E) CLIR. The domain ontology knowledge is acquired from both the source-language user queries and the target documents, and is used to select target translations and to re-rank the initially retrieved document set. The C-E CLIR dataset from the NTCIR-4 Workshop is used to evaluate the method. Unlike previous work, we make use of source-language user queries throughout the C-E CLIR process. Compared with previous work, this method improved the precision.
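The two uses of domain knowledge named above, translation selection and re-ranking, can be illustrated with a minimal sketch. This is not the authors' algorithm: the flat set of domain terms standing in for an ontology, the function names, the linear score combination, and the `weight` parameter are all assumptions made for illustration.

```python
def select_translation(candidates, domain_terms):
    """Pick the first candidate translation found in the domain vocabulary.

    Falls back to the first candidate when none is a domain term.
    (Hypothetical helper; a real ontology would offer richer relations.)
    """
    for candidate in candidates:
        if candidate in domain_terms:
            return candidate
    return candidates[0]


def domain_score(tokens, domain_terms):
    """Fraction of a document's tokens that are domain terms."""
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in domain_terms)
    return hits / len(tokens)


def rerank(results, domain_terms, weight=0.5):
    """Re-rank an initial result list by mixing in a domain score.

    results: list of (doc_id, initial_score, tokens) tuples.
    weight:  assumed interpolation weight between the two scores.
    Returns a list of (doc_id, combined_score), best first.
    """
    rescored = [
        (doc_id, (1 - weight) * score + weight * domain_score(tokens, domain_terms))
        for doc_id, score, tokens in results
    ]
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


# Toy example with an assumed finance vocabulary.
domain_terms = {"bank", "loan", "interest"}
chosen = select_translation(["shore", "bank"], domain_terms)
initial = [
    ("d1", 0.6, ["river", "shore", "water"]),
    ("d2", 0.5, ["bank", "loan", "rate"]),
]
ranked = rerank(initial, domain_terms)
```

In this toy run the ambiguous query term resolves to the in-domain translation, and the document that contains more domain terms moves ahead of the one with the higher initial score.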
