论文信息 - Report on CLIR Task for the NTCIR-4 Evaluation Campaign

Report on CLIR Task for the NTCIR-4 Evaluation Campaign

This paper describes our second participation in an evaluation campaign involving the Chinese, Japanese, Korean and English languages (NTCIR-5). Our participation is motivated by four objectives: 1) study the retrieval performances of various IR models for these languages; 2) compare the relative retrieval effectiveness of bigram and automatic wordsegmenting approaches for Chinese and Japanese languages; 3) propose a new blind-query expansion hopefully capable of improving mean average precision; and 4) evaluate the relative performance of the various merging strategies used to combine separate result lists extracted from a corpus written in English, Chinese, Japanese or Korean.

Jacques Savoy

[1] Stephen E. Robertson,et al. Experimentation as a way of life: Okapi at TREC , 2000, Inf. Process. Manag..

[2] Jacques Savoy,et al. Combining Multiple Strategies for Effective Monolingual and Cross-Language Retrieval , 2004, Information Retrieval.

[3] Hsin-Hsi Chen,et al. Overview of CLIR Task at the Fourth NTCIR Workshop , 2004, NTCIR.

[4] Jacques Savoy,et al. Statistical inference in retrieval effectiveness evaluation , 1997, Inf. Process. Manag..

[5] Fredric C. Gey,et al. Experiments on Cross-language and Patent Retrieval at NTCIR-3 Workshop , 2002, NTCIR.

[6] Hsin-Hsi Chen,et al. Overview of CLIR Task at the Sixth NTCIR Workshop , 2005, NTCIR.

[7] Jacques Savoy,et al. Comparative study of monolingual and multilingual search models for use with asian languages , 2005, TALIP.

[8] Amit Singhal,et al. AT&T at TREC-7 , 1998, TREC.

[9] Chris Buckley,et al. New Retrieval Approaches Using SMART: TREC 4 , 1995, TREC.

[10] Kui-Lam Kwok,et al. A comparison of Chinese document indexing strategies and retrieval models , 2002, TALIP.

[11] C. J. van Rijsbergen,et al. Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[12] Edward A. Fox,et al. Combination of Multiple Searches , 1993, TREC.