NTCIR-5 Query Expansion Experiments using Term Dependence Models

This paper reports the results of our experiments performed for the Query Term Expansion Subtask, a subtask of the WEB Task, at the Fifth NTCIR Workshop, and the results of our further experiments. In this paper we mainly investigated: (i) the effectiveness of query formulation by composing or decomposing compound words and phrases of the Japanese language, which is based on a theoretical framework via Markov random fields, but taking into account special features of the Japanese language; and(ii) the effectivenessof thecombinationof phrase-based query formulation and pseudo-relevance feedback. We showed that pseudo-relevance feedback worked well, particularly when using query formulation with compound words.

[1]  W. Bruce Croft,et al.  Indri at TREC 2004: Terabyte Track , 2004, TREC.

[2]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[3]  Gilad Mishne,et al.  Boosting Web Retrieval through Query Operations , 2005, BNAIC.

[4]  Isabelle Moulinier,et al.  Thomson Legal and Regulatory at NTCIR-3: Japanese, Chinese and English Retrieval Experiments , 2002, NTCIR.

[5]  W. Bruce Croft,et al.  A comparison of indexing techniques for Japanese text retrieval , 1993, SIGIR.

[6]  Fredric C. Gey,et al.  Experiments on Cross-language and Patent Retrieval at NTCIR-3 Workshop , 2002, NTCIR.

[7]  David Hawking,et al.  Overview of the TREC 2003 Web Track , 2003, TREC.

[8]  Yasushi Ogawa Effective & Efficient Document Ranking without using a Large Lexicon , 1996, VLDB.

[9]  Masaru Kitsuregawa,et al.  University of Tokyo/RICOH at NTCIR-3 Web Retrieval Task , 2002, NTCIR.

[10]  Noriko Kando,et al.  Overview of the Web Retrieval Task at the Third NTCIR Workshop , 2003, NTCIR.

[11]  Noriko Kando,et al.  Handling Orthographic Varieties in Japanese IR: Fusion of Word-, N-Gram-, and Yomi-Based Indices Across Different Document Collections , 2005, AIRS.

[12]  Yasushi Ogawa,et al.  RICOH at NTCIR-2 , 2001, NTCIR.

[13]  W. Bruce Croft,et al.  Combining the language model and inference network approaches to retrieval , 2004, Inf. Process. Manag..

[14]  Ross Wilkinson,et al.  Experiments with Japanese Text Retrieval Using mg , 1999, NTCIR.

[15]  江口 浩二,et al.  Web retrieval task , 2002 .

[16]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[17]  Charles L. A. Clarke,et al.  Overview of the TREC 2004 Terabyte Track , 2004, TREC.

[18]  W. Bruce Croft,et al.  The use of phrases and structured queries in information retrieval , 1991, SIGIR '91.

[19]  Toru Matsuda,et al.  Overlapping statistical word indexing: a new indexing method for Japanese text , 1997, SIGIR '97.

[20]  Gareth J. F. Jones,et al.  Experiments in Japanese text retrieval and routing using the NEAT system , 1998, SIGIR '98.

[21]  Noriko Kando,et al.  System Evaluation Methods for Web Retrieval Tasks Considering Hyperlink Structure , 2003, WWW.

[22]  Keizo Oyama,et al.  Overview of the Informational Retrieval Task at NTCIR-4 WEB , 2004, NTCIR.