Query Expansion with the Minimum User Feedback by Transductive Learning

Query expansion techniques generally select new query terms from a set of top ranked documents. Although a user's manual judgment of those documents would much help to select good expansion terms, it is difficult to get enough feedback from users in practical situations. In this paper we propose a query expansion technique which performs well even if a user notifies just a relevant document and a non-relevant document. In order to tackle this specific condition, we introduce two refinements to a well-known query expansion technique. One is application of a transductive learning technique in order to increase relevant documents. The other is a modified parameter estimation method which laps the predictions by multiple learning trials and try to differentiate the importance of candidate terms for expansion in relevant documents. Experimental results show that our technique outperforms some traditional query expansion methods in several evaluation measures.

[1]  Donna K. Harman,et al.  Overview of the Eighth Text REtrieval Conference (TREC-8) , 1999, TREC.

[2]  Susan T. Dumais,et al.  SIGIR 2003 workshop report: implicit measures of user interests and preferences , 2003, SIGF.

[3]  Makoto Iwayama,et al.  Relevance feedback with a small number of relevance judgements: incremental relevance feedback vs. document clustering , 2000, SIGIR '00.

[4]  John D. Lafferty,et al.  Semi-supervised learning using randomized mincuts , 2004, ICML.

[5]  Yasuhiko Kitamura,et al.  Keyword Spices: A New Method for Building Domain-Specific Web Search Engines , 2001, IJCAI.

[6]  Michael Collins,et al.  AT&T at TREC-8 , 1999, TREC.

[7]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[8]  C. Lee Giles,et al.  Extracting query modifications from nonlinear SVMs , 2002, WWW '02.

[9]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[10]  Gareth J. F. Jones,et al.  Applying summarization techniques for term selection in relevance feedback , 2001, SIGIR '01.

[11]  Stephen E. Robertson,et al.  On Term Selection for Query Expansion , 1991, J. Documentation.

[12]  Jingrui He,et al.  Manifold-ranking based image retrieval , 2004, MULTIMEDIA '04.

[13]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[14]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[15]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[16]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[17]  Wei-Ying Ma,et al.  Improving pseudo-relevance feedback in web information retrieval using web page segmentation , 2003, WWW '03.

[18]  IJsbrand Jan Aalbersberg,et al.  Incremental relevance feedback , 1992, SIGIR '92.

[19]  Ian Ruthven,et al.  Re-examining the potential effectiveness of interactive query expansion , 2003, SIGIR.

[20]  Seiji Yamada,et al.  Non-relevance feedback document retrieval , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[21]  Robert D. Macredie,et al.  Cognitive styles and hypermedia navigation: Development of a learning model , 2002, J. Assoc. Inf. Sci. Technol..