Query Reformulation with Collaborative Concept-Based Expansion

Most information retrieval systems on the Internet try to supply the ‘best’ documents to a certain query by ranking the complete hit list with various algorithms. But this is only appropriate if the hit list contain relevant documents which can be positioned on the top. In the case of a small or empty hit list no ranking can improve the result and the user must reformulate the query again and again until he gets a sufficient result. This is a tedious task especially if the query is imprecise or too specific. We propose a method for improving the original query by an automatic reformulation method. Each phrase of the query corresponds to a concept where similar terms are stored. These terms are used to reformulate the original query and the user directly receives a hit list with more relevant documents without numerous searching circles.

[1]  C. J. van Rijsbergen,et al.  The selection of good search terms , 1981, Inf. Process. Manag..

[2]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[3]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[4]  Stephen E. Robertson,et al.  Evaluation of Interfaces for IRS: Modelling End-User Searching Behaviour , 1998, BCS-IRSG Annual Colloquium on IR Research.

[5]  Kui-Lam Kwok,et al.  Query modification and expansion in a network with adaptive architecture , 1991, SIGIR '91.

[6]  Ari Pirkola,et al.  Studies on Linguistic Problems and Methods in Text Retrieval: The Effects of Anaphor and Ellipsis Resolution in Proximity Searching, and Translation and Query Structuring Methods in Cross-Language Retrieval , 1999 .

[7]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[8]  Makoto Iwayama,et al.  Relevance feedback with a small number of relevance judgements: incremental relevance feedback vs. document clustering , 2000, SIGIR '00.

[9]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[10]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[11]  Vijay V. Raghavan,et al.  Enhancing Internet Search Engines to Achieve Concept-based Retrieval , 1999 .

[12]  James Allan,et al.  The effect of adding relevance information in a relevance feedback environment , 1994, SIGIR '94.

[13]  IJsbrand Jan Aalbersberg,et al.  Incremental relevance feedback , 1992, SIGIR '92.

[14]  Vijay V. Raghavan,et al.  Concept Based Retrieval by Minimal Term Sets , 1999, ISMIS.

[15]  Daniel G. Shapiro,et al.  RUBRIC: A System for Rule-Based Information Retrieval , 1985, IEEE Transactions on Software Engineering.

[16]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[17]  W. Bruce Croft Approaches to Intelligent Information Retrieval , 1987, Inf. Process. Manag..

[18]  Vijay V. Raghavan,et al.  Adaptive Concept-based Retrieval Using a Neural Network∗ , 2000 .