The Co-Effects of Query Structure and Expansion on Retrieval Performance in Probabilistic Text Retrieval

The effects of query structures and query expansion (QE) on retrieval performance were tested with a best match retrieval system (InQuery1). Query structure means the use of operators to express the relations between search keys. Six different structures were tested, representing strong structures (e.g., queries with facets or concepts identified) and weak structures (no concepts identified, a query is ‘a bag of search keys’). QE was based on concepts, which were first selected from a searching thesaurus, and then expanded by semantic relationships given in the thesaurus. The expansion levels were (a) no expansion, (b) a synonym expansion, (c) a narrower concept expansion, (d) an associative concept expansion, and (e) a cumulative expansion of all other expansions. With weak structures and Boolean structured queries, QE was not very effective. The best performance was achieved with a combination of a facet structure, where search keys within a facet were treated as instances of one search key (the SYN operator), and the largest expansion.

[1]  Jaana Kekäläinen,et al.  The effects of query complexity, expansion and structure on retrieval performance in probabilistic text retrieval , 1999 .

[2]  Martha W. Evens,et al.  Relational thesauri in information retrieval , 1985, J. Am. Soc. Inf. Sci..

[3]  David J. Groggel,et al.  Practical Nonparametric Statistics , 2000, Technometrics.

[4]  E. Michael Keen,et al.  The Use of Term position Devices in Ranked output Experiments , 1991, J. Documentation.

[5]  C. J. van Rijsbergen,et al.  Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval , 1987, SIGIR 1987.

[6]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[7]  Rebecca Green The expression of Conceptual Syntagmatic Relationships: a Comparative Survey , 1995, J. Documentation.

[8]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[9]  Claire Cardie,et al.  Using clustering and SuperConcepts within SMART: TREC 6 , 1997, Inf. Process. Manag..

[10]  James Allan,et al.  INQUERY at TREC-5 , 1996, TREC.

[11]  Jaana Kekäläinen,et al.  The impact of query structure and query expansion on retrieval performance , 1998, SIGIR '98.

[12]  Ellen M. Voorhees,et al.  The fifth text REtrieval conference (TREC-5) , 1997 .

[13]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[14]  David A. Hull Using Structured Queries for Disambiguation in Cross-Language Information Retrieval , 1997 .

[15]  Timo Niemi,et al.  A deductive data model for query expansion , 1996, SIGIR '96.

[16]  Peter Bailey,et al.  ANU/ACSys TREC-5 Experiments , 1996, TREC.

[17]  Fausto Rabitti,et al.  Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval , 1986 .

[18]  Ari Pirkola,et al.  The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval , 1998, SIGIR '98.

[19]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[20]  Donna K. Harman,et al.  Overview of the Fourth Text REtrieval Conference (TREC-4) , 1995, TREC.

[21]  W. Bruce Croft,et al.  Combining automatic and manual index representations in probabilistic retrieval , 1995 .

[22]  P. Willett,et al.  An Introduction to Algorithmic and Cognitive Approaches for Information Retrieval , 1995 .

[23]  Raya Fidel,et al.  Terminological knowledge structure for intermediary expert systems , 1995 .

[24]  R. A. Groeneveld,et al.  Practical Nonparametric Statistics (2nd ed). , 1981 .

[25]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[26]  E. A. Fox,et al.  Combining the Evidence of Multiple Query Representations for Information Retrieval , 1995, Inf. Process. Manag..