We propose a new method for query expansion called "term similarity tree model" (TSTM). Term similarity tree is built to represent and estimate similarities between terms. Based on TSTM, we use similarity restriction and overlay restriction to implement query expansion. This method can cluster terms automatically, make the process of query expansion more flexible and controllable, and control noise effectively. In addition, the parameters of TSTM can be adjusted easily to meet the requirements of different types of queries. TREC data is used to test the method. The experiments show that TSTM method outperforms the existing methods in query expansion, such as WordNet-based method, local cooccurrence method and latent semantic indexing (LSI-based) method.
[1]
Takenobu Tokunaga,et al.
Combining multiple evidence from different types of thesaurus for query expansion
,
1999,
SIGIR '99.
[2]
He Hong.
Query Expansion Based on the Context in Chinese Information Retrieval
,
2002
.
[3]
W. Bruce Croft,et al.
A language modeling approach to information retrieval
,
1998,
SIGIR '98.
[4]
George A. Miller,et al.
Introduction to WordNet: An On-line Lexical Database
,
1990
.
[5]
T. Landauer,et al.
Indexing by Latent Semantic Analysis
,
1990
.