Ranking Documents in Thesaurus-Based Boolean Retrieval Systems

Abstract In this paper we investigate document ranking methods in thesaurus-based boolean retrieval systems and propose a new thesaurus-based ranking algorithm, the Extended Relevance (E-Relevance) algorithm. The E-Relevance algorithm integrates the extended boolean model with the thesaurus-based relevance algorithm. Because it retains all the desirable properties of the extended boolean model, it avoids the problems of previous thesaurus-based ranking algorithms. It also ranks documents effectively by using term dependence information from the thesaurus. A performance comparison shows that the proposed algorithm achieves higher retrieval effectiveness than the algorithms proposed earlier.
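
For readers unfamiliar with the extended boolean model mentioned above, the following is a minimal sketch of its standard p-norm scoring for unweighted query terms. It is not the E-Relevance algorithm, whose details the abstract does not give. The function names `pnorm_or` and `pnorm_and` are illustrative, and the term weights in [0, 1] are assumed to be supplied by the caller; in a thesaurus-based system they could presumably reflect thesaurus information, but that step is not shown here.

```python
from typing import Sequence


def pnorm_or(weights: Sequence[float], p: float = 2.0) -> float:
    """Score a document against an OR query under the p-norm model.

    weights: document weights in [0, 1] for each query term (assumed given).
    p: norm parameter, p >= 1; p = 1 reduces to a plain average.
    """
    n = len(weights)
    return (sum(w ** p for w in weights) / n) ** (1.0 / p)


def pnorm_and(weights: Sequence[float], p: float = 2.0) -> float:
    """Score a document against an AND query under the p-norm model."""
    n = len(weights)
    return 1.0 - (sum((1.0 - w) ** p for w in weights) / n) ** (1.0 / p)


# A document that matches one of two query terms gets a graded score
# rather than a strict boolean 0 or 1, and OR scores it higher than AND.
doc_weights = [1.0, 0.0]
or_score = pnorm_or(doc_weights, p=2.0)
and_score = pnorm_and(doc_weights, p=2.0)
```

With p = 1 both operators collapse to the mean of the weights, and as p grows the scores approach strict boolean behaviour; this graded ranking is the property that makes the model attractive as a basis for ranking boolean query output.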
