A Model of Knowledge Based Information Retrieval with Hierarchical Concept Graph

This paper discusses a knowledge based information retrieval model with hierarchical thesaurus. The model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus. The hierarchical thesaurus is represented by a hierarchical‐concept graph (HCG) in which nodes represent concepts and directed edges represent generalisation relationships. Rada et al. have developed a similar model. However, their model considered only a binary indexing scheme and revealed some counter‐intuitive results. Our proposed model extends theirs by allowing the index term and the edge of the HCG to be weighted. A new concept mapping method is devised to overcome Rada's counter‐intuitive results. In addition, a scheme for allowing Boolean operators in user queries is provided with a formula for computing conceptual distance from negated index terms. Experimental results have shown that our model simulates human performance more closely than Rada's model.

[1]  Gerard Salton Historical Note: The Past Thirty Years in Information Retrieval , 1987 .

[2]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[3]  Peter Freeman,et al.  Classifying Software for Reusability , 1987, IEEE Software.

[4]  E. McCluskey Minimization of Boolean functions , 1956 .

[5]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[6]  A. Tversky Features of Similarity , 1977 .

[7]  Roy Rada,et al.  A Graphical Thesaurus-Based Information Retrieval System , 1989, Int. J. Man Mach. Stud..

[8]  Gerard Salton,et al.  The past thirty years in information retrieval , 1987, J. Am. Soc. Inf. Sci..

[9]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[10]  James C. Bezdek,et al.  A Knowledge-Based System Approach to Document Retrieval , 1985, CAIA.

[11]  J E Backus,et al.  Searching for patterns in the MeSH vocabulary. , 1987, Bulletin of the Medical Library Association.

[12]  Davis B. McCarn Medline: An introduction to on-line searching , 1980, J. Am. Soc. Inf. Sci..

[13]  James C. Bezdek,et al.  Knowledge-assisted document retrieval. I: The natural-language interface , 1987 .

[14]  James C. Bezdek,et al.  Knowledge-assisted document retrieval. II: The retrieval process , 1987 .

[15]  M. Kendall Rank Correlation Methods , 1949 .

[16]  W. Bruce Croft,et al.  Retrieving documents by plausible inference: An experimental study , 1989, Inf. Process. Manag..

[17]  Wladyslaw M. Turski On a model of information retrieval system based on thesaurus , 1971, Inf. Storage Retr..

[18]  Jean E. Sammet,et al.  The new (1982) Computing Reviews classification system—final version , 1982, CACM.

[19]  Roy Rada,et al.  Merging Thesauri: Principles and Evaluation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Martha W. Evens,et al.  Relational thesauri in information retrieval , 1985, J. Am. Soc. Inf. Sci..

[21]  Roy Rada,et al.  Ranking documents with a thesaurus , 1989, JASIS.

[22]  Tadeusz Radecki Mathematical model of information retrieval system based on the concept of Fuzzy thesaurus , 1976, Inf. Process. Manag..