On the measurement of inter-linker consistency and retrieval effectiveness in hypertext databases

An important stage in the process of retrieval of objects from a hypertext database is the creation of a set of inter-nodal links that are intended to represent the relationships existing between objects; this operation is often undertaken manually, just as index terms are often manually assigned to documents in a conventional retrieval system. In this paper, a study is reported in which several different sets of links were inserted, each by a different person, between the paragraphs of each of a number of full-text documents. The degree of similarity between the members of each pair of link-sets (i.e., the degree of inter-linker consistency) was then evaluated. The results indicated that little similarity existed amongst the link-sets, a finding that is comparable with those of studies of inter-indexer consistency, which suggest that there is generally only a low level of agreement between the sets of index terms assigned to a document by different indexers. These latter studies have historically been considered significant on account of their common assumption that there exists a positive relationship between recorded levels of inter-indexer consistency and the levels of retrieval effectiveness that may be achieved by the systems studied. In order to test the validity of making a similar assumption in the context of link-assignment, the paper continues with a description of an investigation into the nature of the relationship existing between (i) the levels of inter-linker consistency obtaining among the group of hypertext databases used in our earlier experiments and (ii) the levels of effectiveness of a number of searches carried out in those databases. An account is given of the implementation of the searches and of the methods used in the calculation of numerical values expressing their effectiveness, and conclusions are drawn regarding the consistency-effectiveness relationship.

[1]  MaryEllen C. Sievert,et al.  Indexing consistency in Information Science Abstracts , 1991, J. Am. Soc. Inf. Sci..

[2]  Roy Rada,et al.  Searching versus Browsing in Hypertext , 1992, Hypermedia.

[3]  Peter Willett,et al.  Comparison of fragment weighting schemes for substructural analysis , 1989 .

[4]  Johnz Willett Similarity and Clustering in Chemical Information Systems , 1987 .

[5]  Lawrence E. Leonard,et al.  Inter-indexer consistency studies, 1954-1975: a review of the literature and summary of study results , 1977 .

[6]  D. West Introduction to Graph Theory , 1995 .

[7]  Louise T. Su Evaluation Measures for Interactive Information Retrieval , 1992, Inf. Process. Manag..

[8]  Brendan Loughridge,et al.  The careers of MA graduates: training, education and practice , 1988 .

[9]  Nigel Ford Expert systems and artificial intelligence : an information manager's guide , 1991 .

[10]  V. R. Magnuson,et al.  Topological indices: their nature, mutual relatedness, and applications , 1987 .

[11]  Lawrence E. Leonard,et al.  Inter-Indexer Consistency and Retrieval Effectiveness: Measurement of Relationships , 1975 .

[12]  Peter Willett,et al.  Paragraph-based access to full-text documents using a hypertext system , 1991 .

[13]  J. Mathias,et al.  Program , 1970, Symposium on VLSI Technology.

[14]  Robin J. Wilson Introduction to Graph Theory , 1974 .

[15]  Mayer D. Schwartz,et al.  The Dexter Hypertext Reference Model , 1994, CACM.

[16]  Peter Willett,et al.  Measuring the degree of similarity between objects in text retrieval systems , 1993 .

[17]  David Ellis,et al.  On the Creation of Hypertext Links in Full-Text Documents: Measurement of Inter-Linker Consistency , 1994, J. Documentation.

[18]  Mark H. Chignell,et al.  A Model for Information Exploration , 1991, Hypermedia.

[19]  Jacques Savoy Effectiveness of Information Retrieval Systems Used in a Hypertext Environment , 1993, Hypermedia.

[20]  Ben Shneiderman,et al.  Structural analysis of hypertexts: identifying hierarchies and useful metrics , 1992, TOIS.