Automatic ranked output from Boolean searches in SIRE

This study examined the effectiveness and efficiency of a fully automatic algorithm for ranking the results of Boolean searches in a document retrieval system built on an inverted file design. It indicated that, with minor modifications to the file design such as those implemented in the Syracuse Information Retrieval Experiment (SIRE), document retrieval systems could efficiently provide users with output lists in which a document's rank is a good indicator of its probable relevance to the user's information need. Within the set of documents retrieved in response to a Boolean query, relevant documents were ranked significantly higher than nonrelevant documents. With an augmented inverted file design, the variable incremental cost of ranked output was only ten cents per query, and ranking required no additional effort from the user.
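The general idea of ranking a Boolean result set from an augmented inverted file can be illustrated with a minimal sketch. The abstract does not specify the ranking algorithm used in SIRE, so everything here is an assumption made for illustration: the index is augmented with within-document term frequencies, the query is a simple AND of terms, and the score is a tf × idf-style weight. The function names `build_index`, `boolean_and`, and `rank` are hypothetical.

```python
import math
from collections import defaultdict


def build_index(docs):
    """Build an inverted file mapping term -> {doc_id: term frequency}.

    Storing the frequency alongside each posting is the assumed
    'augmentation'; a plain inverted file would store only doc ids.
    """
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    return index


def boolean_and(index, terms):
    """Return the set of documents that contain every query term."""
    postings = [set(index.get(t, {})) for t in terms]
    return set.intersection(*postings) if postings else set()


def rank(index, terms, retrieved, n_docs):
    """Order the retrieved set by an assumed tf * idf-style score.

    This weighting is illustrative only, not the method from the study.
    """
    scores = {}
    for doc_id in retrieved:
        score = 0.0
        for t in terms:
            postings = index.get(t, {})
            if doc_id in postings:
                score += postings[doc_id] * math.log(n_docs / len(postings))
        scores[doc_id] = score
    # Highest score first; ties broken by document id for determinism.
    return sorted(retrieved, key=lambda d: (-scores[d], d))


docs = {
    1: "boolean retrieval ranking ranking",
    2: "boolean retrieval",
    3: "ranking boolean retrieval ranking ranking",
    4: "inverted file design",
}
index = build_index(docs)
query = ["boolean", "ranking"]
retrieved = boolean_and(index, query)            # {1, 3}
ranked = rank(index, query, retrieved, len(docs))
print(ranked)                                    # [3, 1]
```

In this sketch the user still submits an ordinary Boolean query; the ordering is computed entirely from data already held in the postings, which is consistent with the abstract's claim that ranked output requires no extra user effort.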
