Message extraction through estimated relevance

METER is a text analysis and retrieval system for non-expert computer users to exploit statistical associations between index terms of documents. It will run on a DEC PDP-11/45 minicomputer with continually changing collections of up to 20,000 documents at a time. A scaled version of METER with all major features of the full system has been implemented on a DEC PDP-11/70 as an experimental test bed for evaluation and comparison of associative retrieval algorithms. Although the basic structure of METER is similar to earlier statistical systems for retrospective document searches, the severe requirements of frequent updates of a document collection, of running on a small processor, and of meeting needs of users with little technical training have led to some novel developments. Among these are an update procedure that draws as much as possible on intermediate results from previous updates and a user interface that provides for control over the process of retrieval without calling for knowledge of how that process works.

[1]  F. W. Lancaster,et al.  Information retrieval: on-line , 1973 .

[2]  Gerard Salton,et al.  A theory of indexing , 1975, Regional conference series in applied mathematics.

[3]  E. B. Newman,et al.  Tests of a statistical explanation of the rank-frequency relation for words in written English. , 1958, American Journal of Psychology.

[4]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[5]  Karen Spärck Jones Index term weighting , 1973, Inf. Storage Retr..

[6]  Harry M. Hersh Statistical methods for technical document retrieval , 1977 .

[7]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[8]  C. J. van Rijsbergen,et al.  An Evaluation of feedback in Document Retrieval using Co‐Occurrence Data , 1978, J. Documentation.

[9]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[10]  John M. Morris,et al.  RADC On-Line Retrieval System Evaluation. , 1975 .

[11]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[12]  Van Rijsbergen,et al.  A theoretical basis for the use of co-occurence data in information retrieval , 1977 .

[13]  Christopher Landauer,et al.  Message extraction through estimation of relevance , 1980, SIGIR '80.

[14]  Aviezri S. Fraenkel,et al.  Local Feedback in Full-Text Retrieval Systems , 1977, JACM.

[15]  Nachum Dershowitz,et al.  KEDMA—Linguistic Tools for Retrieval Systems , 1978, JACM.