Term Weighting in Information Retrieval Using the Term Precision Model

ABSTRACT. It is known that the use of weighted, as opposed to binary, content identifiers attached to the records of an information file improves the effectiveness of the retrieval operations. Under well-defined conditions the term precision offers the best possible term weighting system. A mathematical model is used in the present study to relate the term precision weights to the frequency of occurrence of the terms in a given document collection and to the number of relevant documents a user wishes to retrieve in response to a query. This provides for the assignment of user-dependent weights to the content identifiers and relates the term precision weights to other well-known term weighting systems.
