A great many automatic indexing methods have been implemented and evaluated over the last few years, and automatic procedures comparable in effectiveness to conventional manual ones are now easy to generate. Two drawbacks of the available automatic indexing methods are the absence of reliable linguistic inputs during the indexing process and the lack of formal, analytical proofs of the effectiveness of the proposed methods. The precision weighting procedure described in the present study uses relevance criteria to weight the terms occurring in user queries as a function of the balance between the relevant and nonrelevant documents in which these terms occur; this balance approximates semantic knowledge of term importance. Formal mathematical proofs of the effectiveness of the method are given under well-defined conditions.
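The idea of weighting a query term by the balance between the relevant and nonrelevant documents containing it can be illustrated with a minimal sketch. The abstract does not state the weighting formula, so the log odds ratio used below is an assumption, as are the function names `precision_weight` and `weight_query`, the additive smoothing constant of 0.5, and the representation of each document as a set of terms.

```python
import math


def precision_weight(rel_with_term, rel_total, nonrel_with_term, nonrel_total,
                     smoothing=0.5):
    """Log odds ratio of a term occurring in relevant vs. nonrelevant documents.

    Assumed form (not taken from the abstract):
        log[ (r / (R - r)) / (s / (S - s)) ]
    with a small additive constant so that zero counts do not cause
    division by zero or log(0).
    """
    r_present = rel_with_term + smoothing
    r_absent = rel_total - rel_with_term + smoothing
    s_present = nonrel_with_term + smoothing
    s_absent = nonrel_total - nonrel_with_term + smoothing
    return math.log((r_present / r_absent) / (s_present / s_absent))


def weight_query(query_terms, relevant_docs, nonrelevant_docs):
    """Assign a weight to every query term from relevance judgments.

    Each document is a set of terms (an illustrative simplification).
    """
    weights = {}
    for term in query_terms:
        r = sum(1 for doc in relevant_docs if term in doc)
        s = sum(1 for doc in nonrelevant_docs if term in doc)
        weights[term] = precision_weight(
            r, len(relevant_docs), s, len(nonrelevant_docs)
        )
    return weights


# Toy relevance judgments, invented purely for illustration.
relevant = [{"retrieval", "index"}, {"retrieval"}]
nonrelevant = [{"index"}, {"library"}]
weights = weight_query(["retrieval", "index", "library"], relevant, nonrelevant)
```

Under these assumptions, a term found mostly in relevant documents receives a positive weight, a term found equally often in both groups receives a weight of zero, and a term found mostly in nonrelevant documents receives a negative weight.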