Thesaurus based term ranking for keyword extraction : IEEE Proceedings of the 7th International Workshop on Text-based Information Retrieval (TIR-10), Bilbao, Spain

A common strategy for assigning keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for selecting a word as a keyword is its relevance to the text. The tf.idf score of a term is a widely used relevance measure. Although it is easy to compute and gives quite satisfactory results, this measure does not take (semantic) relations between words into account.
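
For illustration, the tf.idf baseline can be sketched as follows. This is a minimal sketch, not the implementation used in the paper: it assumes one common variant of the measure (relative term frequency multiplied by the natural logarithm of N/df), documents that are already tokenized, and hypothetical function names `tf_idf_scores` and `top_keywords`.

```python
import math
from collections import Counter


def tf_idf_scores(doc_tokens, corpus):
    """Score each distinct term of doc_tokens as tf * log(N / df).

    Assumed variant: tf is the relative frequency of the term in the
    document, df is the number of corpus documents containing the term,
    and N is the number of documents in the corpus.
    """
    n_docs = len(corpus)

    # Document frequency: count each term at most once per document.
    df = Counter()
    for tokens in corpus:
        df.update(set(tokens))

    tf = Counter(doc_tokens)
    scores = {}
    for term, count in tf.items():
        if df[term] == 0:
            # Term never occurs in the corpus: idf is undefined here, so skip it.
            continue
        scores[term] = (count / len(doc_tokens)) * math.log(n_docs / df[term])
    return scores


def top_keywords(doc_tokens, corpus, k=3):
    """Return the k highest-scoring terms; ties are broken alphabetically."""
    scores = tf_idf_scores(doc_tokens, corpus)
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy corpus, invented for this example.
corpus = [
    ["thesaurus", "term", "ranking", "term"],
    ["term", "extraction"],
    ["keyword", "extraction"],
]
keywords = top_keywords(corpus[0], corpus, k=2)
```

Each term is scored independently of every other term here, which is exactly the limitation noted above: two semantically related words in the same document do not reinforce each other's scores.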