论文信息 - On computing text-based similarity in scientific literature

On computing text-based similarity in scientific literature

This paper addresses computing of similarity among papers using text-based measures. First, we analyze the accuracy of the similarities computed using different parts of a paper, and propose a method of Keyword-Extension, which is very useful when text information is incomplete.

[1] Jiawei Han,et al. Data Mining: Concepts and Techniques , 2000 .

[2] Jian Pei,et al. Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[3] Sunju Park,et al. A link-based similarity measure for scientific literature , 2010, WWW '10.