Near duplicate detection in an academic digital library
暂无分享,去创建一个
[1] Ophir Frieder,et al. Collection statistics for fast duplicate document detection , 2002, TOIS.
[2] Monika Henzinger,et al. Finding near-duplicate web pages: a large-scale evaluation of algorithms , 2006, SIGIR.
[3] Gurmeet Singh Manku,et al. Detecting near-duplicates for web crawling , 2007, WWW '07.
[4] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.
[5] Susan Gauch,et al. Document similarity based on concept tree distance , 2008, Hypertext.
[6] J. A. Chandulal,et al. Signature Based Duplication Detection in Digital Libraries , 2006 .
[7] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[8] R. Manmatha,et al. Partial duplicate detection for large book collections , 2011, CIKM '11.