Organizing News Archives by Near-Duplicate Copy Detection in Digital Libraries
暂无分享,去创建一个
[1] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.
[2] Justin Zobel,et al. Methods for Identifying Versioned and Plagiarized Documents , 2003, J. Assoc. Inf. Sci. Technol..
[3] Hector Garcia-Molina,et al. Copy detection mechanisms for digital documents , 1995, SIGMOD '95.
[4] Grace Hui Yang,et al. Near-duplicate detection by instance-level constrained clustering , 2006, SIGIR.
[5] Stuart W. Shulman. E-Rulemaking: Issues in Current Research and Practice [1] , 2005 .
[6] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[7] Wei-Ying Ma,et al. Building implicit links from content for forum search , 2006, SIGIR.
[8] Qiang Yang,et al. A comparison of implicit and explicit links for web page classification , 2006, WWW '06.
[9] Hector Garcia-Molina,et al. SCAM: A Copy Detection Mechanism for Digital Documents , 1995, DL.
[10] Monika Henzinger,et al. Finding near-duplicate web pages: a large-scale evaluation of algorithms , 2006, SIGIR.
[11] Dennis Shasha,et al. StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time , 2002, VLDB.