A Comparison of Techniques to Find Mirrored Hosts on the WWW
暂无分享,去创建一个
Andrei Z. Broder | Krishna Bharat | Jeffrey Dean | Monika R. Henzinger | J. Dean | A. Broder | M. Henzinger | K. Bharat
[1] John A. Hartigan,et al. Clustering Algorithms , 1975 .
[2] Editors , 1986, Brain Research Bulletin.
[3] Peter Willett,et al. Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..
[4] Gerard Salton,et al. Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..
[5] Andrei Z. Broder,et al. On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).
[6] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[7] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.
[8] Hector Garcia-Molina,et al. Finding Near-Replicas of Documents and Servers on the Web , 1998, WebDB.
[9] Andrei Z. Broder,et al. A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.
[10] Oren Etzioni,et al. Web document clustering: a feasibility demonstration , 1998, SIGIR '98.
[11] Krishna Bharat,et al. Improved algorithms for topic distillation in a hyperlinked environment , 1998, SIGIR '98.
[12] Jon M. Kleinberg,et al. Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.
[13] M. KleinbergJon. Authoritative sources in a hyperlinked environment , 1999 .
[14] Andrei Z. Broder,et al. Mirror, Mirror on the Web: A Study of Host Pairs with Replicated Content , 1999, Comput. Networks.
[15] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.
[16] Andrei Z. Broder,et al. A Comparison of Techniques to Find Mirrored Hosts on the WWW , 2000, IEEE Data Engineering Bulletin.
[17] Hector Garcia-Molina,et al. Finding replicated Web collections , 2000, SIGMOD 2000.
[18] Edward T. O'Neill,et al. A Methodology for Sampling the World Wide Web , 2001 .