Scalable Techniques for Clustering the Web (Extended Abstract)
暂无分享,去创建一个
[1] Hector Garcia-Molina,et al. Detecting Digital Copyright Violations On The Internet , 1999 .
[2] Edith Cohen,et al. Finding interesting associations without support pruning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).
[3] Andrei Z. Broder,et al. On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).
[4] Oren Etzioni,et al. Web document clustering: a feasibility demonstration , 1998, SIGIR '98.
[5] Martin F. Porter,et al. An algorithm for suffix stripping , 1997, Program.
[6] Tom M. Mitchell,et al. Learning to Extract Symbolic Knowledge from the World Wide Web , 1998, AAAI/IAAI.
[7] Sriram Raghavan,et al. WebBase: a repository of Web pages , 2000, Comput. Networks.
[8] Piotr Indyk,et al. Similarity Search in High Dimensions via Hashing , 1999, VLDB.
[9] Piotr Indyk,et al. Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.
[10] Michael McGill,et al. Introduction to Modern Information Retrieval , 1983 .
[11] David B. Shmoys,et al. A Best Possible Heuristic for the k-Center Problem , 1985, Math. Oper. Res..
[12] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[13] Alan M. Frieze,et al. Min-Wise Independent Permutations , 2000, J. Comput. Syst. Sci..
[14] Rajeev Motwani,et al. Computing Iceberg Queries Efficiently , 1998, VLDB.