IKMC: An Improved K-Medoids Clustering Method for Near-Duplicated Records Detection
Jian Sun | Zhiwang Cen | Jungang Xu | Ying Pei
[1] Robert A. Wagner, et al. An Extension of the String-to-String Correction Problem, 1975, JACM.
[2] Christos Faloutsos, et al. FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets, 1995, SIGMOD '95.
[3] Surajit Chaudhuri, et al. Eliminating Fuzzy Duplicates in Data Warehouses, 2002, VLDB.
[4] Salvatore J. Stolfo, et al. The merge/purge problem for large databases, 1995, SIGMOD '95.
[5] H. B. Newcombe, et al. Automatic linkage of vital records, 1959, Science.