PC-Filter: A Robust Filtering Technique for Duplicate Record Detection in Large Databases
Tok Wang Ling | Ji Zhang | Han Liu | Robert M. Bruckner
[1] Salvatore J. Stolfo, et al. The merge/purge problem for large databases, 1995, SIGMOD '95.
[2] Mauricio Antonio Hernandez-Sherrington. A generalization of band joins and the merge/purge problem, 1996.
[3] Zhao Li, et al. A fast filtering scheme for large database cleansing, 2002, CIKM '02.
[4] Charles Elkan, et al. The Field Matching Problem: Algorithms and Applications, 1996, KDD.
[5] Surajit Chaudhuri, et al. Eliminating Fuzzy Duplicates in Data Warehouses, 2002, VLDB.
[6] Charles Elkan, et al. An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records, 1997, DMKD.
[7] Rajeev Motwani, et al. Robust and efficient fuzzy match for online data cleaning, 2003, SIGMOD '03.
[8] Luis Gravano, et al. Text joins for data cleansing and integration in an RDBMS, 2003, Proceedings 19th International Conference on Data Engineering.
[9] Tok Wang Ling, et al. A New Efficient Data Cleansing Method, 2002, DEXA.