An efficient approach for data-duplication detection based on RDBMS
[1] Raymond J. Mooney, et al. Adaptive duplicate detection using learnable string similarity measures, 2003, KDD '03.
[2] Esko Ukkonen, et al. Approximate String Matching with q-grams and Maximal Matches, 1992, Theor. Comput. Sci.
[3] Felix Naumann, et al. Industry-scale duplicate detection, 2008, Proc. VLDB Endow.
[4] Luis Gravano, et al. Approximate String Joins in a Database (Almost) for Free, 2001, VLDB.
[5] Ahmed K. Elmagarmid, et al. Duplicate Record Detection: A Survey, 2007, IEEE Transactions on Knowledge and Data Engineering.
[6] Gonzalo Navarro, et al. A guided tour to approximate string matching, 2001, CSUR.
[7] C. Lee Giles, et al. Adaptive sorted neighborhood methods for efficient record linkage, 2007, JCDL '07.