Fast Semantic Duplicate Detection Techniques in Databases
[1] Peter Christen, et al. A Comparison of Fast Blocking Methods for Record Linkage, 2003, KDD 2003.
[2] Carlos Alberto Heuser, et al. A fast approach for parallel deduplication on multicore processors, 2011, SAC '11.
[3] James Allan, et al. Using Soundex Codes for Indexing Names in ASR Documents, 2004, HLT-NAACL 2004.
[4] Ashok Koujalagi. Determine Word Relevance in Document Queries Using TF-IDF, 2015.
[5] Peter Christen, et al. A Comparison of Personal Name Matching: Techniques and Practical Issues, 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).
[6] C. Lee Giles, et al. Adaptive sorted neighborhood methods for efficient record linkage, 2007, JCDL '07.
[7] Keizo Oyama, et al. A Fast Linkage Detection Scheme for Multi-Source Information Integration, 2005, International Workshop on Challenges in Web Information Retrieval and Integration.
[8] Kadhum Alnoory, et al. Performance Evaluation of Similarity Functions for Duplicate Record Detection, 2011.
[9] Andrew McCallum, et al. Efficient clustering of high-dimensional data sets with application to reference matching, 2000, KDD '00.
[10] Juan Enrique Ramos, et al. Using TF-IDF to Determine Word Relevance in Document Queries, 2003.