Adaptive Windows for Duplicate Detection
暂无分享,去创建一个
[1] Peter Christen,et al. A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.
[2] H B NEWCOMBE,et al. Automatic linkage of vital records. , 1959, Science.
[3] C. Lee Giles,et al. Adaptive sorted neighborhood methods for efficient record linkage , 2007, JCDL '07.
[4] Peter Christen,et al. Probabilistic Data Generation for Deduplication and Data Linkage , 2005, IDEAL.
[5] Charles Elkan,et al. An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records , 1997, DMKD.
[6] Hector Garcia-Molina,et al. Evaluating entity resolution results , 2010, Proc. VLDB Endow..
[7] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[8] Erhard Rahm,et al. Frameworks for entity matching: A comparison , 2010, Data Knowl. Eng..
[9] Felix Naumann,et al. Industry-scale duplicate detection , 2008, Proc. VLDB Endow..
[10] Lifang Gu,et al. Adaptive Filtering for Efficient Record Linkage , 2004, SDM.
[11] Felix Naumann,et al. DuDe: The Duplicate Detection Toolkit , 2010 .
[12] Salvatore J. Stolfo,et al. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.
[13] Jayant Madhavan,et al. Reference reconciliation in complex information spaces , 2005, SIGMOD '05.
[14] Peter Christen,et al. A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication , 2012, IEEE Transactions on Knowledge and Data Engineering.
[15] Stephen Warshall,et al. A Theorem on Boolean Matrices , 1962, JACM.
[16] Felix Naumann,et al. An Introduction to Duplicate Detection , 2010, An Introduction to Duplicate Detection.
[17] Ahmad Abdollahzadeh Barforoush,et al. A Flexible Fuzzy Expert System for Fuzzy Duplicate Elimination in Data Cleaning , 2004, DEXA.
[18] Raymond J. Mooney,et al. Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.
[19] Peter Christen,et al. Quality and Complexity Measures for Data Linkage and Deduplication , 2007, Quality Measures in Data Mining.
[20] Felix Naumann,et al. A Comparison and Generalization of Blocking and Windowing Algorithms for Duplicate Detection , 2009 .
[21] Andreas Polze,et al. Survey on Healthcare IT systems – Standards , Regulations and Security , 2011 .
[22] Pedro M. Domingos,et al. Object Identification with Attribute-Mediated Dependences , 2005, PKDD.
[23] Georgia Koutrika,et al. Entity resolution with iterative blocking , 2009, SIGMOD Conference.
[24] Jennifer Widom,et al. Swoosh: a generic approach to entity resolution , 2008, The VLDB Journal.