Evaluation of entity resolution approaches on real-world match problems
[1] Carlo Batini,et al. Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications) , 2006 .
[2] Jayant Madhavan,et al. Reference reconciliation in complex information spaces , 2005, SIGMOD '05.
[3] Carlo Batini,et al. Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.
[4] Erhard Rahm,et al. Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..
[5] Surajit Chaudhuri,et al. Example-driven design of efficient record matching queries , 2007, VLDB.
[6] Ahmed K. Elmagarmid,et al. TAILOR: a record linkage toolbox , 2002, Proceedings 18th International Conference on Data Engineering.
[7] Lifang Gu,et al. Decision Models for Record Linkage , 2006, Selected Papers from AusDM.
[8] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[9] Erhard Rahm,et al. Frameworks for entity matching: A comparison , 2010, Data Knowl. Eng..
[10] Jeffrey Xu Yu,et al. Efficient similarity joins for near-duplicate detection , 2011, TODS.
[11] Carlos Alberto Heuser,et al. SimEval - A Tool for Evaluating the Quality of Similarity Functions , 2007, ER.
[12] Renée J. Miller,et al. Framework for Evaluating Clustering Algorithms in Duplicate Detection , 2009, Proc. VLDB Endow..
[13] Andreas Thor,et al. MOMA - A Mapping-based Object Matching System , 2007, CIDR.
[14] Peter Christen,et al. A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.
[15] Felix Naumann,et al. Object Identification Quality , 2003 .
[16] P. Ivax,et al. A Theory for Record Linkage , 2004 .
[17] Felix Naumann,et al. DogmatiX tracks down duplicates in XML , 2005, SIGMOD '05.
[18] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[19] Marcos André Gonçalves,et al. Learning to deduplicate , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).
[20] Andreas Thor,et al. Learning-Based Approaches for Matching Web Data Entities , 2010, IEEE Internet Computing.
[21] Peter Christen,et al. A Comparison of Personal Name Matching: Techniques and Practical Issues , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).
[22] Surajit Chaudhuri,et al. Eliminating Fuzzy Duplicates in Data Warehouses , 2002, VLDB.
[23] Henry A. Kautz,et al. Hardening soft information sources , 2000, KDD '00.
[24] Erhard Rahm,et al. Training selection for tuning entity matching , 2008, QDB/MUD.
[25] Divesh Srivastava,et al. Record linkage: similarity measures and algorithms , 2006, SIGMOD Conference.
[26] Andrew McCallum,et al. Joint deduplication of multiple record types in relational data , 2005, CIKM '05.
[27] Mikhail Bilenko, Raymond J. Mooney. On Evaluation and Training-Set Construction for Duplicate Detection , 2003 .
[28] Pradeep Ravikumar,et al. A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.
[29] Lise Getoor,et al. Link-based Classification using Labeled and Unlabeled Data , 2003 .
[30] Craig A. Knoblock,et al. Learning Blocking Schemes for Record Linkage , 2006, AAAI.
[31] Felix Naumann,et al. A Duplicate Detection Benchmark for XML (and Relational) Data , 2006 .
[32] Andreas Thor,et al. Comparative evaluation of entity resolution approaches with FEVER , 2009, Proc. VLDB Endow..
[33] Raymond J. Mooney,et al. Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.
[34] Peter Christen,et al. Febrl: a freely available record linkage system with a graphical user interface , 2008 .
[35] Pedro M. Domingos,et al. Object Identification with Attribute-Mediated Dependences , 2005, PKDD.