Towards Scalable Real-Time Entity Resolution using a Similarity-Aware Inverted Index Approach
暂无分享,去创建一个
[1] Philip S. Yu,et al. The IGrid index: reversing the dimensionality curse for similarity indexing in high dimensional space , 2000, KDD '00.
[2] Jack G. Conrad,et al. Online duplicate document detection: signature reliability in a dynamic retrieval environment , 2003, CIKM '03.
[3] Keizo Oyama,et al. A Fast Linkage Detection Scheme for Multi-Source Information Integration , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.
[4] Ahmed K. Elmagarmid,et al. Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.
[5] Peter Christen,et al. Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface , 2008, KDD.
[6] Philip S. Yu,et al. LinkClus: efficient clustering via heterogeneous semantic links , 2006, VLDB.
[7] Melanie Herschel,et al. Space and Time Scalability of Duplicate Detection in Graph Data , 2008 .
[8] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[9] Ian H. Witten,et al. Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .
[10] W. Winkler. Overview of Record Linkage and Current Research Directions , 2006 .
[11] William W. Cohen,et al. Learning to match and cluster large high-dimensional data sets for data integration , 2002, KDD.
[12] Dmitri V. Kalashnikov,et al. Domain-independent data cleaning via analysis of entity-relationship graph , 2006, TODS.
[13] Jayant Madhavan,et al. Reference reconciliation in complex information spaces , 2005, SIGMOD '05.
[14] JUSTIN ZOBEL,et al. Inverted files for text search engines , 2006, CSUR.
[15] Lifang Gu,et al. Decision Models for Record Linkage , 2006, Selected Papers from AusDM.
[16] Lise Getoor,et al. Query-time entity resolution , 2006, KDD '06.
[17] J StolfoSalvatore,et al. The merge/purge problem for large databases , 1995 .
[18] C. Lee Giles,et al. Adaptive sorted neighborhood methods for efficient record linkage , 2007, JCDL '07.
[19] Chen Li,et al. Efficient record linkage in large data sets , 2003, Eighth International Conference on Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings..
[20] Craig A. Knoblock,et al. Learning Blocking Schemes for Record Linkage , 2006, AAAI.
[21] Roberto J. Bayardo,et al. Scaling up all pairs similarity search , 2007, WWW '07.
[22] Peter Christen,et al. Quality and Complexity Measures for Data Linkage and Deduplication , 2007, Quality Measures in Data Mining.
[23] Pradeep Ravikumar,et al. A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.
[24] Raymond J. Mooney,et al. Adaptive Blocking: Learning to Scale Up Record Linkage , 2006, Sixth International Conference on Data Mining (ICDM'06).
[25] Peter Christen,et al. A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.
[26] P. Ivax,et al. A THEORY FOR RECORD LINKAGE , 2004 .
[27] Ahmed K. Elmagarmid,et al. TAILOR: a record linkage toolbox , 2002, Proceedings 18th International Conference on Data Engineering.
[28] Peter Christen,et al. A Comparison of Personal Name Matching: Techniques and Practical Issues , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).
[29] Ian H. Witten,et al. Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .