Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction
[1] Jiaheng Lu, et al. Efficient Merging and Filtering Algorithms for Approximate String Searches, 2008, 2008 IEEE 24th International Conference on Data Engineering.
[2] Xuemin Lin, et al. Ed-Join: an efficient algorithm for similarity joins with edit distance constraints, 2008, Proc. VLDB Endow.
[3] Chengqi Zhang, et al. Efficient approximate entity extraction with edit distance constraints, 2009, SIGMOD Conference.
[4] Sunita Sarawagi, et al. Efficient set joins on similarity predicates, 2004, SIGMOD '04.
[5] Surajit Chaudhuri, et al. A Primitive Operator for Similarity Joins in Data Cleaning, 2006, 22nd International Conference on Data Engineering (ICDE'06).
[6] Raghav Kaushik, et al. Efficient exact set-similarity joins, 2006, VLDB.
[7] Surajit Chaudhuri, et al. Scalable ad-hoc entity extraction from text collections, 2008, Proc. VLDB Endow.
[8] Surajit Chaudhuri, et al. Mining Document Collections to Facilitate Accurate Approximate Entity Matching, 2009, Proc. VLDB Endow.
[9] Divesh Srivastava, et al. Fast Indexes and Algorithms for Set Similarity Selection Queries, 2008, 2008 IEEE 24th International Conference on Data Engineering.
[10] Xuemin Lin, et al. Top-k Set Similarity Joins, 2009, 2009 IEEE 25th International Conference on Data Engineering.
[11] Anthony K. H. Tung, et al. Relaxing join and selection queries, 2006, VLDB.
[12] Roberto J. Bayardo, et al. Scaling up all pairs similarity search, 2007, WWW '07.
[13] Surajit Chaudhuri, et al. An efficient filter for approximate membership checking, 2008, SIGMOD Conference.
[14] Sunita Sarawagi, et al. Efficient Batch Top-k Search for Dictionary-based Entity Recognition, 2006, 22nd International Conference on Data Engineering (ICDE'06).
[15] Luis Gravano, et al. Approximate String Joins in a Database (Almost) for Free, 2001, VLDB.
[16] Xiaofeng Meng, et al. Efficient algorithms for approximate member extraction using signature-based inverted lists, 2009, CIKM.
[17] Rajeev Motwani, et al. Robust and efficient fuzzy match for online data cleaning, 2003, SIGMOD '03.
[18] Rajeev Motwani, et al. Robust identification of fuzzy duplicates, 2005, 21st International Conference on Data Engineering (ICDE'05).
[19] Kyuseok Shim, et al. Power-Law Based Estimation of Set Similarity Join Size, 2009, Proc. VLDB Endow.
[20] Xiaohui Yu, et al. Hashed samples: selectivity estimators for set similarity selection queries, 2008, Proc. VLDB Endow.
[21] Bin Wang, et al. VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams, 2007, VLDB.
[22] Jae-Gil Lee, et al. n-Gram/2L: A Space and Time Efficient Two-Level n-Gram Inverted Index Structure, 2006.
[23] Kyuseok Shim, et al. Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance, 2007, VLDB.
[24] Divesh Srivastava, et al. Incremental maintenance of length normalized indexes for approximate string matching, 2009, SIGMOD Conference.
[25] Jeffrey Xu Yu, et al. Efficient similarity joins for near-duplicate detection, 2011, TODS.