An Efficient Trie-based Method for Approximate Entity Extraction with Edit-Distance Constraints
[1] Esko Ukkonen,et al. Finding Approximate Patterns in Strings , 1985, J. Algorithms.
[2] Guoliang Li,et al. Fast-join: An efficient method for fuzzy token matching based string similarity join , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[3] Chengqi Zhang,et al. Efficient approximate entity extraction with edit distance constraints , 2009, SIGMOD Conference.
[4] Divesh Srivastava,et al. Incremental maintenance of length normalized indexes for approximate string matching , 2009, SIGMOD Conference.
[5] Bin Wang,et al. VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams , 2007, VLDB.
[6] Surajit Chaudhuri,et al. Mining Document Collections to Facilitate Accurate Approximate Entity Matching , 2009, Proc. VLDB Endow..
[7] Guoliang Li,et al. Trie-join , 2010, Proc. VLDB Endow..
[8] Bin Wang,et al. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently , 2008, SIGMOD Conference.
[10] Jiaheng Lu,et al. Efficient Merging and Filtering Algorithms for Approximate String Searches , 2008, 2008 IEEE 24th International Conference on Data Engineering.
[11] Jeffrey Xu Yu,et al. Efficient similarity joins for near-duplicate detection , 2011, TODS.
[12] Guoliang Li,et al. Efficient interactive fuzzy keyword search , 2009, WWW '09.
[13] Esko Ukkonen,et al. Approximate String Matching with q-grams and Maximal Matches , 1992, Theor. Comput. Sci..
[14] Lee Jae-Gil,et al. n-Gram/2L: A Space and Time Efficient Two-Level n-Gram Inverted Index Structure , 2006.
[15] Rajeev Motwani,et al. Robust and efficient fuzzy match for online data cleaning , 2003, SIGMOD '03.
[16] Surajit Chaudhuri,et al. A Primitive Operator for Similarity Joins in Data Cleaning , 2006, 22nd International Conference on Data Engineering (ICDE'06).
[17] Surajit Chaudhuri,et al. Scalable ad-hoc entity extraction from text collections , 2008, Proc. VLDB Endow..
[18] Donald E. Knuth,et al. Fast Pattern Matching in Strings , 1977, SIAM J. Comput..
[19] Guoliang Li,et al. Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction , 2011, SIGMOD '11.
[20] Xiaofeng Meng,et al. Efficient algorithms for approximate member extraction using signature-based inverted lists , 2009, CIKM.
[21] Rajeev Motwani,et al. Robust identification of fuzzy duplicates , 2005, 21st International Conference on Data Engineering (ICDE'05).
[22] Anthony K. H. Tung,et al. Relaxing join and selection queries , 2006, VLDB.
[23] Divesh Srivastava,et al. Fast Indexes and Algorithms for Set Similarity Selection Queries , 2008, 2008 IEEE 24th International Conference on Data Engineering.
[24] Xuemin Lin,et al. Ed-Join: an efficient algorithm for similarity joins with edit distance constraints , 2008, Proc. VLDB Endow..
[25] Guoliang Li,et al. PASS-JOIN: A Partition-based Method for Similarity Joins , 2011, Proc. VLDB Endow..
[26] Raghav Kaushik,et al. Efficient exact set-similarity joins , 2006, VLDB.
[27] Roberto J. Bayardo,et al. Scaling up all pairs similarity search , 2007, WWW '07.
[28] Surajit Chaudhuri,et al. An efficient filter for approximate membership checking , 2008, SIGMOD Conference.
[29] Sunita Sarawagi,et al. Efficient Batch Top-k Search for Dictionary-based Entity Recognition , 2006, 22nd International Conference on Data Engineering (ICDE'06).
[30] T. H. Merrett,et al. Tries for Approximate String Matching , 1996, IEEE Trans. Knowl. Data Eng..