MF-Join: Efficient Fuzzy String Similarity Join with Multi-level Filtering
[1] Lijun Chang, et al. Leveraging Set Relations in Exact Set Similarity Join, 2017, Proc. VLDB Endow.
[2] D. West. Introduction to Graph Theory, 1995.
[3] Xuemin Lin, et al. Efficient exact edit similarity query processing with the asymmetric signature scheme, 2011, SIGMOD '11.
[4] Jeffrey Xu Yu, et al. Efficient similarity joins for near-duplicate detection, 2011, TODS.
[5] Anthony K. H. Tung, et al. LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index, 2016, SIGMOD Conference.
[6] Luis Gravano, et al. Approximate String Joins in a Database (Almost) for Free, 2001, VLDB.
[7] Wen-Syan Li, et al. String Similarity Joins: An Experimental Evaluation, 2014, Proc. VLDB Endow.
[8] Xiaoyong Du, et al. Fast and Scalable Distributed Set Similarity Joins for Big Data Analytics, 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).
[9] Erhard Rahm, et al. Similarity flooding: a versatile graph matching algorithm and its application to schema matching, 2002, Proceedings 18th International Conference on Data Engineering.
[10] Jin Wang, et al. Two birds with one stone: An efficient hierarchical framework for top-k and threshold-based string similarity search, 2015, 2015 IEEE 31st International Conference on Data Engineering.
[11] Srinivasan Parthasarathy, et al. Bayesian Locality Sensitive Hashing for Fast Similarity Search, 2011, Proc. VLDB Endow.
[12] Guoliang Li, et al. An Efficient Partition Based Method for Exact Set Similarity Joins, 2015, Proc. VLDB Endow.
[13] Guoliang Li, et al. Fast-join: An efficient method for fuzzy token matching based string similarity join, 2011, 2011 IEEE 27th International Conference on Data Engineering.
[14] Wei Wang, et al. GPH: Similarity Search in Hamming Space, 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).
[15] Ying Zhang, et al. An Efficient Framework for Exact Set Similarity Search Using Tree Structure Indexes, 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).
[16] Xuemin Lin, et al. Top-k Set Similarity Joins, 2009, 2009 IEEE 25th International Conference on Data Engineering.
[17] Jiaheng Lu, et al. Efficient Merging and Filtering Algorithms for Approximate String Searches, 2008, 2008 IEEE 24th International Conference on Data Engineering.
[18] Xuemin Lin, et al. TT-Join: Efficient Set Containment Join, 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).
[19] Jian Pei, et al. Mining frequent patterns without candidate generation, 2000, SIGMOD '00.
[20] Guoliang Li, et al. Can we beat the prefix filtering?: an adaptive framework for similarity join and search, 2012, SIGMOD Conference.
[21] Michael Stonebraker, et al. SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints, 2017, Proc. VLDB Endow.
[22] Abhinandan Das, et al. Google news personalization: scalable online collaborative filtering, 2007, WWW '07.
[23] Xuemin Lin, et al. SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index, 2014, Proc. VLDB Endow.
[24] Roberto J. Bayardo, et al. Scaling up all pairs similarity search, 2007, WWW '07.
[25] H. Kuhn. The Hungarian method for the assignment problem, 1955.
[26] Chuan Xiao, et al. Pigeonring: A Principle for Faster Thresholded Similarity Search, 2018, Proc. VLDB Endow.
[27] Reynold Xin, et al. Finding related tables, 2012, SIGMOD Conference.
[28] Jin Wang, et al. A Transformation-Based Framework for KNN Set Similarity Search, 2020, IEEE Transactions on Knowledge and Data Engineering.
[29] Chen Li, et al. Efficient parallel set-similarity joins using MapReduce, 2010, SIGMOD Conference.
[30] Alan M. Frieze, et al. Min-wise independent permutations (extended abstract), 1998, STOC '98.
[31] Carlo Zaniolo, et al. An Efficient Sliding Window Approach for Approximate Entity Extraction with Synonyms, 2019, EDBT.
[32] Surajit Chaudhuri, et al. A Primitive Operator for Similarity Joins in Data Cleaning, 2006, 22nd International Conference on Data Engineering (ICDE'06).
[33] Nikolaus Augsten, et al. An Empirical Evaluation of Set Similarity Join Techniques, 2016, Proc. VLDB Endow.
[34] Jiaheng Lu, et al. String similarity measures and joins with synonyms, 2013, SIGMOD '13.
[35] Xuemin Lin, et al. Ed-Join: an efficient algorithm for similarity joins with edit distance constraints, 2008, Proc. VLDB Endow.
[36] Guoliang Li, et al. PASS-JOIN: A Partition-based Method for Similarity Joins, 2011, Proc. VLDB Endow.
[37] Raghav Kaushik, et al. Efficient exact set-similarity joins, 2006, VLDB.