Efficient Entity Matching over Multiple Data Sources with MapReduce
[1] Jianmin Wang, et al. MapDupReducer: detecting near duplicates over massive datasets, 2010, SIGMOD Conference.
[2] Jimmy J. Lin, et al. The Curse of Zipf and Limits to Parallelization: A Look at the Stragglers Problem in MapReduce, 2009, LSDS-IR@SIGIR.
[3] Mirek Riedewald, et al. Processing theta-joins using MapReduce, 2011, SIGMOD '11.
[4] Wagner Meira, et al. Adaptive and Flexible Blocking for Record Linkage Tasks, 2010, J. Inf. Data Manag.
[5] Andreas Thor, et al. Multi-pass sorted neighborhood blocking with MapReduce, 2012, Computer Science - Research and Development.
[6] Peter Christen, et al. A Comparison of Fast Blocking Methods for Record Linkage, 2003, KDD 2003.
[7] Erhard Rahm, et al. Data Partitioning for Parallel Entity Matching, 2010, ArXiv.
[8] Dongwon Lee, et al. Parallel linkage, 2007, CIKM '07.
[9] Erhard Rahm, et al. Frameworks for entity matching: A comparison, 2010, Data Knowl. Eng.
[10] Pradeep Ravikumar, et al. A Comparison of String Distance Metrics for Name-Matching Tasks, 2003, IIWeb.
[11] Sanjay Ghemawat, et al. MapReduce: Simplified Data Processing on Large Clusters, 2004, OSDI.
[12] Chen Li, et al. Efficient parallel set-similarity joins using MapReduce, 2010, SIGMOD Conference.
[13] Carlos Eduardo S. Pires, et al. Improving load balancing for MapReduce-based entity matching, 2013, 2013 IEEE Symposium on Computers and Communications (ISCC).
[14] Andreas Thor, et al. Load Balancing for MapReduce-based Entity Resolution, 2011, 2012 IEEE 28th International Conference on Data Engineering.
[15] Peter Christen, et al. A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication, 2012, IEEE Transactions on Knowledge and Data Engineering.