Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data
暂无分享,去创建一个
[1] Chen Li,et al. Supporting Efficient Record Linkage for Large Data Sets Using Mapping Techniques , 2006, World Wide Web.
[2] Georgia Koutrika,et al. Entity resolution with iterative blocking , 2009, SIGMOD Conference.
[3] Peter Fankhauser,et al. The missing links: discovering hidden same-as links among a billion of triples , 2010, iiWAS.
[4] Claudia Niederée,et al. On-the-fly entity-aware query processing in the presence of linkage , 2010, Proc. VLDB Endow..
[5] Previous version: , 2004 .
[6] Nilesh N. Dalvi,et al. Large-Scale Collective Entity Matching , 2011, Proc. VLDB Endow..
[7] David Maier,et al. Principles of dataspace systems , 2006, PODS '06.
[8] Mengchi Liu,et al. Modeling heterogeneous data in dataspace , 2008, IRI.
[9] Peter Fankhauser,et al. Efficient entity resolution for large heterogeneous information spaces , 2011, WSDM '11.
[10] Ahmed K. Elmagarmid,et al. Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.
[11] Pradeep Ravikumar,et al. A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.
[12] Craig A. Knoblock,et al. Learning Blocking Schemes for Record Linkage , 2006, AAAI.
[13] Raymond J. Mooney,et al. Adaptive Blocking: Learning to Scale Up Record Linkage , 2006, Sixth International Conference on Data Mining (ICDM'06).
[14] Dongwon Lee,et al. HARRA: fast iterative hashed record linkage for large-scale data collections , 2010, EDBT '10.
[15] Sanjay Chawla,et al. Robust record linkage blocking using suffix arrays , 2009, CIKM.
[16] Jayant Madhavan,et al. Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.
[17] Jayant Madhavan,et al. Reference reconciliation in complex information spaces , 2005, SIGMOD '05.
[18] Jayant Madhavan,et al. Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.
[19] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[20] Renée J. Miller,et al. A framework for semantic link discovery over relational data , 2009, CIKM.
[21] Craig A. Knoblock,et al. Learning domain-independent string transformation weights for high accuracy object identification , 2002, KDD.
[22] Claudia Niederée,et al. Eliminating the redundancy in blocking-based entity resolution methods , 2011, JCDL '11.
[23] Claudia Niederée,et al. To compare or not to compare: making entity resolution more efficient , 2011, SWIM '11.
[24] Previous version: , 2004 .
[25] Peter Christen,et al. A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication , 2012, IEEE Transactions on Knowledge and Data Engineering.
[26] Tim Berners-Lee,et al. Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..
[27] Luis Gravano,et al. Approximate String Joins in a Database (Almost) for Free , 2001, VLDB.
[28] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.