SuperPart: Supervised Graph Partitioning for Record Linkage
Andrew Borthwick | Robert A. Barton | Russell Reas | Stephen M. Ash
[1] Georgia Koutrika, et al. Entity resolution with iterative blocking, 2009, SIGMOD Conference.
[2] Weiyi Meng, et al. Efficient SPectrAl Neighborhood blocking for entity resolution, 2011, 2011 IEEE 27th International Conference on Data Engineering.
[3] Anton J. Enright, et al. An efficient algorithm for large-scale detection of protein families, 2002, Nucleic acids research.
[4] Avigdor Gal, et al. Comparative Analysis of Approximate Blocking Techniques for Entity Resolution, 2016, Proc. VLDB Endow.
[5] Santo Fortunato, et al. Community detection in graphs, 2009, ArXiv.
[6] Sheng Chen. The Case for Cost-Sensitive and Easy-To-Interpret Models in Industrial Record Linkage, 2011.
[7] Divesh Srivastava, et al. Incremental Record Linkage, 2014, Proc. VLDB Endow.
[8] Trevor Hastie, et al. The Elements of Statistical Learning, 2001.
[9] Renée J. Miller, et al. Creating probabilistic databases from duplicated data, 2009, The VLDB Journal.
[10] Jan Baumbach, et al. Comparing the performance of biomedical clustering methods, 2015, Nature Methods.
[11] Piotr Indyk, et al. Scalable Techniques for Clustering the Web, 2000, WebDB.
[12] S. Dongen. Graph clustering by flow simulation, 2000.
[13] Greg Finak, et al. Critical assessment of automated flow cytometry data analysis techniques, 2013, Nature Methods.
[14] Divesh Srivastava, et al. Record linkage: similarity measures and algorithms, 2006, SIGMOD Conference.
[15] Ivan P. Fellegi, et al. A Theory for Record Linkage, 1969.
[16] Alieh Saeedi, et al. Comparative Evaluation of Distributed Clustering Schemes for Multi-source Entity Resolution, 2017, ADBIS.
[17] Julio Gonzalo, et al. A comparison of extrinsic clustering evaluation metrics based on formal constraints, 2008, Information Retrieval.
[18] Andrew Borthwick, et al. Dynamic Record Blocking: Efficient Linking of Massive Databases in MapReduce, 2012.
[19] S. P. Lloyd, et al. Least squares quantization in PCM, 1982, IEEE Trans. Inf. Theory.
[20] Rui Xu, et al. Clustering Algorithms in Biomedical Research: A Review, 2010, IEEE Reviews in Biomedical Engineering.
[21] P. Cochat, et al. Et al, 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[22] Vladimir Batagelj, et al. An O(m) Algorithm for Cores Decomposition of Networks, 2003, ArXiv.
[23] Renée J. Miller, et al. Framework for Evaluating Clustering Algorithms in Duplicate Detection, 2009, Proc. VLDB Endow.
[24] Andreas Thor, et al. Evaluation of entity resolution approaches on real-world match problems, 2010, Proc. VLDB Endow.
[25] Frank Harary, et al. Graph Theory, 2016.
[26] Stéphane Bressan, et al. Ricochet: A Family of Unconstrained Algorithms for Graph Clustering, 2009, DASFAA.
[27] Eric R. Ziegel, et al. The Elements of Statistical Learning, 2003, Technometrics.
[28] Divesh Srivastava, et al. Big Data Integration, 2015, Synthesis Lectures on Data Management.
[29] Jure Leskovec, et al. Defining and Evaluating Network Communities Based on Ground-Truth, 2012, ICDM.
[30] Stephen M. Ash, et al. Embracing the Sparse, Noisy, and Interrelated Aspects of Patient Demographics for use in Clinical Medical Record Linkage, 2015, AMIA Joint Summits on Translational Science proceedings.
[31] Jennifer Widom, et al. Swoosh: a generic approach to entity resolution, 2008, The VLDB Journal.
[32] Salvatore J. Stolfo, et al. The merge/purge problem for large databases, 1995, SIGMOD '95.