论文信息 - IDENTIFICATION OF DUPLICATE RECORDS FOR QUERY RESULTS FROM REAL TIME DATABASES - 字舞流文

IDENTIFICATION OF DUPLICATE RECORDS FOR QUERY RESULTS FROM REAL TIME DATABASES

Angelina Geetha | A. Geetha

[1] J. Aruna,et al. Identification of Duplicate Records over Query Results from Real Time Web Databases , 2013 .

[2] Cai Bo,et al. Research on Chunking Algorithms of Data De-duplication , 2013 .

[3] Sang Yong Park,et al. Ecient Data Deduplication System Considering File Modication Pattern , 2012 .

[4] Wei Wang,et al. A Similar Duplicate Data Detection Method Based on Fuzzy Clustering for Topology Formation , 2012 .

[5] Marijn Schraagen. Complete Coverage for Approximate String Matching in Record Linkage Using Bit Vectors , 2011, 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence.

[6] Xuanjing Huang,et al. Efficient Near-Duplicate Detection for Q&A Forum , 2011, IJCNLP.

[7] Jeffrey Xu Yu,et al. Efficient similarity joins for near-duplicate detection , 2011, TODS.

[8] Guoliang Li,et al. Fast-join: An efficient method for fuzzy token matching based string similarity join , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[9] Vijay S. Mookerjee,et al. Efficient Techniques for Online Record Linkage , 2011, IEEE Transactions on Knowledge and Data Engineering.

[10] Wagner Meira,et al. Adaptive and Flexible Blocking for Record Linkage Tasks , 2010, J. Inf. Data Manag..

[11] Weifeng Su,et al. Record Matching over Query Results from Multiple Web Databases , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12] Rong Jin,et al. Efficient Algorithm for Localized Support Vector Machine , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13] S Zakia. Detection and Elimination of Duplicate Data from Semantic Web Queries , 2010 .

[14] Neha Aggarwal,et al. Query Based Duplicate Data Detection on WWW , 2010 .

[15] Amy J. C. Trappey,et al. A Fuzzy Ontological Knowledge Document Clustering Methodology , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16] Mikhail J. Atallah,et al. Efficient Private Record Linkage , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[17] Peter Christen,et al. Automatic record linkage using seeded nearest neighbour and support vector machine classification , 2008, KDD.

[18] Jeffrey Xu Yu,et al. Efficient similarity joins for near duplicate detection , 2008, WWW.

[19] Peter Christen,et al. Quality and Complexity Measures for Data Linkage and Deduplication , 2007, Quality Measures in Data Mining.

[20] Ahmed K. Elmagarmid,et al. Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[21] Rajeev Motwani,et al. Robust identification of fuzzy duplicates , 2005, 21st International Conference on Data Engineering (ICDE'05).

[22] Wei-Ying Ma,et al. Instance-based Schema Matching for Web Databases by Domain-specific Query Probing , 2004, VLDB.

[23] Jiawei Han,et al. PEBL: Web page classification without negative examples , 2004, IEEE Transactions on Knowledge and Data Engineering.

[24] Lifang Gu,et al. Adaptive Filtering for Efficient Record Linkage , 2004, SDM.

[25] Raymond J. Mooney,et al. Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.

[26] Rajeev Motwani,et al. Robust and efficient fuzzy match for online data cleaning , 2003, SIGMOD '03.

[27] Peter Christen,et al. A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.

[28] Surajit Chaudhuri,et al. Eliminating Fuzzy Duplicates in Data Warehouses , 2002, VLDB.

[29] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.