IDENTIFICATION OF DUPLICATE RECORDS FOR QUERY RESULTS FROM REAL TIME DATABASES

[1]  J. Aruna,et al.  Identification of Duplicate Records over Query Results from Real Time Web Databases, 2013.

[2]  Cai Bo,et al.  Research on Chunking Algorithms of Data De-duplication, 2013.

[3]  Sang Yong Park,et al.  Efficient Data Deduplication System Considering File Modification Pattern, 2012.

[4]  Wei Wang,et al.  A Similar Duplicate Data Detection Method Based on Fuzzy Clustering for Topology Formation, 2012.

[5]  Marijn Schraagen.  Complete Coverage for Approximate String Matching in Record Linkage Using Bit Vectors , 2011, 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence.

[6]  Xuanjing Huang,et al.  Efficient Near-Duplicate Detection for Q&A Forum , 2011, IJCNLP.

[7]  Jeffrey Xu Yu,et al.  Efficient similarity joins for near-duplicate detection , 2011, TODS.

[8]  Guoliang Li,et al.  Fast-join: An efficient method for fuzzy token matching based string similarity join , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[9]  Vijay S. Mookerjee,et al.  Efficient Techniques for Online Record Linkage , 2011, IEEE Transactions on Knowledge and Data Engineering.

[10]  Wagner Meira,et al.  Adaptive and Flexible Blocking for Record Linkage Tasks , 2010, J. Inf. Data Manag.

[11]  Weifeng Su,et al.  Record Matching over Query Results from Multiple Web Databases , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12]  Rong Jin,et al.  Efficient Algorithm for Localized Support Vector Machine , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13]  S Zakia.  Detection and Elimination of Duplicate Data from Semantic Web Queries, 2010.

[14]  Neha Aggarwal,et al.  Query Based Duplicate Data Detection on WWW, 2010.

[15]  Amy J. C. Trappey,et al.  A Fuzzy Ontological Knowledge Document Clustering Methodology , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Mikhail J. Atallah,et al.  Efficient Private Record Linkage , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[17]  Peter Christen,et al.  Automatic record linkage using seeded nearest neighbour and support vector machine classification , 2008, KDD.

[18]  Jeffrey Xu Yu,et al.  Efficient similarity joins for near duplicate detection , 2008, WWW.

[19]  Peter Christen,et al.  Quality and Complexity Measures for Data Linkage and Deduplication , 2007, Quality Measures in Data Mining.

[20]  Ahmed K. Elmagarmid,et al.  Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[21]  Rajeev Motwani,et al.  Robust identification of fuzzy duplicates , 2005, 21st International Conference on Data Engineering (ICDE'05).

[22]  Wei-Ying Ma,et al.  Instance-based Schema Matching for Web Databases by Domain-specific Query Probing , 2004, VLDB.

[23]  Jiawei Han,et al.  PEBL: Web page classification without negative examples , 2004, IEEE Transactions on Knowledge and Data Engineering.

[24]  Lifang Gu,et al.  Adaptive Filtering for Efficient Record Linkage , 2004, SDM.

[25]  Raymond J. Mooney,et al.  Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.

[26]  Rajeev Motwani,et al.  Robust and efficient fuzzy match for online data cleaning , 2003, SIGMOD '03.

[27]  Peter Christen,et al.  A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.

[28]  Surajit Chaudhuri,et al.  Eliminating Fuzzy Duplicates in Data Warehouses , 2002, VLDB.

[29]  Andrew McCallum,et al.  Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.