DWCLEANSER: A Framework for Approximate Duplicate Detection
暂无分享,去创建一个
[1] J JebamalarTamilselvi,et al. Detection and elimination of duplicate data using token-based method for a data warehouse: a clustering based approach , 2009 .
[2] Payal Pahwa,et al. An Efficient Algorithm for Data Cleaning , 2011, Int. J. Knowl. Based Organ..
[3] Igor Kononenko,et al. Attribute selection for modelling , 1997, Future Gener. Comput. Syst..
[4] Salvatore J. Stolfo,et al. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.
[5] V. Saravanan,et al. A Unified Framework and Sequential Data Cleaning Approach for a Data Warehouse , 2008 .
[6] Joseph M. Hellerstein,et al. Potter's Wheel: An Interactive Data Cleaning System , 2001, VLDB.
[7] Alvaro E. Monge,et al. Adaptive detection of approximately duplicate database records and the database integration approach to information discovery , 1998 .
[8] Thomas Redman,et al. The impact of poor data quality on the typical enterprise , 1998, CACM.
[9] Ramez Elmasri,et al. Fundamentals of Database Systems , 1989 .
[10] Saied Haidarian Shahri,et al. Eliminating Duplicates in Information Integration: An Adaptive, Extensible Framework , 2006, IEEE Intelligent Systems.