Identifying Value Mappings for Data Integration: An Unsupervised Approach
暂无分享,去创建一个
[1] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[2] Raymond J. Mooney,et al. Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.
[3] Dennis Shasha,et al. An extensible Framework for Data Cleaning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).
[4] William E. Winkler,et al. The State of Record Linkage and Current Research Problems , 1999 .
[5] Erhard Rahm,et al. Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.
[6] Laura M. Haas,et al. Clio: a semi-automatic tool for schema mapping , 2001, SIGMOD '01.
[7] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[8] Renée J. Miller,et al. Information-theoretic tools for mining database structure from large data sets , 2004, SIGMOD '04.
[9] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[10] AnHai Doan,et al. iMAP: Discovering Complex Mappings between Database Schemas. , 2004, SIGMOD 2004.
[11] Erhard Rahm,et al. On Matching Schemas Automatically , 2001 .
[12] Erhard Rahm,et al. A survey of approaches to automatic schema matching , 2001, The VLDB Journal.
[13] William W. Cohen. Integration of heterogeneous databases without common domains using queries based on textual similarity , 1998, SIGMOD '98.
[14] T. Landauer,et al. Indexing by Latent Semantic Analysis , 1990 .
[15] Pedro M. Domingos,et al. iMAP: discovering complex semantic matches between database schemas , 2004, SIGMOD '04.
[16] Anuradha Bhamidipaty,et al. Interactive deduplication using active learning , 2002, KDD.
[17] Tova Milo,et al. Using Schema Matching to Simplify Heterogeneous Data Translation , 1998, VLDB.
[18] Alvaro E. Monge,et al. Adaptive detection of approximately duplicate database records and the database integration approach to information discovery , 1998 .
[19] Surajit Chaudhuri,et al. Eliminating Fuzzy Duplicates in Data Warehouses , 2002, VLDB.
[20] H. Kuhn. The Hungarian method for the assignment problem , 1955 .
[21] Ivan P. Fellegi,et al. A Theory for Record Linkage , 1969 .
[22] Gene H. Golub,et al. Matrix computations , 1983 .
[23] Chris Clifton,et al. SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks , 2000, Data Knowl. Eng..
[24] Rajeev Motwani,et al. Robust and efficient fuzzy match for online data cleaning , 2003, SIGMOD '03.
[25] Stuart J. Russell,et al. Identity Uncertainty and Citation Matching , 2002, NIPS.
[26] Jeffrey F. Naughton,et al. On schema matching with opaque column names and data values , 2003, SIGMOD '03.
[27] Pedro M. Domingos,et al. Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.
[28] Luis Gravano,et al. Text joins for data cleansing and integration in an RDBMS , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).
[29] Erhard Rahm,et al. Generic Schema Matching with Cupid , 2001, VLDB.
[30] Erhard Rahm,et al. Comparison of Schema Matching Evaluations , 2002, Web, Web-Services, and Database Systems.
[31] Dongwon Lee,et al. Establishing value mappings using statistical models and user feedback , 2005, CIKM '05.