A grammar-based entity representation framework for data cleaning
[1] Jeffrey D. Ullman. Principles of Database and Knowledge-Base Systems, 1989.
[2] Ahmed K. Elmagarmid, et al. Duplicate Record Detection: A Survey, 2007, IEEE Transactions on Knowledge and Data Engineering.
[3] Daniel S. Weld, et al. Automatically refining the wikipedia infobox ontology, 2008, WWW.
[4] Silviu Cucerzan, et al. Large-Scale Named Entity Disambiguation Based on Wikipedia Data, 2007, EMNLP.
[5] Jayant Madhavan, et al. Web-Scale Data Integration: You can afford to Pay as You Go, 2007, CIDR.
[6] Alfred V. Aho, et al. Compilers: Principles, Techniques, and Tools, 1986, Addison-Wesley series in computer science / World student series edition.
[7] Elke A. Rundensteiner. Letter from the Special Issue Editor, 1999, IEEE Data Eng. Bull.
[8] Clifford Stein, et al. Introduction to Algorithms, 2nd edition, 2001.
[9] David Maier, et al. Dataspaces: A New Abstraction for Information Management, 2006, DASFAA.
[10] Rajeev Motwani, et al. Robust and efficient fuzzy match for online data cleaning, 2003, SIGMOD '03.
[11] Surajit Chaudhuri, et al. An efficient filter for approximate membership checking, 2008, SIGMOD Conference.
[12] William W. Cohen. Data integration using similarity joins and a word-based information representation language, 2000, TOIS.
[13] Jeffrey D. Ullman, et al. Principles of Database and Knowledge-Base Systems, Volume II, 1988, Principles of computer science series.
[14] William W. Cohen, et al. Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods, 2004, KDD.
[15] Divesh Srivastava, et al. Benchmarking declarative approximate selection predicates, 2007, SIGMOD '07.
[16] Paul A. Viola, et al. Learning to extract information from semi-structured text using a discriminative context free grammar, 2005, SIGIR '05.
[17] Divesh Srivastava, et al. Record linkage: similarity measures and algorithms, 2006, SIGMOD Conference.
[18] William W. Cohen, et al. Semi-Markov Conditional Random Fields for Information Extraction, 2004, NIPS.
[19] D. Nunan. From the special issue editor, 2005.
[20] Jayant Madhavan, et al. Web-Scale Data Integration: You can afford to Pay as You Go, 2007, CIDR.
[21] Surajit Chaudhuri, et al. Transformation-based Framework for Record Matching, 2008, 2008 IEEE 24th International Conference on Data Engineering.
[22] Raymond J. Mooney, et al. Adaptive duplicate detection using learnable string similarity measures, 2003, KDD '03.
[23] Sunita Sarawagi, et al. Automatic segmentation of text into structured records, 2001, SIGMOD '01.
[24] Gerhard Weikum, et al. YAGO: A Large Ontology from Wikipedia and WordNet, 2008, J. Web Semant.