Supervised Meta-blocking
暂无分享,去创建一个
George Papastefanatos | Georgia Koutrika | George Papadakis | G. Papadakis | G. Koutrika | George Papastefanatos
[1] Q. Mcnemar. Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.
[2] Julie Beth Lovins,et al. Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.
[3] J. Fleiss. Measuring nominal scale agreement among many raters. , 1971 .
[4] Mike Paterson,et al. A Faster Algorithm Computing String Edit Distances , 1980, J. Comput. Syst. Sci..
[5] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[6] Huan Liu,et al. Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.
[7] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[8] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .
[9] Amit P. Sheth,et al. Semantic Interoperability in Global Information Systems: A Brief Introduction to the Research Area a , 1999 .
[10] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[11] Ian Witten,et al. Data Mining , 2000 .
[12] Luis Gravano,et al. Approximate String Joins in a Database (Almost) for Free , 2001, VLDB.
[13] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.
[14] Anuradha Bhamidipaty,et al. Interactive deduplication using active learning , 2002, KDD.
[15] William W. Cohen,et al. Learning to match and cluster large high-dimensional data sets for data integration , 2002, KDD.
[16] Craig A. Knoblock,et al. Learning domain-independent string transformation weights for high accuracy object identification , 2002, KDD.
[17] Ahmed K. Elmagarmid,et al. TAILOR: a record linkage toolbox , 2002, Proceedings 18th International Conference on Data Engineering.
[18] Peter Christen,et al. A Comparison of Fast Blocking Methods for Record Linkage , 2003, KDD 2003.
[19] Pradeep Ravikumar,et al. A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.
[20] Raymond J. Mooney,et al. Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.
[21] P. Ivax,et al. A THEORY FOR RECORD LINKAGE , 2004 .
[22] Alon Y. Halevy,et al. Semantic Integration Research in the Database Community : A Brief Survey , 2005 .
[23] Keizo Oyama,et al. A Fast Linkage Detection Scheme for Multi-Source Information Integration , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.
[24] Lise Getoor,et al. Link mining: a survey , 2005, SKDD.
[25] Janyce Wiebe,et al. Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.
[26] Jayant Madhavan,et al. Reference reconciliation in complex information spaces , 2005, SIGMOD '05.
[27] Raymond J. Mooney,et al. Adaptive Blocking: Learning to Scale Up Record Linkage , 2006, Sixth International Conference on Data Mining (ICDM'06).
[28] Craig A. Knoblock,et al. Learning Blocking Schemes for Record Linkage , 2006, AAAI.
[29] David Maier,et al. Principles of dataspace systems , 2006, PODS '06.
[30] Divesh Srivastava,et al. Record linkage: similarity measures and algorithms , 2006, SIGMOD Conference.
[31] Ahmed K. Elmagarmid,et al. Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.
[32] Jayant Madhavan,et al. Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.
[33] C. Lee Giles,et al. Adaptive sorted neighborhood methods for efficient record linkage , 2007, JCDL '07.
[34] Josep-Lluís Larriba-Pey,et al. On the Use of Semantic Blocking Techniques for Data Cleansing and Integration , 2007, 11th International Database Engineering and Applications Symposium (IDEAS 2007).
[35] Marie-Francine Moens,et al. A machine learning approach to sentiment analysis in multilingual Web texts , 2009, Information Retrieval.
[36] Wouter van Atteveldt,et al. Good News or Bad News? Conducting Sentiment Analysis on Dutch Text to Distinguish Between Positive and Negative Relations , 2008 .
[37] Hsinchun Chen,et al. Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.
[38] Grzegorz Kondrak,et al. A Comparison of Sentiment Analysis Techniques: Polarizing Movie Blogs , 2008, Canadian Conference on AI.
[39] Georgia Koutrika,et al. Entity resolution with iterative blocking , 2009, SIGMOD Conference.
[40] Sanjay Chawla,et al. Robust record linkage blocking using suffix arrays , 2009, CIKM.
[41] Dmitri V. Kalashnikov,et al. Exploiting context analysis for combining multiple entity resolution systems , 2009, SIGMOD Conference.
[42] Tim Berners-Lee,et al. Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..
[43] Peter Christen,et al. Accurate Synthetic Generation of Realistic Personal Information , 2009, PAKDD.
[44] Felix Naumann,et al. A Comparison and Generalization of Blocking and Windowing Algorithms for Duplicate Detection , 2009 .
[45] Dongwon Lee,et al. HARRA: fast iterative hashed record linkage for large-scale data collections , 2010, EDBT '10.
[46] Patrick Paroubek,et al. Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.
[47] Felix Naumann,et al. An Introduction to Duplicate Detection , 2010, An Introduction to Duplicate Detection.
[48] Junlan Feng,et al. Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.
[49] Oren Etzioni,et al. Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.
[50] Peter Fankhauser,et al. Efficient entity resolution for large heterogeneous information spaces , 2011, WSDM '11.
[51] Brendan T. O'Connor,et al. Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.
[52] Claudia Niederée,et al. To compare or not to compare: making entity resolution more efficient , 2011, SWIM '11.
[53] Muhammad Abdul-Mageed,et al. Subjectivity and Sentiment Analysis of Modern Standard Arabic , 2011, ACL.
[54] Ke Xu,et al. MoodLens: an emoticon-based sentiment analysis system for chinese tweets , 2012, KDD.
[55] Peter Christen,et al. A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication , 2012, IEEE Transactions on Knowledge and Data Engineering.
[56] Claudia Niederée,et al. Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data , 2012, WSDM '12.
[57] Ashwin Machanavajjhala,et al. Network sampling , 2013, KDD.
[58] Yongtao Ma,et al. TYPiMatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration , 2013, WSDM.
[59] Saif Mohammad,et al. NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.
[60] Huan Liu,et al. Unsupervised sentiment analysis with emotional signals , 2013, WWW.
[61] Fabrício Benevenuto,et al. Comparing and combining sentiment analysis methods , 2013, COSN '13.
[62] Preslav Nakov,et al. SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.
[63] Avigdor Gal,et al. MFIBlocks: An effective blocking algorithm for entity resolution , 2013, Inf. Syst..
[64] Hector Garcia-Molina,et al. Pay-As-You-Go Entity Resolution , 2013, IEEE Transactions on Knowledge and Data Engineering.
[65] Claudia Niederée,et al. A Blocking Framework for Entity Resolution in Highly Heterogeneous Information Spaces , 2013, IEEE Transactions on Knowledge and Data Engineering.
[66] Nikos Tsirakis,et al. Sentiment Analysis for Reputation Management: Mining the Greek Web , 2014, SETN.
[67] Wolfgang Nejdl,et al. Meta-Blocking: Taking Entity Resolutionto the Next Level , 2014, IEEE Transactions on Knowledge and Data Engineering.
[68] Disha Gupta,et al. A Clustering Approach for Class Imbalance Problem in Data Mining , 2016 .