Text categorization and similarity analysis: similarity measure, architecture and design
[1] Moses Charikar, et al. Similarity estimation techniques from rounding algorithms, 2002, STOC '02.
[2] Sadhan Sood, et al. Probabilistic Simhash Matching, 2012.
[3] Erik F. Tjong Kim Sang, et al. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition, 2003, CoNLL.
[4] Johannes Knopp. Classification of Named Entities in a large multilingual resource using the Wikipedia category system, 2010.
[5] Ian H. Witten, et al. Clustering Documents with Active Learning Using Wikipedia, 2008, 2008 Eighth IEEE International Conference on Data Mining.
[6] Ian H. Witten, et al. Clustering Documents Using a Wikipedia-Based Concept Representation, 2009, PAKDD.
[7] Luis Gravano, et al. dSCAM: finding document copies across multiple databases, 1996, Fourth International Conference on Parallel and Distributed Information Systems.