Limbo: A scalable algorithm to cluster categorical data
暂无分享,去创建一个
Panayiotis Tsaparas | Kenneth C. Sevcik | Periklis Andritsos | Renée J. Miller | K. Sevcik | P. Andritsos | Panayiotis Tsaparas
[1] Jon M. Kleinberg,et al. Clustering categorical data: an approach based on dynamical systems , 2000, The VLDB Journal.
[2] Mark A. Gluck,et al. Information, Uncertainty and the Utility of Categories , 1985 .
[3] David S. Johnson,et al. Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .
[4] Yi Li,et al. COOLCAT: an entropy-based algorithm for categorical clustering , 2002, CIKM '02.
[5] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.
[6] Sudipto Guha,et al. ROCK: a robust clustering algorithm for categorical attributes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).
[7] Tian Zhang,et al. BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.
[8] Yao Wang,et al. A robust and scalable clustering algorithm for mixed type attributes in large database environment , 2001, KDD '01.
[9] Naftali Tishby,et al. Agglomerative Information Bottleneck , 1999, NIPS.
[10] Heikki Mannila,et al. Context-Based Similarity Measures for Categorical Databases , 2000, PKDD.
[11] Naftali Tishby,et al. Unsupervised document classification using sequential information maximization , 2002, SIGIR '02.
[12] Thomas M. Cover,et al. Elements of Information Theory , 2005 .
[13] Allan Borodin,et al. Finding authorities and hubs from link structures on the World Wide Web , 2001, WWW '01.
[14] Naftali Tishby,et al. Document clustering using word clusters via the information bottleneck method , 2000, SIGIR '00.
[15] Johannes Gehrke,et al. CACTUS—clustering categorical data using summaries , 1999, KDD '99.