Determining the k in k-means with MapReduce
暂无分享,去创建一个
Olivier Thonnard | Wim Mees | Thibault Debatty | Pietro Michiardi | W. Mees | P. Michiardi | O. Thonnard | Thibault Debatty | Olivier Thonnard | Pietro Michiardi
[1] R. L. Thorndike. Who belongs in the family? , 1953 .
[2] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .
[3] J. C. Dunn,et al. A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .
[4] J. A. Hartigan,et al. A k-means clustering algorithm , 1979 .
[5] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.
[6] P. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .
[7] Hans-Peter Kriegel,et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.
[8] Andrew W. Moore,et al. Accelerating exact k-means algorithms with geometric reasoning , 1999, KDD '99.
[9] Hans-Peter Kriegel,et al. OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.
[10] Philip S. Yu,et al. Fast algorithms for projected clustering , 1999, SIGMOD '99.
[11] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[12] Robert Tibshirani,et al. Estimating the number of clusters in a data set via the gap statistic , 2000 .
[13] Andrew W. Moore,et al. X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.
[14] Jeremy Buhler,et al. Efficient large-scale sequence comparison by locality-sensitive hashing , 2001, Bioinform..
[15] Michael N. Vrahatis,et al. The New k-Windows Algorithm for Improving the k-Means Clustering Algorithm , 2002, J. Complex..
[16] Catherine A. Sugar,et al. Finding the Number of Clusters in a Dataset , 2003 .
[17] Greg Hamerly,et al. Learning the k in k-means , 2003, NIPS.
[18] Ming Li,et al. Clustering by compression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..
[19] Sariel Har-Peled,et al. On coresets for k-means and k-median clustering , 2004, STOC '04.
[20] Sariel Har-Peled,et al. Coresets for $k$-Means and $k$-Median Clustering and their Applications , 2018, STOC 2004.
[21] Sergei Vassilvitskii,et al. k-means++: the advantages of careful seeding , 2007, SODA '07.
[22] Anil K. Jain. Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..
[23] Sergei Vassilvitskii,et al. Scalable K-Means++ , 2012, Proc. VLDB Endow..