Paper Information - Favoring the k-Means Algorithm with Initialization Methods

Clustering algorithms are unsupervised learning algorithms and, among the many available, k-Means is one of the most popular and successful. Its performance, however, depends heavily on a ‘good’ initialization of the k group centers (centroids) and on the value chosen for the number of groups (k) in the final clustering. This chapter reports experiments with five initialization algorithms from the literature, namely Method1, k-Means++, CCIA, Maedeh&Suresh and SPSS, to empirically evaluate their contribution to improving k-Means performance.

Maria do Carmo Nicoletti (M. C. Nicoletti), Anderson Francisco de Oliveira (A. D. Oliveira)

[1] J. MacQueen. Some methods for classification and analysis of multivariate observations, 1967.

[2] P. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, 1987.

[3] William M. Rand, et al. Objective Criteria for the Evaluation of Clustering Methods, 1971.

[4] Aristides Gionis, et al. Clustering aggregation, 2005, 21st International Conference on Data Engineering (ICDE'05).

[5] Sergei Vassilvitskii, et al. k-means++: the advantages of careful seeding, 2007, SODA '07.

[6] K. Karteeka Pavan, et al. Robust seed selection algorithm for k-means type algorithms, 2011, ArXiv.

[7] Stuart A. Roberts, et al. New methods for the initialisation of clusters, 1996, Pattern Recognit. Lett.

[8] David J. Hand, et al. The Data Sets, 1994.

[9] Shehroz S. Khan, et al. Cluster center initialization algorithm for K-means clustering, 2004, Pattern Recognit. Lett.

[10] Jian Pei, et al. Data Mining: Concepts and Techniques, 3rd Edition, 2012.

[11] Dimitrios Gunopulos, et al. A clustering framework based on subjective and objective validity criteria, 2008, TKDD.