论文信息 - An Iterative Improved k-means Clustering

An Iterative Improved k-means Clustering

Clustering is a data mining (machine learning), unsupervised learning technique used to place data elements into related groups without advance knowledge of the group definitions. One of the most popular and widely studied clustering methods that minimize the clustering error for points in Euclidean space is called K-means clustering. However, the k-means method converges to one of many local minima, and it is known that the final results depend on the initial starting points (means). In this research paper, we have introduced and tested an improved algorithm to start the k- means with good starting points (means). The good initial starting points allow k-means to converge to a better local minimum; also the numbers of iteration over the full dataset are being decreased. Experimental results show that initial starting points lead to good solution reducing the number of iterations to form a cluster.

Navi Mumbai | Nareshkumar D. Harale | Madhuri A. Dalal | Umesh L. Kulkarni

[1] atherine,et al. Finding the number of clusters in a data set : An information theoretic approach C , 2003 .

[2] Jiancheng Luo,et al. A modified clustering algorithm for data mining , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[3] Paul S. Bradley,et al. Refining Initial Points for K-Means Clustering , 1998, ICML.

[4] Siddheswar Ray,et al. Determination of Number of Clusters in K-Means Clustering and Application in Colour Image Segmentation , 2000 .

[5] Joaquín Pérez Ortega,et al. Research issues on K-means Algorithm : An Experimental Trial Using Matlab , 2009 .

[6] Anil K. Jain,et al. Data clustering: a review , 1999, CSUR.

[7] Wei Li. Modified K-Means Clustering Algorithm , 2008, 2008 Congress on Image and Signal Processing.

[8] Chien-Hsing Chou,et al. Short Papers , 2001 .

[9] Pedro Larrañaga,et al. An empirical comparison of four initialization methods for the K-Means algorithm , 1999, Pattern Recognit. Lett..

[10] Shehroz S. Khan,et al. Cluster center initialization algorithm for K-means clustering , 2004, Pattern Recognit. Lett..

[11] D. Pham,et al. Selection of K in K-means clustering , 2005 .

[12] Anil K. Jain,et al. Algorithms for Clustering Data , 1988 .

[13] Abdel-Badeeh M. Salem,et al. An efficient enhanced k-means clustering algorithm , 2006 .