Density base k-Mean's Cluster Centroid Initialization Algorithm

A spatial data mining is a process of extracting valid and useful information out of generated data, which recently becomes a highly demanding field due to the huge amount of data collected everyday across various applications domains which by far exceeded human’s ability to analyses, this brought about the development of many data mining tools among which clustering is recognized to be the efficient data mining method that categorized data based on similarity measures, where k-Means is a well-known clustering algorithm used across different application domains. Similarly, k-Means suffer from multiple limitations with its clustering accuracy fully depend on cluster center positioning. In this paper, a density base k-Means cluster centroid initialization algorithm has been proposed to overcome kMean’s cluster center initialization problem. To prove the accuracy of the proposed algorithm the evaluation test was conducted using two synthetic datasets called Jain and Path base dataset. The clustering accuracy result of the proposed algorithm is compared with that of traditional k-Means algorithm where it proved that the clustering accuracy of the proposed algorithm is better than that of traditional k-Means algorithm.

[1]  Ramandeep Kaur,et al.  A Survey of Clustering Techniques , 2010 .

[2]  Dorothy E. Denning,et al.  An Intrusion-Detection Model , 1987, IEEE Transactions on Software Engineering.

[3]  Wesam M. Ashour,et al.  Efficient Data Clustering Algorithms: Improvements over Kmeans , 2013 .

[4]  Xiaolong Su,et al.  An improved K-Means clustering algorithm , 2010, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[5]  Madhu Yedla,et al.  Enhancing K-means Clustering Algorithm with Improved Initial Center , 2010 .

[6]  K. Mumtaz,et al.  A Novel Density based improved k-means Clustering Algorithm – Dbkmeans , 2010 .

[7]  Fang Yuan,et al.  A new algorithm to get the initial centroids , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[8]  Till Biskup,et al.  Author Profile , 2013 .