COMPARISON OF PARTITION BASED CLUSTERING ALGORITHMS

Data mining refers to extracting or “mining” knowledge from large amounts of data. Clustering is one of the most important research areas in the field of data mining. Clustering means creating groups of objects based on their features in such a way that the objects belonging to the same groups are similar and those belonging in different groups are dissimilar. In this paper, the most representative partition based clustering algorithms are described and categorized based on their basic approach. The best algorithm is found out based on their performance. Two of the clustering algorithms, namely, Centroid based k-means, Representative object based k-medoids are implemented by using JAVA and their performance is analyzed based on their clustering quality. The randomly distributed data points are taken as input to these algorithms and clusters are found out for each algorithm. The algorithm’s performance is analyzed by different runs on the input data points. The experimental results are given as both graphical as well as tabular representation.