Paper information - An Optimized repartitioned K-means Cluster algorithm using MapReduce Techniques for Big Data analysis-IJAERD

K-means is one of the simplest unsupervised learning algorithms for solving the well-known clustering problem. The procedure classifies a given data set into a certain number of clusters, k, fixed a priori. The main idea is to define k centers, one for each cluster. These centers should be placed carefully, because different starting locations lead to different results. In this research work, the proposed algorithm is expected to perform better when handling clusters of circularly distributed data points and slightly overlapping clusters.
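To make the main idea concrete, the following is a minimal sketch of the standard k-means iteration (Lloyd's algorithm) in plain Python. It is not the optimized repartitioned algorithm proposed in the paper, whose details are not given in the abstract. The function names (`assign`, `recompute`, `kmeans`) and the random-sample initialization are assumptions made for illustration. The split into an assignment step and an averaging step only mirrors the map and reduce phases conceptually; everything runs in a single process.

```python
import random
from collections import defaultdict


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(point, centers):
    """Map-like step: index of the center nearest to the point."""
    return min(range(len(centers)),
               key=lambda i: squared_distance(point, centers[i]))


def recompute(points, labels, centers):
    """Reduce-like step: each new center is the mean of its assigned points."""
    groups = defaultdict(list)
    for point, label in zip(points, labels):
        groups[label].append(point)
    new_centers = []
    for i, old in enumerate(centers):
        members = groups.get(i)
        if not members:
            # An empty cluster keeps its previous center (illustrative choice).
            new_centers.append(old)
        else:
            new_centers.append(
                tuple(sum(coord) / len(members) for coord in zip(*members)))
    return new_centers


def kmeans(points, k, max_iter=100, seed=0):
    """Plain k-means; the initial centers are k points sampled at random."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = []
    for _ in range(max_iter):
        labels = [assign(p, centers) for p in points]
        new_centers = recompute(points, labels, centers)
        if new_centers == centers:  # centers stopped moving: converged
            break
        centers = new_centers
    return centers, labels


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    final_centers, final_labels = kmeans(data, k=2)
    print(final_centers, final_labels)
```

Because the initial centers are sampled at random, changing `seed` can change the result on less well-separated data, which is the sensitivity to center placement that the abstract describes.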

T. Mohana Priya | Dr. A. Saradha | T. Priya
