Random Centroid Selection for K-means Clustering: A Proposed Algorithm for Improving Clustering Results

Primary motivation towards this study was to obtain better clustering results from K-means clustering algorithm. Several studies had provided relevant findings showing normal clustering algorithm and scope of improvements regarding clustering accuracy. With the aim to increase clustering accuracy, a conceptual notion of Genetic Algorithm is utilized in K-means clustering algorithm. Depending on the Genetic algorithm concepts, an improved clustering technique is proposed in this study for obtaining more accurate and more precise clustering outcomes. The contribution from this study could be essential in terms of topic-modelled data and clustering text documents.

[1]  Xuan Li,et al.  An Improved K-means Text Clustering Algorithm by Optimizing Initial Cluster Centers , 2016, 2016 7th International Conference on Cloud Computing and Big Data (CCBD).

[2]  M RajeshKumar,et al.  Sentiment analysis on speaker specific speech data , 2017, 2017 International Conference on Intelligent Computing and Control (I2C2).

[3]  Manjusha Pandey,et al.  AKM—Augmentation of K-Means Clustering Algorithm for Big Data , 2018 .

[4]  Juanjuan He,et al.  Applying K-means Clustering and Genetic Algorithm for Solving MTSP , 2016, BIC-TA.

[5]  Anindya Das,et al.  An Empirical Evaluation of k-Means Clustering Technique and Comparison , 2019, 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon).

[6]  Clara Pizzuti,et al.  A K-means Based Genetic Algorithm for Data Clustering , 2016, SOCO-CISIS-ICEUTE.

[7]  Mohd Dilshad Ansari,et al.  On K-means data clustering algorithm with genetic algorithm , 2016, 2016 Fourth International Conference on Parallel, Distributed and Grid Computing (PDGC).

[8]  Shilpi Sharma,et al.  Novel technique for prediction analysis using normalization for an improvement in K-means clustering , 2016, 2016 International Conference on Information Technology (InCITe) - The Next Generation IT Summit on the Theme - Internet of Things: Connect your Worlds.