An Improved Initialization Method using Firefly Movement and Light Intensity for Better Clustering Performance

K-Means and EM algorithms are the most well-known clustering algorithms because they are simple, easy to understand and implement. However, both algorithms are sensitive to initial seeds which are randomly selected leading to slow convergence and less reliable clustering results. In this paper, an improved initialization method adopted the concept of light intensity and firefly movement to search for better initial seeds. Numerical experiments were conducted to evaluate the performance of the Enhanced K-Means and EM using faculty performance evaluation ratings as the dataset. The experiments showed that the implementation of the improved initialization method before the clustering process resulted in a higher intra-cluster and lower inter-cluster similarity. Also, there are fifty-nine percent (59%) and sixty-three percent (63%) decrease in the runtime execution while there are forty-four percent (44%) and twenty-seven percent (27%) fewer number of iterations recorded in the implementation of the enhanced KMeans and EM algorithms respectively.

[1]  Iztok Fister,et al.  A comprehensive review of firefly algorithms , 2013, Swarm Evol. Comput..

[2]  Xin-She Yang,et al.  Firefly Algorithms for Multimodal Optimization , 2009, SAGA.

[3]  K. Mumtaz,et al.  A Novel Density based improved k-means Clustering Algorithm – Dbkmeans , 2010 .

[4]  Qi Li,et al.  Fast k-means algorithm clustering , 2011, ArXiv.

[5]  Ming-Huwi Horng,et al.  Multilevel Image Thresholding Selection Based on the Firefly Algorithm , 2010, 2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing.

[6]  Adigun Abimbola Adebisi,et al.  An Exploratory Study of K-Means and Expectation Maximization Algorithms , 2012 .

[7]  Zhengyu Hu,et al.  Initializing the EM Algorithm for Data Clustering and Sub-population Detection , 2015 .

[8]  S Sundararajan,et al.  AN EFFICIENT HYBRID APPROACH FOR DATA CLUSTERING USING DYNAMIC K-MEANS ALGORITHM AND FIREFLY ALGORITHM , 2014 .

[9]  Jose M. J. Costa,et al.  An Empirical Comparison of EM Initialization Methods and Model Choice Criteria for Mixtures of Skew-Normal Distributions , 2012 .

[10]  Bobby D. Gerardo,et al.  Enhanced Initial Centroids for K-means Algorithm , 2017 .

[11]  Johannes Blömer,et al.  Simple Methods for Initializing the EM Algorithm for Gaussian Mixture Models , 2013, ArXiv.

[12]  Divakar Singh,et al.  The Improved K-Means with Particle Swarm Optimization , 2013 .

[13]  Pakanati Chenna Reddy,et al.  Clustering large datasets using K-means modified inter and intra clustering (KM-I2C) in Hadoop , 2017, Journal of Big Data.

[14]  Kanwal Garg,et al.  IMPROVED EXPECTATION MAXIMIZATION CLUSTERING ALGORITHM , 2014 .

[15]  Husniza Husni,et al.  Weight-Based Firefly Algorithm for Document Clustering , 2013, DaEng.

[16]  V. Mani,et al.  Clustering using firefly algorithm: Performance study , 2011, Swarm Evol. Comput..

[17]  A DISTANCE BASED CLUSTERING ALGORITHM , 2014 .

[18]  Mohammad Reza Meybodi,et al.  A new hybrid approach for data clustering using firefly algorithm and K-means , 2012, The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012).

[19]  Chong-Wah Ngo,et al.  Boost K-Means , 2016, ArXiv.

[20]  Ajay Rana,et al.  K-means with Three different Distance Metrics , 2013 .

[21]  Wesam M. Ashour,et al.  Initializing K-Means Clustering Algorithm using Statistical Information , 2011 .

[22]  Claudomiro Sales,et al.  Genetic‐based EM algorithm to improve the robustness of Gaussian mixture models for damage detection in bridges , 2017 .

[23]  Anil Kumar Gupta,et al.  A Comparative study Between Fuzzy Clustering Algorithm and Hard Clustering Algorithm , 2014, ArXiv.

[24]  Smriti Sehgal,et al.  Image segmentation using firefly algorithm , 2016, 2016 International Conference on Information Technology (InCITe) - The Next Generation IT Summit on the Theme - Internet of Things: Connect your Worlds.

[25]  Mohammad Reza Meybodi,et al.  A Gaussian Firefly Algorithm , 2011 .

[26]  Can Yang,et al.  On the Convergence of the EM Algorithm: A Data-Adaptive Analysis , 2016, 1611.00519.

[27]  Kamaljit Kaur,et al.  Review of Existing Methods for Finding Initial Clusters in K-means Algorithm , 2013 .