FPA clust: evaluation of the flower pollination algorithm for data clustering

In this work, a standalone approach based on the flower pollination algorithm (FPA) is proposed for solving data clustering problems. The FPA is a nature-inspired algorithm simulating the behavior of flower pollination. The proposed approach is used to extract key information in terms of optimal cluster centers that are derived from training samples of the selected databases. These extracted cluster centers are then validated on test samples. Three datasets from the UCI machine learning data repository and an additional multi-spectral, real-time satellite image are chosen to illustrate the effectiveness and diversity of the proposed technique. The FPA performance is compared with the k-means, a popular clustering algorithm and metaheuristic algorithms, namely, the Genetic Algorithm, Particle Swarm Optimization, Cuckoo Search, Spider Monkey Optimization, Grey Wolf Optimization, Differential Evolution, Harmony Search and Bat Algorithm. The results are evaluated based on classification error percentage (CEP), time complexity and statistical significance. FPA has the lowest CEP for all four datasets and an average CEP of 28%, which is 5.5% lower than next best algorithm in that sense. The FPA is the second quickest algorithm to converge after HS algorithm. FPA also shows a higher level of statistical significance. Therefore, the obtained results show that the FPA efficiently clusters the data and performs better than the state-of-the-art methods.

[1]  Himanshu Mittal,et al.  An image segmentation method using logarithmic kbest gravitational search algorithm based superpixel clustering , 2018, Evolutionary Intelligence.

[2]  Seyed Jalaleddin Mousavirad,et al.  Multilevel image thresholding using entropy of histogram and recently developed population-based metaheuristic algorithms , 2017, Evol. Intell..

[3]  V. Mani,et al.  Clustering using firefly algorithm: Performance study , 2011, Swarm Evol. Comput..

[4]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[5]  Dervis Karaboga,et al.  A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm , 2007, J. Glob. Optim..

[6]  B. Kulkarni,et al.  An ant colony approach for clustering , 2004 .

[7]  Harish Sharma,et al.  Hybrid Artificial Bee Colony algorithm with Differential Evolution , 2017, Appl. Soft Comput..

[8]  Erwie Zahara,et al.  A hybridized approach to data clustering , 2008, Expert Syst. Appl..

[9]  J. Senthilnath,et al.  BELMKN: Bayesian Extreme Learning Machines Kohonen Network , 2018, Algorithms.

[10]  Jon Atli Benediktsson,et al.  A Novel Approach for Multispectral Satellite Image Classification Based on the Bat Algorithm , 2016, IEEE Geoscience and Remote Sensing Letters.

[11]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[12]  Narasimhan Sundararajan,et al.  A sequential multi-category classifier using radial basis function networks , 2008, Neurocomputing.

[13]  Pierre Hansen,et al.  Cluster analysis and mathematical programming , 1997, Math. Program..

[14]  V. Mani,et al.  Clustering Using Levy Flight Cuckoo Search , 2012, BIC-TA.

[15]  Piotr A. Kowalski,et al.  Clustering using flower pollination algorithm and Calinski-Harabasz index , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[16]  Ehl Emile Aarts,et al.  Simulated annealing and Boltzmann machines , 2003 .

[17]  Rozaida Ghazali,et al.  Hybrid of firefly algorithm and pattern search for solving optimization problems , 2018, Evol. Intell..

[18]  Xin-She Yang,et al.  Flower Pollination Algorithm for Global Optimization , 2012, UCNC.

[19]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[20]  Harish Sharma,et al.  Artificial bee colony algorithm with global and local neighborhoods , 2014, International Journal of System Assurance Engineering and Management.

[21]  Harish Sharma,et al.  Spider Monkey Optimization algorithm for numerical optimization , 2014, Memetic Computing.

[22]  Magdalene Marinaki,et al.  A hybrid stochastic genetic–GRASP algorithm for clustering analysis , 2008, Oper. Res..

[23]  Shashi Shekhar,et al.  Clustering and Information Retrieval , 2011, Network Theory and Applications.

[24]  K. Krishna,et al.  Genetic K-means algorithm. , 1999, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society.

[25]  D. Karaboga,et al.  A Simple and Global Optimization Algorithm for Engineering Problems: Differential Evolution Algorithm , 2004 .

[26]  Rui Wang,et al.  Flower Pollination Algorithm with Bee Pollinator for cluster analysis , 2016, Inf. Process. Lett..

[27]  Ujjwal Maulik,et al.  An evolutionary technique based on K-Means algorithm for optimal clustering in RN , 2002, Inf. Sci..

[28]  Taher Niknam,et al.  An Efficient Hybrid Evolutionary Algorithm for Cluster Analysis , 2008 .

[29]  Ivanoe De Falco,et al.  Evaluation of Particle Swarm Optimization Effectiveness in Classification , 2005, WILF.

[30]  V. Mani,et al.  A novel harmony search-based approach for clustering problems , 2016 .

[31]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[32]  Xin-She Yang,et al.  Flower pollination algorithm: A novel approach for multiobjective optimization , 2014, ArXiv.

[33]  Long Quan,et al.  A novel data clustering algorithm based on modified gravitational search algorithm , 2017, Eng. Appl. Artif. Intell..

[34]  Xin-She Yang,et al.  Swarm intelligence based algorithms: a critical analysis , 2013, Evolutionary Intelligence.

[35]  Shikha Mehta,et al.  Enhanced flower pollination algorithm on data clustering , 2016 .