Data clustering using K-Means based on Crow Search Algorithm

Cluster analysis is one of the popular data mining techniques and it is defined as the process of grouping similar data. K-Means is one of the clustering algorithms to cluster the numerical data. The features of K-Means clustering algorithm are easy to implement and it is efficient to handle large amounts of data. The major problem with K-Means is the selection of initial centroids. It selects the initial centroids randomly and it leads to a local optimum solution. Recently, nature-inspired optimization algorithms are combined with clustering algorithms to obtain the global optimum solution. Crow Search Algorithm (CSA) is a new population-based metaheuristic optimization algorithm. This algorithm is based on the intelligent behaviour of the crows. In this paper, CSA is combined with the K-Means clustering algorithm to obtain the global optimum solution. Experiments are conducted on benchmark datasets and the results are compared to those from various clustering algorithms and optimization-based clustering algorithms. Also the results are evaluated with internal, external and statistical experiments to prove the efficiency of the proposed algorithm.

[1]  Nigel Webb,et al.  Introduction to computational mathematics , 1987, The Mathematical Gazette.

[2]  Yi-Zeng Liang,et al.  Cluster analysis by the K-means algorithm and simulated annealing , 1994 .

[3]  Amir Hossein Alavi,et al.  Krill herd: A new bio-inspired optimization algorithm , 2012 .

[4]  Khaled S. Al-Sultan,et al.  A Tabu search approach to the clustering problem , 1995, Pattern Recognit..

[5]  Simon Fong,et al.  Integrating nature-inspired optimization algorithms to K-means clustering , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[6]  Giuliano Armano,et al.  Clustering Analysis with Combination of Artificial Bee Colony Algorithm and k-Means Technique , 2014 .

[7]  Xin-She Yang,et al.  Firefly Algorithm, Lévy Flights and Global Optimization , 2010, SGAI Conf..

[8]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[9]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[10]  Alireza Askarzadeh,et al.  A novel metaheuristic method for solving constrained engineering optimization problems: Crow search algorithm , 2016 .

[11]  Salwani Abdullah,et al.  A combined approach for clustering based on K-means and gravitational search algorithms , 2012, Swarm Evol. Comput..

[12]  Frans van den Bergh,et al.  An analysis of particle swarm optimizers , 2002 .

[13]  Xin-She Yang Introduction to Computational Mathematics , 2008 .

[14]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[15]  Jue Lu,et al.  A new hybrid clustering algorithm based on K-means and ant colony algorithm , 2013 .

[16]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[17]  Shokri Z. Selim,et al.  A simulated annealing algorithm for the clustering problem , 1991, Pattern Recognit..

[18]  Andries Petrus Engelbrecht,et al.  Data clustering using particle swarm optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[19]  Dervis Karaboga,et al.  A novel clustering approach: Artificial Bee Colony (ABC) algorithm , 2011, Appl. Soft Comput..

[20]  Xin-She Yang,et al.  A New Metaheuristic Bat-Inspired Algorithm , 2010, NICSO.

[21]  S. Brooks,et al.  Optimization Using Simulated Annealing , 1995 .

[22]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[23]  Xin-She Yang,et al.  Cuckoo search: recent advances and applications , 2013, Neural Computing and Applications.

[24]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[25]  Mohammad Reza Meybodi,et al.  A new hybrid approach for data clustering using firefly algorithm and K-means , 2012, The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012).

[26]  Simon Fong,et al.  Wolf search algorithm with ephemeral memory , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[27]  Manuel Laguna,et al.  Tabu Search , 1997 .

[28]  Dantong Ouyang,et al.  An artificial bee colony approach for clustering , 2010, Expert Syst. Appl..

[29]  Charles E. Taylor Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. Complex Adaptive Systems.John H. Holland , 1994 .

[30]  Budi Santosa,et al.  Cat Swarm Optimization for Clustering , 2009, 2009 International Conference of Soft Computing and Pattern Recognition.

[31]  H. Modares,et al.  Combining PSO and k-means to enhance data clustering , 2008, 2008 International Symposium on Telecommunications.

[32]  Dervis Karaboga,et al.  A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm , 2007, J. Glob. Optim..

[33]  B. Kulkarni,et al.  An ant colony approach for clustering , 2004 .

[34]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[35]  Hossein Nezamabadi-pour,et al.  GSA: A Gravitational Search Algorithm , 2009, Inf. Sci..

[36]  Ching-Yi Chen,et al.  Particle swarm optimization algorithm and its application to clustering analysis , 2004, 2012 Proceedings of 17th Conference on Electrical Power Distribution.

[37]  Pei-wei Tsai,et al.  Cat Swarm Optimization , 2006, PRICAI.

[38]  Yan Liu,et al.  A Hybrid Tabu Search Based Clustering Algorithm , 2005, KES.

[39]  Xin-She Yang,et al.  Cuckoo Search via Lévy flights , 2009, 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC).

[40]  Dervis Karaboga,et al.  Artificial Bee Colony (ABC) Optimization Algorithm for Solving Constrained Optimization Problems , 2007, IFSA.

[41]  Edgar N. Reyes,et al.  Optimization using simulated annealing , 1998, Northcon/98. Conference Proceedings (Cat. No.98CH36264).