BHOHS: A Two Stage Novel Algorithm for Data Clustering

Data clustering is a collection of data objects that are similar to one another within in the same cluster and are dissimilar to the objects in other clusters. In this paper we proposed new algorithm called black hole optimization with a heuristic search algorithm to overcome the problem of the BHO algorithm. The black hole optimization is used to produce an initial solution to the clustering problem and then a heuristic search algorithm is applied to improve the quality of this solution by searching around it. The black hole optimization algorithm starts with an initial population of candidate solutions to an optimization problem and for each star, evaluate the objective function. Then at each iteration of the black hole optimization algorithm, the best star is selected to be the black hole and all other candidate forms the normal stars. If a star moves too close to the black hole, it will be absorbed by the black hole and permanently disappear. The black hole optimization algorithm converges rapidly during the starting stage of the search process, but it goes near global optimal solution, the convergence speed also will decrease. So, it may get trapped in local optimal over a certain number of iterations. To evaluate the performance of BHOHS can be compared with k-means, particle swarm optimization (PSO) algorithm and black hole optimization algorithm (BHO).

[1]  E.P.J. van den Heuvel,et al.  Black holes in binaries and galactic nuclei , 2001 .

[2]  Clifford A. Pickover Black Holes: A Traveler's Guide , 1996 .

[3]  Bijaya K. Panigrahi,et al.  Multi-objective optimization with artificial weed colonies , 2011, Inf. Sci..

[4]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[5]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[6]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[7]  Xin-She Yang,et al.  Nature-Inspired Metaheuristic Algorithms , 2008 .

[8]  Salwani Abdullah,et al.  Application of Gravitational Search Algorithm on Data Clustering , 2011, RSKT.

[9]  Riccardo Giacconi,et al.  Black holes in binaries and galactic nuclei, diagnostics, demography, and formation : proceedings of the ESO workshop held at Garching, Germany, 6-8 September 1999, in honour of Riccardo Giacconi , 2001 .

[10]  Ying Tan,et al.  Random black hole particle swarm optimization and its application , 2008, 2008 International Conference on Neural Networks and Signal Processing.

[11]  Zülal Güngör,et al.  K-harmonic means data clustering with simulated annealing heuristic , 2007, Appl. Math. Comput..

[12]  Salwani Abdullah,et al.  Data Clustering Using Big Bang–Big Crunch Algorithm , 2011 .

[13]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[14]  Abdolreza Hatamlou,et al.  Black hole: A new heuristic optimization approach for data clustering , 2013, Inf. Sci..

[15]  Kyoung-jae Kim,et al.  A recommender system using GA K-means clustering in an online shopping market , 2008, Expert Syst. Appl..

[16]  Richard H. Price,et al.  Black Holes , 1997 .

[17]  Salwani Abdullah,et al.  A combined approach for clustering based on K-means and gravitational search algorithms , 2012, Swarm Evol. Comput..