Applying the Coral Reefs Optimization algorithm to clustering problems

Several clustering algorithms have been developed and applied to a great variety of problems in different fields. However, some of these algorithms have limitations. Bio-inspired algorithms have been applied to clustering problems aiming to overcome some of these limitations. In this paper, we apply the Coral Reefs Optimization (CRO) algorithm to clustering problems. The CRO algorithm has been originally proposed for classical optimization problems. In this paper, this algorithm will be adjusted to provide a good clustering partition for a dataset. In addition, we also propose three new modifications of this algorithm and an index to be used as objective function for the optimization techniques. In order to evaluate the effectiveness of the CRO algorithm and the proposed extensions when dealing with real data, we conduct a comparison analysis with another bio-inspired algorithm, a hybrid genetic algorithm proposed for solving clustering problems. In this analysis, two clustering validity measures are employed to measure the generated clusters by the bio-inspired algorithms. We also use two objective functions (TWCV and MX index) in the reproduction process of the analysed algorithms.

[1]  Taher Niknam,et al.  Application of a New Hybrid Optimization Algorithm on Cluster Analysis , 2008 .

[2]  Fushuan Wen,et al.  A modified harmony search based method for optimal rural radial line planning , 2010 .

[3]  M. Cugmas,et al.  On comparing partitions , 2015 .

[4]  Khaled S. Al-Sultan,et al.  A Tabu search approach to the clustering problem , 1995, Pattern Recognit..

[5]  Antonio J. Nebro,et al.  jMetal: A Java framework for multi-objective optimization , 2011, Adv. Eng. Softw..

[6]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[8]  Sanjay Jasola,et al.  A hybrid sequential approach for data clustering using K-Means and particle swarm optimization algorithm , 2011 .

[9]  Yan Liu,et al.  A Hybrid Tabu Search Based Clustering Algorithm , 2005, KES.

[10]  Jun Du,et al.  Alternative Clustering by Utilizing Multi-objective Genetic Algorithm with Linked-List Based Chromosome Encoding , 2005, MLDM.

[11]  J. A. Portilla-Figueras,et al.  The Coral Reefs Optimization Algorithm: A Novel Metaheuristic for Efficiently Solving Optimization Problems , 2014, TheScientificWorldJournal.

[12]  Georgios C. Anagnostopoulos,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2003, Lecture Notes in Computer Science.

[13]  Kevin Cheng,et al.  An ACO-Based Clustering Algorithm , 2006, ANTS Workshop.

[14]  Joydeep Ghosh,et al.  Data Clustering Algorithms And Applications , 2013 .

[15]  Mukesh M. Raghuwanshi,et al.  Genetic Algorithm Based Clustering: A Survey , 2008, 2008 First International Conference on Emerging Trends in Engineering and Technology.

[16]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[17]  O. A. Mohamed Jafar,et al.  A Study of Bio-inspired Algorithm to Data Clustering using Different Distance Measures , 2013 .

[18]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[20]  Ujjwal Maulik,et al.  An evolutionary technique based on K-Means algorithm for optimal clustering in RN , 2002, Inf. Sci..

[21]  Javier Del Ser,et al.  A new grouping genetic algorithm for clustering problems , 2012, Expert Syst. Appl..

[22]  G. W. Milligan,et al.  A Study of the Comparability of External Criteria for Hierarchical Cluster Analysis. , 1986, Multivariate behavioral research.