Clustering using Cellular Genetic Algorithms

The goal of the clustering process is to find groups of similar patterns in multidimensional data. In this work, the clustering problem is approached using cellular genetic algorithms. The population structure adopted in the cellular genetic algorithm contributes to the population genetic diversity preventing the premature convergence to local optima. The performance of the proposed algorithm is evaluated on 13 test databases. An extension to the basic algorithm was also investigated to handle instances containing non-linearly separable data. The algorithm is compared with nine non-evolutionary classification techniques from the literature, and also compared with three nature inspired methodologies, namely Particle Swarm Optimization, Artificial Bee Colony, and the Firefly Algorithm. The cellular genetic algorithm attains the best result on a test database. A statistical ranking of the compared methods was made, and the proposed algorithm is ranked fifth overall.

[1]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[2]  Emanuel Falkenauer,et al.  Genetic Algorithms and Grouping Problems , 1998 .

[3]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[4]  Boris Mirkin,et al.  Mathematical Classification and Clustering , 1996 .

[5]  Clustering Using Improved Cuckoo Search Algorithm , 2014, ICSI.

[6]  Shokri Z. Selim,et al.  K-Means-Type Algorithms: A Generalized Convergence Theorem and Characterization of Local Optimality , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Taher Niknam,et al.  An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis , 2010, Appl. Soft Comput..

[8]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[9]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[10]  Fei Qiao,et al.  A novel memetic algorithm and its application to data clustering , 2013, Memetic Comput..

[11]  Enrique Alba,et al.  Cellular genetic algorithms , 2014, GECCO.

[12]  Ana L. N. Fred,et al.  Probabilistic consensus clustering using evidence accumulation , 2013, Machine Learning.

[13]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[14]  V. Mani,et al.  Clustering using firefly algorithm: Performance study , 2011, Swarm Evol. Comput..

[15]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[16]  Ivanoe De Falco,et al.  Facing classification problems with Particle Swarm Optimization , 2007, Appl. Soft Comput..

[17]  El-Ghazali Talbi,et al.  ParadisEO: A Framework for the Reusable Design of Parallel and Distributed Metaheuristics , 2004, J. Heuristics.

[18]  Dervis Karaboga,et al.  A novel clustering approach: Artificial Bee Colony (ABC) algorithm , 2011, Appl. Soft Comput..

[19]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[20]  Magdalene Marinaki,et al.  A hybrid stochastic genetic–GRASP algorithm for clustering analysis , 2008, Oper. Res..