Improved activation schema on Automatic Clustering using Differential Evolution algorithm

Partitional Clustering is one of the major techniques in Unsupervised Learning in which similar data are put into the same partition. Besides partitioning the unlabeled data, determining the optimal number of partitions is also another main concern in the field of data clustering. Automatic Clustering Differential Evolution (ACDE) is one of the state-of-the-art algorithms that address this concern. In ACDE, the mechanism to determine the optimal number of clusters is by encoding the activation value of each cluster centroid into the chromosome with fixed threshold value. However, it could be argued that a fixed threshold value would be seen as arbitrary, but a varying and adaptive threshold value could yield a solution that would better reflect the quality of clusters. In this paper, a new changing schema of threshold values is introduced for adaptively activating the clusters in the chromosomes, and a heuristic approach is implemented for adjusting the threshold values of each cluster according to their individual quality measurements. The results of several experiments show that the proposed algorithm performed generally better than other state-of-the-art automatic evolutionary clustering algorithms.

[1]  Sandra Paterlini,et al.  Evolutionary Approaches for Cluster Analysis , 2003 .

[2]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[3]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[4]  Yee Leung,et al.  Clustering by Scale-Space Filtering , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[7]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[8]  Ujjwal Maulik,et al.  Genetic clustering for automatic evolution of clusters and application to image classification , 2002, Pattern Recognit..

[9]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[10]  Andries Petrus Engelbrecht,et al.  Dynamic clustering using particle swarm optimization with application in image segmentation , 2006, Pattern Analysis and Applications.

[11]  Dinesh Kumar,et al.  Automatic cluster evolution using gravitational search algorithm and its application on image segmentation , 2014, Eng. Appl. Artif. Intell..

[12]  Michalis Vazirgiannis,et al.  On Clustering Validation Techniques , 2001, Journal of Intelligent Information Systems.

[13]  Sandra Paterlini,et al.  Differential evolution and particle swarm optimisation in partitional clustering , 2006, Comput. Stat. Data Anal..

[14]  M.-C. Su,et al.  A new cluster validity measure and its application to image compression , 2004, Pattern Analysis and Applications.

[15]  Amit Konar,et al.  Automatic kernel clustering with a Multi-Elitist Particle Swarm Optimization Algorithm , 2008, Pattern Recognit. Lett..

[16]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[17]  P. Brucker On the Complexity of Clustering Problems , 1978 .

[18]  Sandra Paterlini,et al.  High performance clustering with differential evolution , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[19]  E. Forgy,et al.  Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[20]  Wilfrido Gómez-Flores,et al.  Automatic clustering using nature-inspired metaheuristics: A survey , 2016, Appl. Soft Comput..

[21]  Marco Mora,et al.  New Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance , 2013, 2013 32nd International Conference of the Chilean Computer Science Society (SCCC).