Data Clustering with Particle Swarms

This paper presents a new proposal for data clustering based on the particle swarm optimization (PSO) algorithm. The human tendency of adapting its behavior due to the influence of the environment minimizing the differences in opinions and ideas through time and taking into account the past experiences characterizes an emergent social behavior. In the PSO algorithm, each individual in the population searches for a solution taking into account the best individual in a certain neighborhood and its own past best solution as well. In the present work, the PSO algorithm was adapted to position prototypes (particles) in regions of the space that represent natural clusters of the input data set. The proposed method, named particle swarm clustering (PSC) algorithm, was applied in an unsupervised fashion to a number of benchmark classification problems and to one bioinformatics dataset in order to evaluate its performance.

[1]  Russell C. Eberhart,et al.  Gene clustering using self-organizing maps and particle swarm optimization , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[2]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[3]  Roger E Bumgarner,et al.  Clustering gene-expression data with repeated measurements , 2003, Genome Biology.

[4]  Andries Petrus Engelbrecht,et al.  Dynamic clustering using particle swarm optimization with application in image segmentation , 2006, Pattern Analysis and Applications.

[5]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[6]  L. Festinger,et al.  A Theory of Cognitive Dissonance , 2017 .

[7]  Maurice Clerc,et al.  The particle swarm - explosion, stability, and convergence in a multidimensional complex space , 2002, IEEE Trans. Evol. Comput..

[8]  Andries Petrus Engelbrecht,et al.  Data clustering using particle swarm optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[9]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[10]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[11]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[12]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[13]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[14]  T. K. Baker,et al.  Temporal gene expression analysis of monolayer cultured rat hepatocytes. , 2001, Chemical research in toxicology.