A hybrid sequential approach for data clustering using K-Means and particle swarm optimization algorithm

Clustering is a widely used technique of finding interesting patterns residing in the dataset that are not obviously known. The K-Means algorithm is the most commonly used partitioned clustering algorithm because it can be easily implemented and is the most efficient in terms of the execution time. However, due to its sensitiveness to initial partition it can only generate a local optimal solution. Particle Swarm Optimization (PSO) technique offers a globalized search methodology but suffers from slow convergence near optimal solution. In this paper, we present a new Hybrid Sequential clustering approach, which uses PSO in sequence with K-Means algorithm for data clustering. The proposed approach overcomes drawbacks of both algorithms, improves clustering and avoids being trapped in a local optimal solution. Experiments on four kinds of data sets have been conducted. The obtained results are compared with K-Means, PSO, Hybrid, K-Means+Genetic Algorithm and it has been found that the proposed algorithm generates more accurate, robust and better clustering results. International Journal of Engineering, Science and Technology, Vol. 2, No. 6, 2010, pp. 167-176

[1]  Ching-Yi Chen,et al.  Particle swarm optimization algorithm and its application to clustering analysis , 2004, 2012 Proceedings of 17th Conference on Electrical Power Distribution.

[2]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[3]  Dao-Qiang Zhang,et al.  A novel kernelized fuzzy C-means algorithm with application in medical image segmentation , 2004, Artif. Intell. Medicine.

[4]  Saso Dzeroski,et al.  OntoDM: An Ontology of Data Mining , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[5]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Xindong Wu Data mining: artificial intelligence in data analysis , 2004 .

[7]  Sushmita Mitra,et al.  Data Mining , 2003 .

[8]  K. alik An efficient k'-means clustering algorithm , 2008 .

[9]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[10]  Chieh-Yuan Tsai,et al.  Developing a feature weight self-adjustment mechanism for a K-means clustering algorithm , 2008, Comput. Stat. Data Anal..

[11]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[12]  Rajesh Kumar,et al.  Optimal placement of Phasor Measurement Units using Particle swarm Optimization , 2009, 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC).

[13]  B. Hawkins,et al.  A framework: , 2020, Harmful Interaction between the Living and the Dead in Greek Tragedy.

[14]  V. Saravanan,et al.  A Framework of an Automated Data Mining System Using Autonomous Intelligent Agents , 2008, 2008 International Conference on Computer Science and Information Technology.

[15]  Amit Kumar,et al.  Linear-time approximation schemes for clustering problems in any dimensions , 2010, JACM.

[16]  Andries Petrus Engelbrecht,et al.  Data clustering using particle swarm optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[17]  Thomas E. Potok,et al.  Document clustering using particle swarm optimization , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[18]  David M. Mount,et al.  A local search approximation algorithm for k-means clustering , 2002, SCG '02.

[19]  Russell C. Eberhart,et al.  Parameter Selection in Particle Swarm Optimization , 1998, Evolutionary Programming.

[20]  Mohammed El-Abd,et al.  Information exchange in multiple cooperating swarms , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[21]  Mauro Birattari,et al.  Swarm Intelligence , 2012, Lecture Notes in Computer Science.