Multiobjective clustering analysis using particle swarm optimization

A multiobjective clustering method based on particle swarm optimization is proposed. Two objective functions are used to measure the cohesion and connectivity of clusters. The method adaptively finds the optimal number of clusters. It is tested on 27 benchmark datasets in terms of accuracy and robustness. It outperformed four state-of-the-art clustering algorithms in most cases.

Clustering is a significant data mining task that partitions datasets based on similarities among data. It plays an important role in the rapidly growing field of exploratory data analysis. A key difficulty of effective clustering is defining proper grouping criteria that reflect fundamentally different aspects of a good clustering solution, such as compactness and separation of clusters. Moreover, conventional clustering algorithms consider only a single criterion, which may not conform to the diverse and complex shapes of the underlying clusters. In this study, partitional clustering is defined as a multiobjective optimization problem. The aim is to obtain well-separated, connected, and compact clusters; for this purpose, two objective functions are defined based on the concepts of data connectivity and cohesion. These functions form the core of an efficient multiobjective particle swarm optimization algorithm, which has been devised for and applied to the automatic grouping of large unlabeled datasets. A comprehensive experimental study is conducted, and the results are compared with those of four other state-of-the-art clustering techniques. The study shows that the proposed algorithm can find the optimal number of clusters, is robust, and outperforms the other methods in most cases on the selected benchmark datasets.
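To make the two-objective formulation concrete, the following is a minimal sketch under stated assumptions, not the paper's implementation. The abstract does not give the exact objective definitions, so `cohesion` (summed distance to the cluster centroid) and `connectivity` (a nearest-neighbour penalty) are stand-ins built from commonly used definitions, and `dominates`, `n_neighbors`, and the toy data are hypothetical names and values chosen for illustration. Both objectives are minimised, and one partition is preferred over another when it Pareto-dominates it.

```python
import math


def cohesion(points, labels):
    """Sum of distances from each point to its cluster centroid (lower is better).

    Assumed stand-in for the paper's cohesion objective.
    """
    clusters = {}
    for p, c in zip(points, labels):
        clusters.setdefault(c, []).append(p)
    total = 0.0
    for members in clusters.values():
        centroid = [sum(col) / len(members) for col in zip(*members)]
        total += sum(math.dist(p, centroid) for p in members)
    return total


def connectivity(points, labels, n_neighbors=2):
    """Add a penalty of 1/j whenever the j-th nearest neighbour of a point
    is assigned to a different cluster (lower is better).

    Assumed stand-in for the paper's connectivity objective.
    """
    penalty = 0.0
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: math.dist(p, points[j]))
        for rank, j in enumerate(others[:n_neighbors], start=1):
            if labels[j] != labels[i]:
                penalty += 1.0 / rank
    return penalty


def dominates(a, b):
    """True if objective vector a Pareto-dominates b under minimisation."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


# Toy data: two visually obvious groups of three points each.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
good = [0, 0, 0, 1, 1, 1]   # matches the two groups
bad = [0, 1, 0, 1, 0, 1]    # mixes the two groups

f_good = (cohesion(points, good), connectivity(points, good))
f_bad = (cohesion(points, bad), connectivity(points, bad))
print(dominates(f_good, f_bad))
```

In a multiobjective particle swarm optimizer, a dominance test of this kind would typically decide which candidate partitions enter the archive of non-dominated solutions; how the paper encodes particles and selects a final solution is not stated in the abstract.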
