Multi-objective cluster analysis using a gradient evolution algorithm

Data analysis becomes more important since rapid technology developments. In data analysis, data clustering is one of the very useful approaches. It can reveal important information hiding inside the dataset by organizing the instances based on their similarity. The objectives of data clustering are maximizing dissimilarity between clusters and minimizing dissimilarity within clusters. In order to construct a good clustering results, many clustering algorithms have been proposed, including the metaheuristic-based clustering algorithms. Recently, a new metaheuristic algorithm named gradient evolution has been proposed. This algorithm shows a good performance on solving the optimization problems. Therefore, this paper employs this GE algorithm for solving the clustering problem. In order to obtain a better clustering result, this paper considers multi-objective clustering instead of single-objective clustering. In this paper, the original GE algorithm is improved so then it is suitable for the multi-objective problem. The proposed modification includes the procedure for vector updating and jumping which involves Pareto rank assignment. In addition, it also employs K -means algorithm to provide the final clustering result. The proposed algorithm is verified using some benchmark datasets. It is also compared with some other multi-objective metaheuristic-based clustering algorithms. The experimental results show that the proposed algorithm can obtain better results than other metaheuristic-based algorithms.

[1]  R. J. Kuo,et al.  Non-dominated sorting genetic algorithm using fuzzy membership chromosome for categorical data clustering , 2015, Appl. Soft Comput..

[2]  David E. Goldberg,et al.  A niched Pareto genetic algorithm for multiobjective optimization , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[3]  Andries P. Engelbrecht,et al.  A cooperative multi-population approach to clustering temporal data , 2013, 2013 IEEE Congress on Evolutionary Computation.

[4]  Marco Laumanns,et al.  A Spatial Predator-Prey Approach to Multi-objective Optimization: A Preliminary Study , 1998, PPSN.

[5]  Ganapati Panda,et al.  A survey on nature inspired metaheuristic algorithms for partitional clustering , 2014, Swarm Evol. Comput..

[6]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Clustering , 2015, ACM Comput. Surv..

[7]  Dervis Karaboga,et al.  Artificial Bee Colony based image clustering method , 2012, 2012 IEEE Congress on Evolutionary Computation.

[8]  C.A. Coello Coello,et al.  MOPSO: a proposal for multiple objective particle swarm optimization , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[9]  Rajesh Kumar,et al.  Real-Time Implementation of a Harmony Search Algorithm-Based Clustering Protocol for Energy-Efficient Wireless Sensor Networks , 2014, IEEE Transactions on Industrial Informatics.

[10]  Craig A. Knoblock,et al.  A Survey of Digital Map Processing Techniques , 2014, ACM Comput. Surv..

[11]  David Corne,et al.  The Pareto archived evolution strategy: a new baseline algorithm for Pareto multiobjective optimisation , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[12]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[13]  R. J. Kuo,et al.  The gradient evolution algorithm: A new metaheuristic , 2015, Inf. Sci..

[14]  Taher Niknam,et al.  An efficient hybrid evolutionary optimization algorithm based on PSO and SA for clustering , 2009 .

[15]  Spiros Mancoridis,et al.  Automatic clustering of software systems using a genetic algorithm , 1999, STEP '99. Proceedings Ninth International Workshop Software Technology and Engineering Practice.

[16]  Y. Baghzouz,et al.  Genetic-Algorithm-Based Optimization Approach for Energy Management , 2013, IEEE Transactions on Power Delivery.

[17]  R. J. Kuo,et al.  Application of a hybrid of genetic algorithm and particle swarm optimization algorithm for order clustering , 2010, Decis. Support Syst..

[18]  Peter Xiaoping Liu,et al.  Online data-driven fuzzy clustering with applications to real-time robotic tracking , 2004, IEEE Transactions on Fuzzy Systems.

[19]  Pierre Hansen,et al.  New heuristic for harmonic means clustering , 2015, J. Glob. Optim..

[20]  Dervis Karaboga,et al.  AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION , 2005 .

[21]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[22]  R. Eberhart,et al.  Empirical study of particle swarm optimization , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[23]  P. Siarry,et al.  Multiobjective Optimization: Principles and Case Studies , 2004 .

[24]  Sudipta Acharya,et al.  Multiobjective Simulated Annealing-Based Clustering of Tissue Samples for Cancer Diagnosis , 2016, IEEE Journal of Biomedical and Health Informatics.

[25]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[26]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[27]  Asif Ekbal,et al.  Semi-supervised clustering for gene-expression data in multiobjective optimization framework , 2015, International Journal of Machine Learning and Cybernetics.

[28]  Salwani Abdullah,et al.  A combined approach for clustering based on K-means and gravitational search algorithms , 2012, Swarm Evol. Comput..

[29]  Adil M. Bagirov,et al.  A heuristic algorithm for solving the minimum sum-of-squares clustering problems , 2015, J. Glob. Optim..

[30]  T Watson Layne,et al.  A Genetic Algorithm Approach to Cluster Analysis , 1998 .

[31]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[32]  R. Tavakkoli-Moghaddam,et al.  A hybrid fuzzy clustering PSO algorithm for a clustering supplier problem , 2007, 2007 IEEE International Conference on Industrial Engineering and Engineering Management.

[33]  E ZulviaFerani The gradient evolution algorithm , 2015 .

[34]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[35]  Abdolreza Hatamlou,et al.  Black hole: A new heuristic optimization approach for data clustering , 2013, Inf. Sci..

[36]  Richard M. Leahy,et al.  An Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[39]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[40]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.