Multi-clustering via evolutionary multi-objective optimization

Abstract The choice of the number of clusters (k) remains challenging for clustering methods. Instead of determining k, the implicit parallelism feature of evolutionary multi-objective optimization (EMO) provides an effective and efficient paradigm to find the optimal clustering in a posteriori manner. That is, first EMO algorithms are employed to search for a set of non-dominated solutions, representing different clustering results with different k. Then, a certain validity index is used to select the optimal clustering result. This study systematically investigates the use of EMO for multi-clustering (i.e., searching for multiple clustering simultaneously). An effective bi-objective model is built wherein the number of clusters and the sum of squared distances (SSD) between data points and their cluster centroids are considered as objectives. To ensure the two objectives are conflicting with each other, a novel transformation strategy is applied to the SSD. Then, the model is solved by an EMO algorithm. The derived paradigm, EMO-k-clustering, is examined on three datasets of different properties where NSGA-II serves as the EMO algorithm. Experimental results show that the proposed bi-objective model is effective. EMO-k-clustering is able to efficiently obtain all the clustering results for different k values in its single run.

[1]  Michael G. Epitropakis,et al.  Progressive preference articulation for decision making in multi-objective optimisation problems , 2017, Integr. Comput. Aided Eng..

[2]  Xiaodong Li,et al.  Cooperative Co-Evolution With Differential Grouping for Large Scale Optimization , 2014, IEEE Transactions on Evolutionary Computation.

[3]  Yimin Liu,et al.  Integrating Multi-Objective Genetic Algorithm and Validity Analysis for Locating and Ranking Alternative Clustering , 2005, Informatica.

[4]  Peter J. Fleming,et al.  Preference-inspired co-evolutionary algorithm using adaptively generated goal vectors , 2013, 2013 IEEE Congress on Evolutionary Computation.

[5]  Ferrante Neri,et al.  A fast hypervolume driven selection mechanism for many-objective optimisation problems , 2017, Swarm Evol. Comput..

[6]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[7]  Yong Wang,et al.  A Weighted Biobjective Transformation Technique for Locating Multiple Optimal Solutions of Nonlinear Equation Systems , 2017, IEEE Transactions on Evolutionary Computation.

[8]  Yong Wang,et al.  MOMMOP: Multiobjective Optimization for Locating Multiple Optimal Solutions of Multimodal Optimization Problems , 2015, IEEE Transactions on Cybernetics.

[9]  Tao Zhang,et al.  On the effect of reference point in MOEA/D for multi-objective optimization , 2017, Appl. Soft Comput..

[10]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[11]  Ming Yang,et al.  An Adaptive Multipopulation Framework for Locating and Tracking Multiple Optima , 2016, IEEE Transactions on Evolutionary Computation.

[12]  Yimin Liu,et al.  Multi-objective Genetic Algorithm Based Clustering Approach and Its Application to Gene Expression Data , 2004, ADVIS.

[13]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Clustering , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[14]  Jinjun Chen,et al.  Hybrid multi-objective cuckoo search with dynamical local search , 2017, Memetic Computing.

[15]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[16]  Dervis Karaboga,et al.  A comprehensive survey of traditional, merge-split and evolutionary approaches proposed for determination of cluster number , 2017, Swarm Evol. Comput..

[17]  Rüdiger Stephan,et al.  Cardinality constrained combinatorial optimization: Complexity and polyhedra , 2010, Discret. Optim..

[18]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[19]  Zhihua Cui,et al.  Bat algorithm with triangle-flipping strategy for numerical optimization , 2017, International Journal of Machine Learning and Cybernetics.

[20]  Ferrante Neri,et al.  Covariance matrix adaptation pareto archived evolution strategy with hypervolume-sorted adaptive grid algorithm , 2016, Integr. Comput. Aided Eng..

[21]  Xingguang Peng,et al.  Large-scale cooperative co-evolution using niching-based multi-modal optimization and adaptive fast clustering , 2017, Swarm Evol. Comput..

[22]  Hisao Ishibuchi,et al.  Balance between genetic search and local search in memetic algorithms for multiobjective permutation flowshop scheduling , 2003, IEEE Trans. Evol. Comput..

[23]  Xin Yao,et al.  Large scale evolutionary optimization using cooperative coevolution , 2008, Inf. Sci..

[24]  Yu Xue,et al.  Immune clonal algorithm based on directed evolution for multi-objective capacitated arc routing problem , 2016, Appl. Soft Comput..

[25]  Lothar Thiele,et al.  Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[26]  Tao Zhang,et al.  Localized Weighted Sum Method for Many-Objective Optimization , 2018, IEEE Transactions on Evolutionary Computation.

[27]  Yu Xue,et al.  Improved bat algorithm with optimal forage strategy and random disturbance strategy , 2016, Int. J. Bio Inspired Comput..

[28]  Yew-Soon Ong,et al.  Multifactorial Evolution: Toward Evolutionary Multitasking , 2016, IEEE Transactions on Evolutionary Computation.

[29]  Peter J. Fleming,et al.  The iPICEA-g: a new hybrid evolutionary multi-criteria decision making approach using the brushing technique , 2015, Eur. J. Oper. Res..

[30]  Peter J. Fleming,et al.  Preference-Inspired Coevolutionary Algorithms for Many-Objective Optimization , 2013, IEEE Transactions on Evolutionary Computation.

[31]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Clustering , 2015, ACM Comput. Surv..

[32]  Fang Liu,et al.  A Novel Immune Clonal Algorithm , 2006, ICNC.

[33]  Licheng Jiao,et al.  A multi-population cooperative coevolutionary algorithm for multi-objective capacitated arc routing problem , 2014, Inf. Sci..

[34]  Peter J. Fleming,et al.  Preference-inspired co-evolutionary algorithms using weight vectors , 2015, Eur. J. Oper. Res..

[35]  Fang Liu,et al.  A Novel Immune Clonal Algorithm for MO Problems , 2012, IEEE Transactions on Evolutionary Computation.

[36]  Weiguo Sheng,et al.  Adaptive Multisubpopulation Competition and Multiniche Crowding-Based Memetic Algorithm for Automatic Data Clustering , 2016, IEEE Transactions on Evolutionary Computation.

[37]  Qingfu Zhang,et al.  Decomposition-Based Algorithms Using Pareto Adaptive Scalarizing Methods , 2016, IEEE Transactions on Evolutionary Computation.