A Novel Ensemble Clustering Approach with Internal Weighting Strategy

Due to its strong robustness and high accuracy, the ensemble clustering has been extensively applied in many fields. Despite the significant achievements, there is a limitation need to be overcome. Most of the existing ensemble clustering methods commonly value all component clusterings equally regardless of their different quality. Therefore, low-quality component clusterings are very destructive for them. Although many modified measures have been adopted to weight each component clustering base on their quality, yet these measures have a tendency to treat each cluster as an unit and ignore the diversity of clusters belonging to same basis clustering. How to take into count the diversity of clusters and the internal structure of component clusterings to improve the consensus performance is a problem that need to be solved. In this paper, a novel ensemble clustering method is proposed to solve this problem. Firstly, We adopt five clustering methods to generate diverse component clusterings. Secondly, in order to build candidate component clustering pool, we propose a new method to discard low-quality component clusterings. Thirdly, we utilize a advanced method to evaluate the unreliability of clusters with respect to candidate component clusterings. Finally, we further utilize a accepted weighting strategy and a consensus function to obtain consensus clustering. We have conducted extensive experiments to demonstrate the superiority of the proposed approach.

[1]  Xuan Xiao,et al.  Similarity-based spectral clustering ensemble selection , 2012, 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery.

[2]  Sangeeta Ahuja,et al.  Germplasm evaluation using cluster ensemble , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[3]  Junping Du,et al.  Cluster Ensemble-Based Image Segmentation , 2013 .

[4]  Arun K. Pujari,et al.  Weighted Co-clustering Based Clustering Ensemble , 2011, 2011 Third National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics.

[5]  Francisco Azuaje,et al.  An integrated tool for microarray data clustering and cluster validity assessment , 2004, SAC '04.

[6]  Hui Xiong,et al.  Adapting the right measures for K-means clustering , 2009, KDD.

[7]  Qiang-Li Zhao,et al.  A fast ensemble pruning algorithm based on pattern mining process , 2009, Data Mining and Knowledge Discovery.

[8]  Lan Wang,et al.  Comparison of Cluster Ensembles Methods Based on Hierarchical Clustering , 2009, 2009 International Conference on Computational Intelligence and Natural Computing.

[9]  Lei Wang,et al.  Cluster Ensemble Based Image Segmentation Algorithm , 2015, 2015 Eighth International Conference on Internet Computing for Science and Engineering (ICICSE).

[10]  Frank Nielsen,et al.  On weighting clustering , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Chang-Dong Wang,et al.  Locally Weighted Ensemble Clustering , 2016, IEEE Transactions on Cybernetics.