Double Partition Around Medoids based Cluster Ensemble

Cluster ensemble is one of the hot topics in the machine learning area. Though plenty of cluster ensemble methods and frameworks have been proposed, many cluster ensemble methods are easily faded by noisy datasets and local optimal problems. In this article, we introduced a novel cluster ensemble method, named as Double Partition Around Medoids based Cluster Ensemble (PAM2CE). PAM2CE will effectively weaken or even eliminate the effect of noisy datasets and local optimal problems via clustering attributes and selecting the representative attributes. The experimental results reveal the better robustness and effectiveness of proposed method.

[1]  Carla E. Brodley,et al.  Random Projection for High Dimensional Data Clustering: A Cluster Ensemble Approach , 2003, ICML.

[2]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Tomer Hertz,et al.  Boosting margin based distance functions for clustering , 2004, ICML.

[4]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Thierry Denoeux,et al.  Belief Functions and Cluster Ensembles , 2009, ECSQARU.

[6]  Sandrine Dudoit,et al.  Bagging to Improve the Accuracy of A Clustering Procedure , 2003, Bioinform..

[7]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Multi-Objective Clustering Ensemble , 2006, 2006 Sixth International Conference on Hybrid Intelligent Systems (HIS'06).

[8]  Kurt Hornik,et al.  Voting-Merging: An Ensemble Method for Clustering , 2001, ICANN.

[9]  Anil K. Jain,et al.  Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[11]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .