Selective Clustering Ensemble Based on Covariance

Clustering Ensemble effectively improves clustering accuracy, stability and robustness, which is most resulted from the diversity of the base clustering results. It is a key point to measure the diversity of clustering results. This paper proposes a method to measure diversity of base clustering results and a covariance-based selective clustering ensemble algorithm. Experiments on 20 UCI data sets show that this algorithm effectively improves the clustering performance.

[1]  Derek Greene,et al.  Ensemble clustering in medical diagnostics , 2004 .

[2]  B. Verma,et al.  Decisions Fusion Strategy: Towards Hybrid Cluster Ensemble , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[3]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[4]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[5]  Xuan Xiao,et al.  Similarity-based spectral clustering ensemble selection , 2012, 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery.

[6]  Tossapon Boongoen,et al.  A Link-Based Cluster Ensemble Approach for Categorical Data Clustering , 2012, IEEE Transactions on Knowledge and Data Engineering.

[7]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[8]  Peng Zhou,et al.  Semi-Supervised Cluster Ensemble Model Based on Bayesian Network: Semi-Supervised Cluster Ensemble Model Based on Bayesian Network , 2011 .

[9]  Zhou Zhihua,et al.  Bagging-Based Selective Clusterer Ensemble , 2005 .

[10]  Yan Yang,et al.  Semi-supervised Clustering Ensemble Based on Multi-ant Colonies Algorithm , 2012, RSKT.

[11]  Licheng Jiao,et al.  Bagging-based spectral clustering ensemble selection , 2011, Pattern Recognit. Lett..

[12]  Liu Limin,et al.  A New Selective Clustering Ensemble Algorithm , 2012, ICEBE 2012.

[13]  Carla E. Brodley,et al.  Random Projection for High Dimensional Data Clustering: A Cluster Ensemble Approach , 2003, ICML.

[14]  Anil K. Jain,et al.  A Mixture Model for Clustering Ensembles , 2004, SDM.

[15]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[16]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[17]  Kai Li,et al.  A Novel Measure of Diversity for Support Vector Machine Ensemble , 2010, 2010 Third International Symposium on Intelligent Information Technology and Security Informatics.

[18]  J. C. Dunn,et al.  A Graph Theoretic Analysis of Pattern Classification via Tamura's Fuzzy Relation , 1974, IEEE Trans. Syst. Man Cybern..

[19]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[20]  Zahoor Ali Khan,et al.  Semi-supervised Clustering Ensemble by Voting , 2012, ArXiv.

[21]  Wei Tang,et al.  Ensembling neural networks: Many could be better than all , 2002, Artif. Intell..

[22]  Maurice K. Wong,et al.  Algorithm AS136: A k-means clustering algorithm. , 1979 .

[23]  Anil K. Jain,et al.  Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Amin Nikanjam,et al.  An Evolutionary Approach to Clustering Ensemble , 2008, 2008 Fourth International Conference on Natural Computation.

[25]  James C. Bezdek,et al.  A Convergence Theorem for the Fuzzy ISODATA Clustering Algorithms , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[27]  Ludmila I. Kuncheva,et al.  Moderate diversity for better cluster ensembles , 2006, Inf. Fusion.

[28]  Hui-lan Luo,et al.  Combining Multiple Clusterings using Information Theory based Genetic Algorithm , 2006, 2006 International Conference on Computational Intelligence and Security.

[29]  Xiaoli Z. Fern,et al.  Cluster Ensemble Selection , 2008 .

[30]  Mohamed S. Kamel,et al.  An aggregated clustering approach using multi-ant colonies algorithms , 2006, Pattern Recognit..

[31]  Zhou Peng,et al.  Semi-Supervised Cluster Ensemble Model Based on Bayesian Network , 2010 .