Object-Neighbourhood Clustering Ensemble Method

Clustering is an unsupervised learning and clustering results are often inconsistent and unreliable when different clustering algorithms are used. In this paper we have proposed a clustering ensemble framework, named Object-Neighbourhood Clustering Ensemble (ONCE), to improve the consistency, reliability and quality of the clustering result. The core of the ONCE is a new consensus function that addresses the uncertain agreements between members by taking the neighbourhood relationship between object pairs into account in the similarity matrix. The experiments are carried out on 11 benchmark datasets. The results show that our ensemble method outperforms the co-association method, when the Average linkage is used. Furthermore, the results show that our ensemble method is more accurate than the baseline algorithm, and this indicates that the clustering ensemble method is more consistent and reliable than a single clustering algorithm.

[1]  Sandro Vega-Pons,et al.  Weighted association based methods for the combination of heterogeneous partitions , 2011, Pattern Recognit. Lett..

[2]  G. Sapiro,et al.  A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[3]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[4]  Aristides Gionis,et al.  Clustering aggregation , 2005, 21st International Conference on Data Engineering (ICDE'05).

[5]  Jinfeng Yi,et al.  Robust Ensemble Clustering by Matrix Completion , 2012, 2012 IEEE 12th International Conference on Data Mining.

[6]  Ray A. Jarvis,et al.  Clustering Using a Similarity Measure Based on Shared Near Neighbors , 1973, IEEE Transactions on Computers.

[7]  R. Iman,et al.  Approximations of the critical region of the fbietkan statistic , 1980 .

[8]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[9]  Cor J. Veenman,et al.  A Maximum Variance Cluster Algorithm , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Xi Wang,et al.  Clustering aggregation by probability accumulation , 2009, Pattern Recognit..

[11]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[12]  Ana L. N. Fred,et al.  Combining multiple clusterings using evidence accumulation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.