Spectral clustering with eigenvector selection based on ensemble ranking

Ng-Jordan-Weiss (NJW) method is one of the most widely used spectral clustering algorithms. For a clustering problem with K clusters, this method clusters data using the largest K eigenvectors of the normalized affinity matrix derived from the data set. However, the top K eigenvectors are not always the most important eigenvectors for clustering. In this paper, we propose an eigenvector selection method based on an ensemble of multiple eigenvector rankings (ESEER) for spectral clustering. In ESEER method, first multiple rankings of eigenvectors are obtained by using the entropy metric, which is used to measure the importance of each eigenvector, next the multiple eigenvector rankings are aggregated into a single consensus one, then the first K eigenvectors in the consensus ranking list are adopted as the selected eigenvectors. We have performed experiments on artificial data sets, standard data sets of UCI repository and handwritten digits from MNIST database. The experimental results show that ESEER method is more effective than NJW method in some cases.

[1]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[2]  Bernhard Schölkopf,et al.  A Local Learning Approach for Clustering , 2006, NIPS.

[3]  Shaogang Gong,et al.  Spectral clustering with eigenvector selection , 2008, Pattern Recognit..

[4]  Bruce Hendrickson,et al.  An Improved Spectral Graph Partitioning Algorithm for Mapping Parallel Computations , 1995, SIAM J. Sci. Comput..

[5]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[6]  C. Ding,et al.  Spectral relaxation models and structure analysis for K-way graph clustering and bi-clustering , 2001 .

[7]  Huan Liu,et al.  Feature selection for clustering - a filter solution , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[8]  Dit-Yan Yeung,et al.  Robust path-based spectral clustering , 2008, Pattern Recognit..

[9]  Michael I. Jordan,et al.  Learning Spectral Clustering, With Application To Speech Separation , 2006, J. Mach. Learn. Res..

[10]  Andrew B. Kahng,et al.  Spectral Partitioning with Multiple Eigenvectors , 1999, Discret. Appl. Math..

[11]  Zhi-Hua Zhou,et al.  Supervised nonlinear dimensionality reduction for visualization and classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Huan Liu,et al.  Feature Selection for Clustering , 2000, Encyclopedia of Database Systems.

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.