New Methods for Spectral Clustering.

Analyzing the affinity matrix spectrum is an increasingly popular data clustering method. We propose three new algorithmic components which are appropriate for improving performance of spectral clustering. First, observing the eigenvectors suggests to use a K-lines algorithm instead of the commonly applied K-means. Second, the clustering works best if the affinity matrix has a clear block structure, which can be achieved by computing a conductivity matrix. Third, many clustering problems are inhomogeneous or asymmetric in the sense that some clusters are concentrated while others are dispersed. In this case, a context-dependent calculation of the affinity matrix helps. This method also turns out to allow a robust automatic determination of the kernel radius σ.

[1]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[2]  Mark Jerrum,et al.  Approximate Counting, Uniform Generation and Rapidly Mixing Markov Chains , 1987, WG.

[3]  S. Fahlman Fast-learning variations on back propagation: an empirical study. , 1989 .

[4]  Guy L. Scott,et al.  Feature grouping by 'relocalisation' of eigenvectors of the proximity matrix , 1990, BMVC.

[5]  Charles J. Alpert,et al.  Spectral Partitioning: The More Eigenvectors, The Better , 1995, 32nd Design Automation Conference.

[6]  Shang-Hua Teng,et al.  Spectral partitioning works: planar graphs and finite element meshes , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[7]  Bernhard Schölkopf,et al.  Support vector learning , 1997 .

[8]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Pietro Perona,et al.  A Factorization Approach to Grouping , 1998, ECCV.

[10]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[11]  Yair Weiss,et al.  Segmentation using eigenvectors: a unifying view , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[12]  Jianbo Shi,et al.  Learning Segmentation by Random Walks , 2000, NIPS.

[13]  Paul S. Bradley,et al.  k-Plane Clustering , 2000, J. Glob. Optim..

[14]  Santosh S. Vempala,et al.  On clusterings-good, bad and spectral , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[15]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[16]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[17]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[18]  Marina Meila,et al.  A Comparison of Spectral Clustering Algorithms , 2003 .