Spectral clustering with density sensitive similarity function

In recent years, spectral clustering has become quite popular for data analysis because it can be solved efficiently by standard linear algebra tools and do not suffer from the problem of local optima. The clustering effect by using such spectral method, however, depends heavily on the description of similarity between instances of the datasets. In this paper, we defined the adjustable line segment length which can adjust the distance in regions with different density. It squeezes the distances in high density regions while widen them in low density regions. And then a density sensitive distance measure satisfied by symmetric, non-negative, reflexivity and triangle inequality was present, by which we can define a new similarity function for spectral clustering. Experimental results show that compared with conventional Euclidean distance based and Gaussian kernel function based spectral clustering, our proposed algorithm with density sensitive similarity measure can obtain desirable clusters with high performance on both synthetic and real life datasets.

[1]  Jiao Licheng,et al.  Density-Sensitive Spectral Clustering , 2007 .

[2]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Jianbo Shi,et al.  Learning Segmentation by Random Walks , 2000, NIPS.

[4]  Zoubin Ghahramani,et al.  Spectral Methods for Automatic Multiscale Data Clustering , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[6]  Mikhail Belkin,et al.  Regularization and Semi-supervised Learning on Large Graphs , 2004, COLT.

[7]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[8]  Jianbo Shi,et al.  A Random Walks View of Spectral Segmentation , 2001, AISTATS.

[9]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[10]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[11]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[12]  B. Mohar THE LAPLACIAN SPECTRUM OF GRAPHS y , 1991 .

[13]  Pang-Ning Tan,et al.  Outrank: a Graph-Based Outlier Detection Framework Using Random Walk , 2008, Int. J. Artif. Intell. Tools.

[14]  Claire Cardie,et al.  Clustering with Instance-Level Constraints , 2000, AAAI/IAAI.

[15]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Chih-Hsuan Wang,et al.  Recognition of semiconductor defect patterns using spatial filtering and spectral clustering , 2008, Expert Syst. Appl..

[18]  Pang-Ning Tan,et al.  Outlier Detection Using Random Walks , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[19]  Marina Meila,et al.  Comparing clusterings: an axiomatic view , 2005, ICML.