SHC: A Spectral Algorithm for Hierarchical Clustering

Hierarchical clustering (HC) is a widely used approach both in pattern recognition and data mining and has rich solutions in the literature. But all these existing solutions have some restrictions when the clustered dataset has complex structure. Spectral clustering is a graph-based, simple and outperforming method with the ability to find complex structure in dataset using spectral properties of the dataset-associated affinity matrix. In this paper, we propose a novel effective HC algorithm called SHC base on the techniques of spectral method. The experiment results both on artificial and real data sets show that our algorithm can hierarchically cluster complex data effectively and naturally.

[1]  Shai Ben-David,et al.  A Sober Look at Clustering Stability , 2006, COLT.

[2]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[3]  Vipin Kumar,et al.  Chameleon: Hierarchical Clustering Using Dynamic Modeling , 1999, Computer.

[4]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[5]  Sudipto Guha,et al.  ROCK: A Robust Clustering Algorithm for Categorical Attributes , 2000, Inf. Syst..

[6]  Ronald R. Yager Intelligent control of the hierarchical agglomerative clustering process , 2000, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[8]  Lin Xue-yin Learning a family of intrinsic cluster numbers , 2006 .

[9]  Sudipto Guha,et al.  ROCK: a robust clustering algorithm for categorical attributes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[10]  Marina Meila,et al.  A Comparison of Spectral Clustering Algorithms , 2003 .

[11]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[12]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[13]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[14]  Min Luo,et al.  An Intrusion Detection Research Based on Spectral Clustering , 2008, 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing.

[15]  Joachim M. Buhmann,et al.  Stability-Based Validation of Clustering Solutions , 2004, Neural Computation.

[16]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[17]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[18]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.