Cone Cluster Labeling for Support Vector Clustering

Clustering forms natural groupings of data points that maximize intra-cluster similarity and minimize intercluster similarity. Support Vector Clustering (SVC) is a clustering algorithm that can handle arbitrary cluster shapes. One of the major SVC challenges is a cluster labeling performance bottleneck. We propose a novel cluster labeling algorithm that relies on approximate coverings both in feature space and data space. Comparison with existing cluster labeling approaches suggests that our strategy improves efficiency without sacrificing clustering quality.

[1]  Daewon Lee,et al.  An improved cluster labeling method for support vector clustering , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  David Harel,et al.  Clustering spatial data using random walks , 2001, KDD '01.

[3]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[4]  Jianhua Yang,et al.  Support vector clustering through proximity graph modelling , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[5]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[6]  ISTVAN JONYER,et al.  Graph-Based Hierarchical Conceptual Clustering , 2000, Int. J. Artif. Intell. Tools.

[7]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[8]  Karen Daniels,et al.  Gaussian Kernel Width Generator for Support Vector Clustering , 2005, Advances in Bioinformatics and Its Applications.

[9]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[10]  Bernhard Schölkopf,et al.  Extracting Support Data for a Given Task , 1995, KDD.

[11]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .