Kernel-based deterministic annealing algorithm for data clustering

Data clustering in kernel-induced feature space is interesting in that, by nonlinearly mapping the observed data from a low-dimensional input space into a high (possibly infinite)-dimensional feature space by means of a given kernel function, the kernel-based clustering can reveal complicated (e.g. linearly nonseparable) data structures that may be missed by traditional clustering methods in the standard Euclidean space. A kernel-based deterministic annealing (KDA) algorithm is developed for data clustering by using a Gaussian kernel function. The Gaussian parameter (width), which determines the nonlinear mapping together with the Gaussian kernel, is adaptively selected by the scaled inverse of data covariance. The effectiveness of the Gaussian parameter (width) selection method and the superiority of the KDA algorithm for clustering a variety of data structures are supported by the experimental results on artificial and real data sets.

[1]  Shokri Z. Selim,et al.  A simulated annealing algorithm for the clustering problem , 1991, Pattern Recognit..

[2]  John Q. Gan,et al.  An Unsupervised Kernel Based Fuzzy C-Means Clustering Algorithm With Kernel Normalisation , 2004, Int. J. Comput. Intell. Appl..

[3]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[4]  David Horn,et al.  Clustering via Hilbert space , 2001 .

[5]  Qing Song A Robust Information Clustering Algorithm , 2005, Neural Computation.

[6]  J. Łȩski Fuzzy c-varieties/elliptotypes clustering in reproducing kernel Hilbert space , 2004 .

[7]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[8]  J. Chiang,et al.  A new kernel-based fuzzy clustering approach: support vector clustering with cell growing , 2003, IEEE Trans. Fuzzy Syst..

[9]  Kenneth Rose,et al.  Hierarchical, Unsupervised Learning with Growing via Phase Transitions , 1996, Neural Computation.

[10]  Dao-Qiang Zhang,et al.  A novel kernelized fuzzy C-means algorithm with application in medical image segmentation , 2004, Artif. Intell. Medicine.

[11]  Stephen J. Roberts,et al.  Maximum certainty data partitioning , 2000, Pattern Recognit..

[12]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[13]  Geoffrey C. Fox,et al.  Constrained Clustering as an Optimization Method , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Richard E. Blahut,et al.  Computation of channel capacity and rate-distortion functions , 1972, IEEE Trans. Inf. Theory.

[15]  K. Rose Deterministic annealing for clustering, compression, classification, regression, and related optimization problems , 1998, Proc. IEEE.

[16]  Francesco Camastra,et al.  A Novel Kernel Method for Clustering , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[18]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[19]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[20]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[21]  Rose,et al.  Statistical mechanics and phase transitions in clustering. , 1990, Physical review letters.

[22]  Miin-Shen Yang,et al.  Alternative c-means clustering algorithms , 2002, Pattern Recognit..