Voronoi Cell-Based Clustering Using a Kernel Support

Support-based clustering using kernels suffers from serious computational limitations inherent in many kernel methods when applied to very large-scale problems despite its ability to identify clusters with complex shapes. In this paper, we propose a novel clustering algorithm called Voronoi cell-based clustering to expedite support-based clustering using kernels. In contrast to previous studies, including the basin cell-based method, the proposed method achieves computational efficiency in both the training phase to construct a support estimate using sampled data to reduce the evaluation of kernels and the labeling phase to assign a cluster label on each data point nearest its representative point. The performance superiority of the proposed method over the other basin cell-based methods in terms of computational time and storage efficiency is verified by various experiments using benchmark sets and in real applications to image segmentation.

[1]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[2]  Daewon Lee,et al.  Equilibrium-Based Support Vector Machine for Semisupervised Classification , 2007, IEEE Transactions on Neural Networks.

[3]  M. Cugmas,et al.  On comparing partitions , 2015 .

[4]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[5]  Fernando J. Von Zuben,et al.  Improving Support Vector Clustering with Ensembles , 2005 .

[6]  Daewon Lee,et al.  Domain described support vector classifier for multi-classification problems , 2007, Pattern Recognit..

[7]  Daewon Lee,et al.  Dynamic Characterization of Cluster Structures for Robust and Inductive Support Vector Clustering , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  S. Abe,et al.  Spatially chunking support vector clustering algorithm , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[9]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[10]  Hildur Ólafsdóttir,et al.  Robust Pseudo-hierarchical Support Vector Clustering , 2007, SCIA.

[11]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[12]  Kyu-Hwan Jung,et al.  Dynamic pattern denoising method using multi-basin system with kernels , 2011, Pattern Recognit..

[13]  K. Stanovich,et al.  Heuristics and Biases: Individual Differences in Reasoning: Implications for the Rationality Debate? , 2002 .

[14]  Daewon Lee,et al.  An improved cluster labeling method for support vector clustering , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Karen M. Daniels,et al.  Gaussian kernel width exploration and cone cluster labeling for support vector clustering , 2011, Pattern Analysis and Applications.

[16]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Francesco Camastra,et al.  Cursive character recognition by learning vector quantization , 2001, Pattern Recognit. Lett..

[18]  Jean-Yves Audibert Optimization for Machine Learning , 1995 .

[19]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[20]  Jianhua Yang,et al.  Support vector clustering through proximity graph modelling , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[21]  Jaewook Lee,et al.  Clustering Based on Gaussian Processes , 2007, Neural Computation.

[22]  Xiang Ji,et al.  Support vector clustering combined with spectral graph partitioning , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[23]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[24]  Kyu-Hwan Jung,et al.  Probabilistic generative ranking method based on multi-support vector domain description , 2013, Inf. Sci..

[25]  Francesco Camastra,et al.  Offline Cursive Character Challenge: a New Benchmark for Machine Learning and Pattern Recognition Algorithms. , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[26]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[27]  D. Kahneman Thinking, Fast and Slow , 2011 .

[28]  Daewon Lee,et al.  Dynamic Dissimilarity Measure for Support-Based Clustering , 2010, IEEE Transactions on Knowledge and Data Engineering.

[29]  Daewon Lee,et al.  Fast support-based clustering method for large-scale problems , 2010, Pattern Recognit..

[30]  Francesco Camastra,et al.  A novel kernel method for clustering , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.