Denoised Kernel Spectral data Clustering

Kernel Spectral Clustering (KSC) solves a weighted kernel principal component analysis problem in a primal-dual optimization framework. It builds an unsupervised model on a small subset of data using the dual solution of the optimization problem. This allows KSC to have a powerful out-of-sample extension property leading to good cluster generalization w.r.t. unseen data points. However, in the presence of noise that causes overlapping data, the technique often fails to provide good generalization capability. In this paper, we propose a two-step process for clustering noisy data. We first denoise the data using kernel principal component analysis (KPCA) with a recently proposed Model selection criterion based on point-wise Distance Distributions (MDD) to obtain the underlying information in the data. We then use the KSC technique on this denoised data to obtain good quality clusters. One advantage of model based techniques is that we can use the same training and validation set for denoising and for clustering. We discovered that using the same kernel bandwidth parameter obtained from MDD for KPCA works efficiently with KSC in combination with the optimal number of clusters k to produce good quality clusters. We compare the proposed approach with normal KSC and KSC with KPCA using a heuristic method based on reconstruction error for several synthetic and real-world datasets to showcase the effectiveness of the proposed approach.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Adrian E. Raftery,et al.  Inference in model-based cluster analysis , 1997, Stat. Comput..

[3]  Johan A. K. Suykens,et al.  Soft kernel spectral clustering , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[4]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[5]  Gunnar Rätsch,et al.  Kernel PCA and De-Noising in Feature Spaces , 1998, NIPS.

[6]  R. Shah,et al.  Least Squares Support Vector Machines , 2022 .

[7]  Ricardo J. G. B. Campello,et al.  Relative Validity Criteria for Community Mining Algorithms , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[8]  Johan A. K. Suykens,et al.  FURS: Fast and Unique Representative Subset selection retaining large-scale community structure , 2013, Social Network Analysis and Mining.

[9]  Ivor W. Tsang,et al.  The pre-image problem in kernel methods , 2003, IEEE Transactions on Neural Networks.

[10]  Mark Girolami,et al.  Orthogonal Series Density Estimation and the Kernel Eigenvalue Problem , 2002, Neural Computation.

[11]  Nika Haghtalab,et al.  Clustering in the Presence of Noise , 2013 .

[12]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Johan A. K. Suykens,et al.  Noise Level Estimation for Model Selection in Kernel PCA Denoising , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[15]  Shai Ben-David,et al.  Clustering in the Presence of Background Noise , 2014, ICML.

[16]  Johan A. K. Suykens,et al.  Self-tuned kernel spectral clustering for large scale networks , 2013, 2013 IEEE International Conference on Big Data.

[17]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[18]  Halima Bensmail,et al.  Model-based Clustering with Noise: Bayesian Inference and Estimation , 2003, J. Classif..

[19]  Johan A. K. Suykens,et al.  Sparse kernel spectral clustering models for large-scale data analysis , 2011, Neurocomputing.

[20]  Johan A. K. Suykens,et al.  Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jianbo Shi,et al.  Learning Segmentation by Random Walks , 2000, NIPS.

[22]  Zhenguo Li,et al.  Noise Robust Spectral Clustering , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[23]  Johan A. K. Suykens,et al.  Kernel Spectral Clustering for Big Data Networks , 2013, Entropy.

[24]  Ricardo J. G. B. Campello,et al.  Relative Validity Criteria for Community Mining Algorithms , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.