A new Kmeans clustering model and its generalization achieved by joint spectral embedding and rotation

The Kmeans clustering and spectral clustering are two popular clustering methods for grouping similar data points together according to their similarities. However, the performance of Kmeans clustering might be quite unstable due to the random initialization of the cluster centroids. Generally, spectral clustering methods employ a two-step strategy of spectral embedding and discretization postprocessing to obtain the cluster assignment, which easily lead to far deviation from true discrete solution during the postprocessing process. In this paper, based on the connection between the Kmeans clustering and spectral clustering, we propose a new Kmeans formulation by joint spectral embedding and spectral rotation which is an effective postprocessing approach to perform the discretization, termed KMSR. Further, instead of directly using the dot-product data similarity measure, we make generalization on KMSR by incorporating more advanced data similarity measures and call this generalized model as KMSR-G. An efficient optimization method is derived to solve the KMSR (KMSR-G) model objective whose complexity and convergence are provided. We conduct experiments on extensive benchmark datasets to validate the performance of our proposed models and the experimental results demonstrate that our models perform better than the related methods in most cases.

[1]  Feiping Nie,et al.  Spectral Rotation versus K-Means in Spectral Clustering , 2013, AAAI.

[2]  Feiping Nie,et al.  A New Simplex Sparse Learning Model to Measure Data Similarity for Clustering , 2015, IJCAI.

[3]  Feiping Nie,et al.  The Constrained Laplacian Rank Algorithm for Graph-Based Clustering , 2016, AAAI.

[4]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[5]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Qichun Zhang,et al.  Neural Membrane Mutual Coupling Characterisation Using Entropy-Based Iterative Learning Identification , 2020, IEEE Access.

[7]  Feiping Nie,et al.  Subspace Clustering via New Low-Rank Model with Discrete Group Structure Constraint , 2016, IJCAI.

[8]  Feiping Nie,et al.  Scalable Normalized Cut with Improved Spectral Rotation , 2017, IJCAI.

[9]  Yong Peng,et al.  Joint low-rank representation and spectral regression for robust subspace learning , 2020, Knowl. Based Syst..

[10]  Feiping Nie,et al.  Improved MinMax Cut Graph Clustering with Nonnegative Relaxation , 2010, ECML/PKDD.

[11]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[12]  Feiping Nie,et al.  Heterogeneous Image Features Integration via Multi-modal Semi-supervised Learning Model , 2013, 2013 IEEE International Conference on Computer Vision.

[13]  Xuelong Li,et al.  A generalized power iteration method for solving quadratic problem on the Stiefel manifold , 2017, Science China Information Sciences.

[14]  Chris H. Q. Ding,et al.  A min-max cut algorithm for graph partitioning and data clustering , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[15]  Xuelong Li,et al.  Multiview Clustering via Adaptively Weighted Procrustes , 2018, KDD.

[16]  Feiping Nie,et al.  Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut , 2018, KDD.

[17]  Seiki Ubukata,et al.  A unified approach for cluster-wise and general noise rejection approaches for k-means clustering , 2019, PeerJ Comput. Sci..

[18]  Feiping Nie,et al.  Parallel Vector Field Regularized Non-Negative Matrix Factorization for Image Representation , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Feiping Nie,et al.  Fast algorithm for large-scale subspace clustering by LRR , 2020, IET Image Process..

[20]  Jianbo Shi,et al.  Multiclass spectral clustering , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[21]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[22]  Qichun Zhang,et al.  An introductory survey of probability density function control , 2019, Systems Science & Control Engineering.