Discriminative K-Means Laplacian Clustering

Recently, more and more multi-source data are widely used in many real world applications. This kind of data is high dimensional and comes from different resources, which are often the attribute information and similarity information of the same data. It is challenging to use these two types of information to deal with the high dimensional problem simultaneously. A natural way to adopt is a two-step procedure: it utilizes feature integration or kernel integration to combine these two types of information first and then perform dimensional reduction like principal component analysis or various manifold learning algorithms. Different from that, we proposed to deal with these problems in a unified framework which combines discriminative K-means clustering and spectral clustering together. Compared with those separate two-step procedure, information integration and dimension reduction can benefit from each other in our method to promote clustering performance.In addition, discriminative K-means clustering has incorporated K-means and linear discriminant analysis to promote clustering and tackle high dimensional problem. Spectral clustering can reduce the original dimension easily due to the singular value decomposition. Thus it is a good way to combine discriminative K-means and spectral clustering to improve clustering and deal with high dimensional problem. Experimental results on multiple real world data sets verified its effectiveness.

[1]  Shiliang Sun,et al.  Semi-supervised Multitask Learning via Self-training and Maximum Entropy Discrimination , 2012, ICONIP.

[2]  Chris H. Q. Ding,et al.  Spectral Relaxation for K-means Clustering , 2001, NIPS.

[3]  Weifeng Liu,et al.  Multiview dimension reduction via Hessian multiset canonical correlations , 2018, Inf. Fusion.

[4]  Jieping Ye,et al.  Adaptive Distance Metric Learning for Clustering , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[6]  Meng Wang,et al.  Multimodal Deep Autoencoder for Human Pose Recovery , 2015, IEEE Transactions on Image Processing.

[7]  Zhou Yu,et al.  Discriminative coupled dictionary hashing for fast cross-media retrieval , 2014, SIGIR.

[8]  Xinlei Chen,et al.  Large Scale Spectral Clustering with Landmark-Based Representation , 2011, AAAI.

[9]  Takeo Kanade,et al.  Discriminative cluster analysis , 2006, ICML.

[10]  H. Deutsch Principle Component Analysis , 2004 .

[11]  Xuelong Li,et al.  General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jun Yu,et al.  Multi-view ensemble manifold regularization for 3D object recognition , 2015, Inf. Sci..

[13]  Shiliang Sun,et al.  Alternative Multiview Maximum Entropy Discrimination , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Justin Starren,et al.  Natural Language Processing for EHR-Based Pharmacovigilance: A Structured Review , 2017, Drug Safety.

[15]  Shiliang Sun,et al.  Multi-View Maximum Entropy Discrimination , 2013, IJCAI.

[16]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[18]  Shiliang Sun,et al.  Multi-kernel maximum entropy discrimination for multi-view learning , 2016, Intell. Data Anal..

[19]  Shiliang Sun,et al.  Consensus and complementarity based maximum entropy discrimination for multi-view classification , 2016, Inf. Sci..

[20]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[21]  Meng Wang,et al.  Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval , 2015, IEEE Transactions on Industrial Electronics.

[22]  Fei Wang,et al.  Integrated KL (K-means - Laplacian) Clustering: A New Clustering Approach by Combining Attribute Data and Pairwise Relations , 2009, SDM.

[23]  Fei Gao,et al.  Deep Multimodal Distance Metric Learning Using Click Constraints for Image Ranking , 2017, IEEE Transactions on Cybernetics.

[24]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[25]  Mehryar Mohri,et al.  Learning Non-Linear Combinations of Kernels , 2009, NIPS.

[26]  Y. Rui,et al.  Learning to Rank Using User Clicks and Visual Features for Image Retrieval , 2015, IEEE Transactions on Cybernetics.

[27]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[28]  Shiliang Sun,et al.  Applying a multitask feature sparsity method for the classification of semantic relations between nominals , 2012, 2012 International Conference on Machine Learning and Cybernetics.

[29]  Zhou Yu,et al.  Multi-modal Factorized Bilinear Pooling with Co-attention Learning for Visual Question Answering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[30]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[31]  Jun Yu,et al.  Multitask Autoencoder Model for Recovering Human Poses , 2018, IEEE Transactions on Industrial Electronics.

[32]  Jieping Ye,et al.  Discriminative K-means for Clustering , 2007, NIPS.

[33]  Ke Lu,et al.  $p$-Laplacian Regularized Sparse Coding for Human Activity Recognition , 2016, IEEE Transactions on Industrial Electronics.

[34]  Shiliang Sun,et al.  A Survey on Multiview Clustering , 2017, IEEE Transactions on Artificial Intelligence.

[35]  Theofanis Sapatinas,et al.  Discriminant Analysis and Statistical Pattern Recognition , 2005 .