Discriminative Multiple Kernel Concept Factorization for Data Representation

Concept Factorization (CF) improves Nonnegative matrix factorization (NMF), which can be only performed in the original data space, by conducting factorization within proper kernel space where the structure of data become much clear than the original data space. CF-based methods have been widely applied and yielded impressive results in optimal data representation and clustering tasks. However, CF methods still face with the problem of proper kernel function design or selection in practice. Most existing Multiple Kernel Clustering (MKC) algorithms do not sufficiently consider the intrinsic neighborhood structure of base kernels. In this paper, we propose a novel Discriminative Multiple Kernel Concept Factorization method for data representation and clustering. We first extend the original kernel concept factorization with the integration of multiple kernel clustering framework to alleviate the problem of kernel selection. For each base kernel, we extract the local discriminant structure of data via the local discriminant models with global integration. Moreover, we further linearly combine all these kernel-level local discriminant models to obtain an integrated consensus characterization of the intrinsic structure across base kernels. In this way, it is expected that our method can achieve better results by more compact data reconstruction and more faithful local structure preserving. An iterative algorithm with convergence guarantee is also developed to find the optimal solution. Extensive experiments on benchmark datasets further show that the proposed method outperforms many state-of-the-art algorithms.

[1]  Lei Shi,et al.  Recovery of Corrupted Multiple Kernels for Clustering , 2015, IJCAI.

[2]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[3]  Xinwang Liu,et al.  Multiple Kernel Clustering With Neighbor-Kernel Subspace Segmentation , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Wei Yan,et al.  A novel regularized concept factorization for document clustering , 2017, Knowl. Based Syst..

[5]  Yihong Gong,et al.  Document clustering by concept factorization , 2004, SIGIR '04.

[6]  Lin Wang,et al.  Discriminative sparse embedding based on adaptive graph for dimension reduction , 2020, Eng. Appl. Artif. Intell..

[7]  En Zhu,et al.  Subspace segmentation-based robust multiple kernel clustering , 2020, Inf. Fusion.

[8]  Rong Jin,et al.  Generalized Maximum Margin Clustering and Unsupervised Kernel Learning , 2006, NIPS.

[9]  Hal Daumé,et al.  A Co-training Approach for Multi-view Spectral Clustering , 2011, ICML.

[10]  Bin Song,et al.  Adaptive graph regularized nonnegative matrix factorization for data representation , 2019, Applied Intelligence.

[11]  Chong Peng,et al.  Nonnegative Matrix Factorization with Local Similarity Learning , 2019, Inf. Sci..

[12]  Johan A. K. Suykens,et al.  Optimized Data Fusion for Kernel k-Means Clustering , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Feng Tian,et al.  Graph-regularized concept factorization for multi-view document clustering , 2017, J. Vis. Commun. Image Represent..

[14]  Qiang Cheng,et al.  Discriminative Ridge Machine: A Classifier for High-Dimensional Data or Imbalanced Data. , 2020, IEEE transactions on neural networks and learning systems.

[15]  Haibo Wang,et al.  Adaptive Structure Concept Factorization for Multiview Clustering , 2018, Neural Computation.

[16]  Chunxia Zhang,et al.  Graph-based discriminative concept factorization for data representation , 2017, Knowl. Based Syst..

[17]  Yuan Sun,et al.  Joint correntropy metric weighting and block diagonal regularizer for robust multiple kernel subspace clustering , 2019, Inf. Sci..

[18]  Lei Shi,et al.  Robust Multiple Kernel K-means Using L21-Norm , 2015, IJCAI.

[19]  Yung-Yu Chuang,et al.  Affinity aggregation for spectral clustering , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Liang Du,et al.  Heterogeneous Metric Learning with Content-Based Regularization for Software Artifact Retrieval , 2014, 2014 IEEE International Conference on Data Mining.

[21]  Lei Shi,et al.  Experimental Design with Multiple Kernels , 2015, 2015 IEEE International Conference on Data Mining.

[22]  Zheng Yang,et al.  Locality-Constrained Concept Factorization , 2011, IJCAI.

[23]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[24]  Jun Deng,et al.  Heterogeneous Metric Learning for Cross-Modal Multimedia Retrieval , 2013, WISE.

[25]  Xuelong Li,et al.  Local Coordinate Concept Factorization for Image Representation , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Hal Daumé,et al.  Co-regularized Multi-view Spectral Clustering , 2011, NIPS.

[27]  Xinwang Liu,et al.  K-Means Clustering With Incomplete Data , 2019, IEEE Access.

[28]  Jing-Yu Yang,et al.  Multiple kernel clustering based on centered kernel alignment , 2014, Pattern Recognit..

[29]  Xiaofei He,et al.  Discriminative concept factorization for data representation , 2011, Neurocomputing.

[30]  Jieping Ye,et al.  Discriminative K-means for Clustering , 2007, NIPS.

[31]  Lei Du,et al.  Robust Multi-View Spectral Clustering via Low-Rank and Sparse Decomposition , 2014, AAAI.

[32]  Zenglin Xu,et al.  Unified Spectral Clustering with Optimal Graph , 2017, AAAI.

[33]  Xinwang Liu,et al.  Multiple Kernel Clustering With Global and Local Structure Alignment , 2018, IEEE Access.

[34]  Xuan Li,et al.  Graph-Based Marginal Ranking for Update Summarization , 2011, SDM.

[35]  Zhao Kang,et al.  Twin Learning for Similarity and Clustering: A Unified Kernel Approach , 2017, AAAI.

[36]  Yung-Yu Chuang,et al.  Multiple Kernel Fuzzy Clustering , 2012, IEEE Transactions on Fuzzy Systems.

[37]  Yong Dou,et al.  Multiple kernel clustering with corrupted kernels , 2017, Neurocomputing.

[38]  Mehmet Gönen,et al.  Localized Data Fusion for Kernel k-Means Clustering with Application to Cancer Biology , 2014, NIPS.

[39]  Curtis B. Storlie,et al.  Multiple Kernel Learning Clustering with an Application to Malware , 2012, 2012 IEEE 12th International Conference on Data Mining.

[40]  Zhen Liu,et al.  Multiple Graph Regularized Concept Factorization With Adaptive Weights , 2018, IEEE Access.

[41]  Yiu-ming Cheung,et al.  Feature Selection and Kernel Learning for Local Learning-Based Clustering , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Xuelong Li,et al.  Self-Representative Manifold Concept Factorization with Adaptive Neighbors for Clustering , 2018, IJCAI.

[43]  Lei Wang,et al.  Multiple Kernel Clustering with Local Kernel Alignment Maximization , 2016, IJCAI.

[44]  Chris H. Q. Ding,et al.  Convex and Semi-Nonnegative Matrix Factorizations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Peng Zhou,et al.  Unsupervised Robust Multiple Kernel Learning via Extracting Local and Global Noises , 2019, IEEE Access.

[46]  Peng Zhou,et al.  Local Graph Reconstruction for Parameter Free Unsupervised Feature Selection , 2019, IEEE Access.

[47]  Lei Wang,et al.  Multiple Kernel k-Means with Incomplete Kernels , 2017, AAAI.

[48]  Yung-Yu Chuang,et al.  Multi-affinity spectral clustering , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[49]  Wen Gao,et al.  Localized Incomplete Multiple Kernel k-means , 2018, IJCAI.

[50]  Zenglin Xu,et al.  Low-rank kernel learning for graph-based clustering , 2019, Knowl. Based Syst..

[51]  Fariza Tahi,et al.  Localized Multiple Sources Self-Organizing Map , 2018, ICONIP.

[52]  Xuan Li,et al.  Robust Nonnegative Matrix Factorization via Half-Quadratic Minimization , 2012, 2012 IEEE 12th International Conference on Data Mining.

[53]  Qiang Cheng,et al.  Robust principal component analysis: A factorization-based approach with linear complexity , 2020, Inf. Sci..

[54]  Bin Zhao,et al.  Multiple Kernel Clustering , 2009, SDM.

[55]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[56]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[57]  Fan Ye,et al.  Incremental multi-view spectral clustering , 2019, Knowl. Based Syst..

[58]  Lei Wang,et al.  Multiple Kernel k-Means Clustering with Matrix-Induced Regularization , 2016, AAAI.

[59]  Chun Chen,et al.  Clustering analysis using manifold kernel concept factorization , 2012, Neurocomputing.

[60]  Yong Dou,et al.  Optimal Neighborhood Kernel Clustering with Multiple Kernels , 2017, AAAI.

[61]  Dale Schuurmans,et al.  Maximum Margin Clustering , 2004, NIPS.

[62]  Yi Yang,et al.  Image Clustering Using Local Discriminant Models and Global Integration , 2010, IEEE Transactions on Image Processing.

[63]  Jiawei Han,et al.  Locally Consistent Concept Factorization for Document Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.