Fast Clustering With Co-Clustering Via Discrete Non-Negative Matrix Factorization for Image Identification

How to effectively cluster large-scale image data sets is a challenge and is receiving more and more attention. To address this problem, a novel clustering method called fast clustering with co-clustering via discrete non-negative matrix factorization, is proposed. Inspired by co-clustering, our algorithm reduces computational complexity by transforming clustering tasks into co-clustering tasks. Although our model has the same form of objective function as normalized cut, we relax it to a matrix decomposition problem, which is different from most graph-based approaches. In addition, an efficient optimization algorithm is proposed to solve the relaxed problem, where a discrete solution corresponding to the clustering result can be directly obtained. Extensive experiments have been conducted on several synthetic data sets and real word data sets. Compared with the state-of-the-art clustering methods, the proposed algorithm achieves very promising performance.

[1]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[3]  Qi Wang,et al.  Fast Spectral Clustering for Unsupervised Hyperspectral Image Classification , 2019, Remote. Sens..

[4]  Xinlei Chen,et al.  Large Scale Spectral Clustering Via Landmark-Based Sparse Representation , 2015, IEEE Transactions on Cybernetics.

[5]  Michael Elad,et al.  Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Feiping Nie,et al.  Scalable Normalized Cut with Improved Spectral Rotation , 2017, IJCAI.

[7]  Jianchao Fan,et al.  A Two-Phase Fuzzy Clustering Algorithm Based on Neurodynamic Optimization With Its Application for PolSAR Image Segmentation , 2018, IEEE Transactions on Fuzzy Systems.

[8]  Charles A. Sutton,et al.  Summarizing Software API Usage Examples Using Clustering Techniques , 2018, FASE.

[9]  Yilin Zhang,et al.  Discovering Political Topics in Facebook Discussion threads with Spectral Contextualization , 2017, The Annals of Applied Statistics.

[10]  Jitendra Malik,et al.  Spectral grouping using the Nystrom method , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[12]  Andreas Loukas,et al.  Approximating Spectral Clustering via Sampling: a Review , 2019, Sampling Techniques for Supervised or Unsupervised Tasks.

[13]  James T. Kwok,et al.  Making Large-Scale Nyström Approximation Possible , 2010, ICML.

[14]  Ioannis P. Panapakidis,et al.  Optimal Selection of Clustering Algorithm via Multi-Criteria Decision Analysis (MCDA) for Load Profiling Applications , 2018, Applied Sciences.

[15]  Pierre Vandergheynst,et al.  Compressive Spectral Clustering , 2016, ICML.

[16]  Xuelong Li,et al.  Unsupervised Large Graph Embedding , 2017, AAAI.

[17]  Rong Wang,et al.  Fast Spectral Clustering With Anchor Graph for Large Hyperspectral Images , 2017, IEEE Geoscience and Remote Sensing Letters.

[18]  Feiping Nie,et al.  Spectral Clustering of Large-scale Data by Directly Solving Normalized Cut , 2018, KDD.

[19]  Atsushi Imiya,et al.  Fast Spectral Clustering with Random Projection and Sampling , 2009, MLDM.

[20]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[21]  Guangliang Chen,et al.  Scalable spectral clustering with cosine similarity , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[22]  Michael W. Berry,et al.  Document clustering using nonnegative matrix factorization , 2006, Inf. Process. Manag..

[23]  Song Wang,et al.  Attention and amplification in the hybrid media system: The composition and activity of Donald Trump’s Twitter following during the 2016 presidential election , 2018, New Media Soc..