Neural Manifold Clustering and Embedding

Given a union of non-linear manifolds, non-linear subspace clustering or manifold clustering aims to cluster data points based on manifold structures and also learn to parameterize each manifold as a linear subspace in a feature space. Deep neural networks have the potential to achieve this goal under highly non-linear settings given their large capacity and flexibility. We argue that achieving manifold clustering with neural networks requires two essential ingredients: a domain-specific constraint that ensures the identification of the manifolds, and a learning algorithm for embedding each manifold to a linear subspace in the feature space. This work shows that many constraints can be implemented by data augmentation. For subspace feature learning, Maximum Coding Rate Reduction (MCR) objective can be used. Putting them together yields Neural Manifold Clustering and Embedding (NMCE), a novel method for general purpose manifold clustering, which significantly outperforms autoencoder-based deep subspace clustering. Further, on more challenging natural image datasets, NMCE can also outperform other algorithms specifically designed for clustering. Qualitatively, we demonstrate that NMCE learns a meaningful and interpretable feature space. As the formulation of NMCE is closely related to several important Self-supervised learning (SSL) methods, we believe this work can help us build a deeper understanding on SSL representation learning.

[1]  Seunghoon Hong,et al.  Improving Unsupervised Image Clustering With Robust Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Xinlei Chen,et al.  Exploring Simple Siamese Representation Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Nicu Sebe,et al.  Solo-learn: A Library of Self-supervised Methods for Visual Representation Learning , 2021, ArXiv.

[4]  Daniel P. Robinson,et al.  Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Dimitris N. Metaxas,et al.  Deep Subspace Clustering with Data Augmentation , 2020, NeurIPS.

[6]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[9]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[10]  Jiashi Feng,et al.  Deep Adversarial Subspace Clustering , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[11]  Chong You,et al.  A Critique of Self-Expressive Deep Subspace Clustering , 2020, ArXiv.

[12]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[13]  Ben Poole,et al.  Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.

[14]  Gérard G. Medioni,et al.  Robust Multiple Manifolds Structure Learning , 2012, ICML 2012.

[15]  Matthias Bethge,et al.  Contrastive Learning Inverts the Data Generating Process , 2021, ICML.

[16]  René Vidal,et al.  Sparse Manifold Clustering and Embedding , 2011, NIPS.

[17]  René Vidal,et al.  Learning a Self-Expressive Network for Subspace Clustering , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Julien Mairal,et al.  Unsupervised Learning of Visual Features by Contrasting Cluster Assignments , 2020, NeurIPS.

[19]  Kaiming He,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Aiden Durrant,et al.  Hyperspherically Regularized Networks for BYOL Improves Feature Uniformity and Separability , 2021, ArXiv.

[21]  Robert Pless,et al.  Manifold clustering , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[22]  Chen Min,et al.  Self-Supervised Deep Multi-View Subspace Clustering , 2019, ACML.

[23]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[24]  Ürün Dogan,et al.  Representation Learning for Clustering via Building Consensus , 2021, ArXiv.

[25]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[26]  S. Shankar Sastry,et al.  Generalized Principal Component Analysis , 2016, Interdisciplinary applied mathematics.

[27]  Yang You,et al.  Large Batch Training of Convolutional Networks , 2017, 1708.03888.

[28]  Jun Zhu,et al.  MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering , 2021, ICLR.

[29]  René Vidal,et al.  Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2012, IEEE transactions on pattern analysis and machine intelligence.

[30]  Daniel P. Robinson,et al.  Scalable Exemplar-based Subspace Clustering on Class-Imbalanced Data , 2018, European Conference on Computer Vision.

[31]  Dezhong Peng,et al.  Contrastive Clustering , 2021, AAAI.

[32]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[33]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[34]  Tong Zhang,et al.  Deep Subspace Clustering Networks , 2017, NIPS.

[35]  Luigi Gresele,et al.  Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style , 2021, NeurIPS.

[36]  Michal Valko,et al.  Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.

[37]  Colin Wei,et al.  Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss , 2021, ArXiv.

[38]  Zhouchen Lin,et al.  Self-Supervised Convolutional Subspace Clustering Network , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Svetha Venkatesh,et al.  Clustering by Maximizing Mutual Information Across Views , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[40]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[41]  Kentaro Takagi,et al.  Clustering-friendly Representation Learning via Instance Discrimination and Feature Decorrelation , 2021, ICLR.

[42]  Yann LeCun,et al.  Barlow Twins: Self-Supervised Learning via Redundancy Reduction , 2021, ICML.

[43]  Nicolas Gillis,et al.  Beyond Linear Subspace Clustering: A Comparative Study of Nonlinear Manifold Clustering Algorithms , 2021, Comput. Sci. Rev..

[44]  John Wright,et al.  Segmentation of Multivariate Mixed Data via Lossy Data Coding and Compression , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Chong Peng,et al.  LogDet Rank Minimization with Application to Subspace Clustering , 2015, Comput. Intell. Neurosci..

[46]  René Vidal,et al.  Kernel sparse subspace clustering , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[47]  Chong You,et al.  Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction , 2020, NeurIPS.

[48]  Phillip Isola,et al.  Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere , 2020, ICML.

[49]  Olga Taran,et al.  ScatSimCLR: self-supervised contrastive learning with pretext task regularization for small-scale datasets , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[50]  Bruno A. Olshausen,et al.  The Sparse Manifold Transform , 2018, NeurIPS.

[51]  Ling Shao,et al.  You Never Cluster Alone , 2021, NeurIPS.

[52]  Jean Ponce,et al.  VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning , 2021, ArXiv.

[53]  Hongdong Li,et al.  Efficient dense subspace clustering , 2014, IEEE Winter Conference on Applications of Computer Vision.

[54]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .