Discriminative deep semi-nonnegative matrix factorization network with similarity maximization for unsupervised feature learning

Abstract Deep Semi-NMF (DSN), which learns hierarchical representations by stacking multiple layers Semi-NMF, shows competitive performance in unsupervised data analysis. However, the features learned from DSN always lack of representativity and discriminativity. In this paper, we build a novel Deep Semi-NMF network (DSNnet) to address the issues of DSN. Specifically, DSNnet contains multiple fully-connected layers, in which the activation function of each layer adopts Smoothly Clipped Absolute Deviation (SCAD). The non-negative hidden features are computed forwardly, while the network parameters are updated by the stochastic gradient descent method. Moreover, to enhance the discriminativity of features, we suggest simultaneously minimizing the reconstruction error of input and output, and maximizing the similarity between input and learned features. The proposed similarity measurement, which consists of global geometric similarity and local pointwise similarity, encourages the compactness between similar points and separateness between dissimilar points in the feature space, and is beneficial to preserve intrinsic information of original data. Extensive experiments conducted on several datasets illustrate the superiority of the proposed approach in comparison with state-of-the-art methods.

[1]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[2]  Yoshua Bengio,et al.  Learning deep representations by mutual information estimation and maximization , 2018, ICLR.

[3]  Naoto Yokoya,et al.  Nonlinear Unmixing of Hyperspectral Data Using Semi-Nonnegative Matrix Factorization , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Christos Boutsidis,et al.  SVD based initialization: A head start for nonnegative matrix factorization , 2008, Pattern Recognit..

[5]  Kaiming He,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Hyunsoo Kim,et al.  Sparse Non-negative Matrix Factorizations via Alternating Non-negativity-constrained Least Squares , 2006 .

[7]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[10]  Bruce A. Draper,et al.  Semi-Nonnegative Matrix Factorization for Motion Segmentation with Missing Data , 2012, ECCV.

[11]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[12]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[13]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[14]  Fuchun Sun,et al.  Nonlinear non-negative matrix factorization using deep learning , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[15]  Bensheng Lyu,et al.  A Deep Orthogonal Non-negative Matrix Factorization Method for Learning Attribute Representations , 2017, ICONIP.

[16]  Chris H. Q. Ding,et al.  Convex and Semi-Nonnegative Matrix Factorizations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[19]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[20]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[21]  Tong Zhang,et al.  Deep Subspace Clustering Networks , 2017, NIPS.

[22]  Yang Wang,et al.  Unsupervised local deep feature for image recognition , 2016, Inf. Sci..

[23]  Xianhua Zeng,et al.  Graph regularized deep semi-nonnegative matrix factorization for clustering , 2016, International Conference on Digital Image Processing.

[24]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[25]  Rongrong Ji,et al.  Nonnegative Spectral Clustering with Discriminative Regularization , 2011, AAAI.

[26]  George Trigeorgis,et al.  A Deep Matrix Factorization Method for Learning Attribute Representations , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Andrzej Cichocki,et al.  Learning the Hierarchical Parts of Objects by Deep Non-Smooth Nonnegative Matrix Factorization , 2018, IEEE Access.

[28]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[29]  Derek Greene,et al.  Ensemble non-negative matrix factorization methods for clustering protein-protein interactions , 2008, Bioinform..