Low-Rank Regularized Deep Collaborative Matrix Factorization for Micro-Video Multi-Label Classification

Deep matrix factorization can be regarded as an extension of traditional matrix factorization to help improve applications like social image tag refinement, image retrieval, and face clustering. Toward this tendency, in this letter, we proposed a low-rank regularized deep collaborative matrix factorization (LRDCMF) method to better tackle micro-video multi-label classification tasks. The proposed method aims to collaboratively learn two sets of factor matrices for characterization of latent attributes and two deep representations for instances and labels, respectively. During factorization process, the inverse covariance constraints are exploited to capture the latent correlation structures among latent attributes and labels and the low-dimensional intrinsic deep representations are ensured by further considering low-rank constraints. Moreover, a triplet term that connects instances representations, label representations, and labels is constructed to increase discrimination power of our method. Experimental results conducted on a large-scale micro-video dataset illustrate our model achieves superior performance in comparison with state-of-the-art methods.

[1]  Shuicheng Yan,et al.  Latent Low-Rank Representation for subspace segmentation and feature extraction , 2011, 2011 International Conference on Computer Vision.

[2]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[3]  George Trigeorgis,et al.  A Deep Matrix Factorization Method for Learning Attribute Representations , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Yiu-ming Cheung,et al.  Toward Efficient Image Representation: Sparse Concept Discriminant Matrix Factorization , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Jing Zhang,et al.  Low-Rank Regularized Heterogeneous Tensor Decomposition for Subspace Clustering , 2018, IEEE Signal Processing Letters.

[7]  Chang Liu,et al.  Low-rank and sparse matrix decomposition via the truncated nuclear norm and a sparse regularizer , 2018, The Visual Computer.

[8]  Shih-Fu Chang,et al.  Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Yong Yu,et al.  Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[10]  Meng Liu,et al.  Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning , 2019, IEEE Transactions on Image Processing.

[11]  Yun Fu,et al.  Learning Robust and Discriminative Subspace With Low-Rank Constraints , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Xiangnan He,et al.  MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video , 2019, ACM Multimedia.

[13]  Jicong Fan,et al.  Matrix completion by deep matrix factorization , 2018, Neural Networks.

[14]  Ivica Kopriva,et al.  A Nonlinear Orthogonal Non-Negative Matrix Factorization Approach to Subspace Clustering , 2017, Pattern Recognit..

[15]  Junping Du,et al.  Deep Constrained Low-Rank Subspace Learning for Multi-View Semi-Supervised Classification , 2019, IEEE Signal Processing Letters.

[16]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[17]  Shujian Huang,et al.  Deep Matrix Factorization Models for Recommender Systems , 2017, IJCAI.

[18]  Václav Šmídl,et al.  Bayesian Non-Negative Matrix Factorization With Adaptive Sparsity and Smoothness Prior , 2019, IEEE Signal Processing Letters.

[19]  Qi Tian,et al.  Enhancing Micro-video Understanding by Harnessing External Sounds , 2017, ACM Multimedia.

[20]  Andrzej Cichocki,et al.  Two Efficient Algorithms for Approximately Orthogonal Nonnegative Matrix Factorization , 2015, IEEE Signal Processing Letters.

[21]  Yun Fu,et al.  Multi-View Clustering via Deep Matrix Factorization , 2017, AAAI.

[22]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[23]  Dit-Yan Yeung,et al.  A Convex Formulation for Learning Task Relationships in Multi-Task Learning , 2010, UAI.

[24]  Jinhui Tang,et al.  Weakly Supervised Deep Matrix Factorization for Social Image Understanding , 2017, IEEE Transactions on Image Processing.