An efficient dictionary-based multi-view learning method

Abstract Multi-view learning can be considered as a kind of classification method which explores common and unique information among different views. For dictionary learning, it can identify informative features by learning sparse representation of samples and has great advantages for classification. However, there are few researches on the problem of multi-view learning with dictionary learning. In order to improve the performance of multi-view classification, we propose a new multi-view dictionary learning with consensus of view(MVDL-CV). First of all, we learn a particular dictionary for each view and obtain the sparse representation of the sample. Then, by utilizing the regularization term between two dictionaries in consensus, we can determine the similarity of samples and obtain the discriminative sparse representation, which can be helpful to construct the improved classifiers. Further, we obtain the solution of the model through an alternating convex optimization method and present the convergence analysis of MVDL-CV. In the experiments, we compare the proposed method with previous multi-view learning methods, and the experimental results show that MVDL-CV is a feasible and competitive method.

[1]  Pavlo Molchanov,et al.  Weakly-Supervised 3D Human Pose Learning via Multi-View Images in the Wild , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[3]  Zi Huang,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence ℓ2,1-Norm Regularized Discriminative Feature Selection for Unsupervised Learning , 2022 .

[4]  Qingming Huang,et al.  Bilevel Multiview Latent Space Learning , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[6]  Yongdong Zhang,et al.  Multiview Spectral Embedding , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Meng Wang,et al.  Multimodal Deep Autoencoder for Human Pose Recovery , 2015, IEEE Transactions on Image Processing.

[8]  Jiewen Zhao,et al.  Complementary-View Multiple Human Tracking , 2020, AAAI.

[9]  Nebojsa Jojic,et al.  LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[10]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[11]  Dong Yue,et al.  Multi-view Discriminant Dictionary Learning via Learning View-specific and Shared Structured Dictionaries for Image Classification , 2017, Neural Processing Letters.

[12]  Feiping Nie,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Multi-View K-Means Clustering on Big Data , 2022 .

[13]  Jun Yu,et al.  Click Prediction for Web Image Reranking Using Multimodal Sparse Coding , 2014, IEEE Transactions on Image Processing.

[14]  Qinghua Hu,et al.  Latent Multi-view Subspace Clustering , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Dong Yue,et al.  Multi-view low-rank dictionary learning for image classification , 2016, Pattern Recognit..

[16]  Jun Yu,et al.  Exploiting Click Constraints and Multi-view Features for Image Re-ranking , 2014, IEEE Transactions on Multimedia.

[17]  Shiliang Sun,et al.  Multi-view learning overview: Recent progress and new challenges , 2017, Inf. Fusion.

[18]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[19]  David Zhang,et al.  Sparse Representation Based Fisher Discrimination Dictionary Learning for Image Classification , 2014, International Journal of Computer Vision.

[20]  Kathrin Klamroth,et al.  Biconvex sets and optimization with biconvex functions: a survey and extensions , 2007, Math. Methods Oper. Res..

[21]  Chenping Hou,et al.  Multiview Classification With Cohesion and Diversity , 2020, IEEE Transactions on Cybernetics.

[22]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Dacheng Tao,et al.  Large-margin multi-view Gaussian process for image classification , 2013, ICIMCS '13.

[24]  Zhi-Hua Zhou,et al.  A New Analysis of Co-Training , 2010, ICML.

[25]  R. Bharat Rao,et al.  Bayesian Co-Training , 2007, J. Mach. Learn. Res..

[26]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[27]  Yuhui Zheng,et al.  Optimal discriminative feature and dictionary learning for image set classification , 2021, Inf. Sci..

[28]  David Zhang,et al.  Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[29]  Bo Liu,et al.  Transfer learning-based one-class dictionary learning for recommendation data stream , 2021, Inf. Sci..

[30]  Zhendong Mao,et al.  Hierarchical multi-view context modelling for 3D object classification and retrieval , 2021, Inf. Sci..

[31]  Tao Zhou,et al.  Multiview Latent Space Learning With Feature Redundancy Minimization , 2020, IEEE Transactions on Cybernetics.

[32]  Jiawei Han,et al.  Orthogonal Laplacianfaces for Face Recognition , 2006, IEEE Transactions on Image Processing.

[33]  Shuicheng Yan,et al.  Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Jian Yang,et al.  A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Lei Zhang,et al.  Projective dictionary pair learning for pattern classification , 2014, NIPS.

[36]  Qingming Huang,et al.  Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Zenglin Xu,et al.  Simple and Efficient Multiple Kernel Learning by Group Lasso , 2010, ICML.