Cross-view classification by joint adversarial learning and class-specificity distribution

Abstract Despite the promising preliminary results, none of existing deep learning based cross-view classification methods simultaneously takes into account both view consistency learning and class-specificity distribution of the extracted features, resulting in unstable classification performance. Moreover, most existing cross-view classification methods are sensitive to scale due to the scale issue of view representations, resulting in unstable view-consistent representations. In this paper, we propose a new deep adversarial network for cross-view classification that attempts to learn robust view-consistent representations by combing the thought of adversarial learning and metric learning in Fisher criterion. Meanwhile, a class-specificity distribution term, which is measured by l12-norm, is employed to make the view-consistent representations with the same label to further have a common distribution in dimension space while view-representations with different labels have different distribution in the intrinsic dimension space. We formulate the aforementioned two concerns into a unified optimization framework. Extensive experiments on several real-world datasets indicate the effectiveness of our method over the other state-of-the-arts.

[1]  Feiping Nie,et al.  A Probabilistic Derivation of LASSO and L12-Norm Feature Selections , 2019, AAAI.

[2]  Wei Xia,et al.  Multi-View Classification via Adaptive Discriminant Analysis , 2019, IEEE Access.

[3]  Feiping Nie,et al.  Exclusive Feature Learning on Arbitrary Structures via \ell_{1, 2}-norm , 2014, NIPS.

[4]  Chris H. Q. Ding,et al.  Joint stage recognition and anatomical annotation of drosophila gene expression patterns , 2012, Bioinform..

[5]  Robert P. W. Duin,et al.  Handwritten digit recognition by combined classifiers , 1998, Kybernetika.

[6]  Moncef Gabbouj,et al.  DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Chris Ding,et al.  Robust Flexible Feature Selection via Exclusive L21 Regularization , 2019, IJCAI.

[8]  Wei Yuan,et al.  Multi-view manifold learning with locality alignment , 2018, Pattern Recognit..

[9]  Jiwen Lu,et al.  Deep Coupled Metric Learning for Cross-Modal Matching , 2017, IEEE Transactions on Multimedia.

[10]  Ahmed M. Elgammal,et al.  A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation , 2016, ICML.

[11]  Wei Yuan,et al.  Multi-view Common Component Discriminant Analysis for Cross-view Classification , 2018, Pattern Recognit..

[12]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[13]  Nebojsa Jojic,et al.  LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Feiping Nie,et al.  Adaptive-weighting discriminative regression for multi-view classification , 2019, Pattern Recognit..

[15]  Xinbo Gao,et al.  Triplet-Based Deep Hashing Network for Cross-Modal Retrieval , 2018, IEEE Transactions on Image Processing.

[16]  Ming-Yu Liu,et al.  Coupled Generative Adversarial Networks , 2016, NIPS.

[17]  Dewei Li,et al.  Multi-view deep metric learning for image classification , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[18]  Ling Chen,et al.  Multi-layer multi-view topic model for classifying advertising video , 2017, Pattern Recognit..

[19]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shiliang Sun,et al.  Multi-view learning overview: Recent progress and new challenges , 2017, Inf. Fusion.

[21]  Maneesh Kumar Singh,et al.  DRIT++: Diverse Image-to-Image Translation via Disentangled Representations , 2019, International Journal of Computer Vision.

[22]  Xuelong Li,et al.  Multi-View Clustering and Semi-Supervised Classification with Adaptive Neighbours , 2017, AAAI.

[23]  Jieping Ye,et al.  Canonical Correlation Analysis for Multilabel Classification: A Least-Squares Formulation, Extensions, and Analysis , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Yifeng He,et al.  Multiview learning via deep discriminative canonical correlation analysis , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[26]  Liu Liu,et al.  Discriminative Cross-View Binary Representation Learning , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[27]  Feiping Nie,et al.  A Closed Form Solution to Multi-View Low-Rank Regression , 2015, AAAI.

[28]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[29]  Shiguang Shan,et al.  Multi-view Deep Network for Cross-View Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Dumitru Erhan,et al.  Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Tao Xiang,et al.  Scalable and Effective Deep CCA via Soft Decorrelation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.