Gated Fusion of Discriminant Features for Caricature Recognition

Caricature recognition is a challenging problem, because there are typically geometric deformations between photographs and caricatures. It is nontrivial to learn discriminant large-margin features. To combat this challenge, we propose a novel framework by using a gated fusion of global and local discriminant features. First, we employ A-Softmax loss to jointly learn angularly discriminant features of the whole face and local facial parts. Besides, we use the convolutional block attention module (CBAM) to further boost the discriminant ability of the learnt features. Next, we use global features as dominant representation and local features as supplemental ones; and propose a gated fusion unit to automatically learn the weighting factors for these local parts and moderate local features correspondingly. Finally, an integration of all these features is used for caricature recognition. Extensive experiments are conducted on the cross-modal face recognition task. Results show that, our method significantly boosts previous state-of-the-art Rank-1 and Rank-10 from 36.27% to 55.29% and from 64.37% to 85.78%, respectively, for caricature-to-photograph (C2P) recognition. Besides, our method achieves a Rank-1 of 60.81% and Rank-10 of 89.26% for photograph-to-caricature (P2C) recognition.

[1]  Jing Liao,et al.  CariGANs , 2018, ACM Trans. Graph..

[2]  Bhiksha Raj,et al.  SphereFace: Deep Hypersphere Embedding for Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Tayfun Akgül,et al.  Matching caricatures to photographs , 2015, Signal, Image and Video Processing.

[4]  Yinghuan Shi,et al.  WebCaricature: a benchmark for caricature recognition , 2017, BMVC.

[5]  Xinbo Gao,et al.  A Deep Collaborative Framework for Face Photo–Sketch Synthesis , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Yinghuan Shi,et al.  A Joint Local and Global Deep Metric Learning Method for Caricature Recognition , 2018, ACCV.

[7]  Anil K. Jain,et al.  Towards automated caricature recognition , 2012, 2012 5th IAPR International Conference on Biometrics (ICB).

[8]  Anil K. Jain,et al.  WarpGAN: Automatic Caricature Generation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Narayanan C. Krishnan,et al.  Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet) , 2018, ACM Multimedia.

[10]  Yinghuan Shi,et al.  Variation Robust Cross-Modal Metric Learning for Caricature Recognition , 2017, ACM Multimedia.

[11]  Xinbo Gao,et al.  Graphical Representation for Heterogeneous Face Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.