论文信息 - Pose-robust Face Recognition by Deep Meta Capsule network-based Equivariant Embedding

Pose-robust Face Recognition by Deep Meta Capsule network-based Equivariant Embedding

Despite the exceptional success in face recognition related technologies, handling large pose variations still remains a key challenge. Current techniques for pose-robust face recognition either, directly extract pose-invariant features, or first synthesize a face that matches the target pose before feature extraction. It is more desirable to learn face representations equivariant to pose variations. To this end, this paper proposes a deep meta Capsule network-based Equivariant Embedding Model (DM-CEEM) with three distinct novelties. First, the proposed RB-CapsNet allows DM-CEEM to learn an equivariant embedding for pose variations and achieve the desired transformation for input face images. Second, we introduce a new version of a Capsule network called RB-CapsNet to extend CapsNet to perform a profile-to-frontal face transformation in deep feature space. Third, we train the DM-CEEM in a meta way by treating a single overall classification target as multiple sub-tasks that satisfy certain unknown probabilities. In each sub-task, we sample the support and query sets randomly. The experimental results on both controlled and in-the-wild databases demonstrate the superiority of DM-CEEM over state-of-the-art.

[1] Carlos D. Castillo,et al. Triplet probabilistic embedding for face verification and clustering , 2016, 2016 IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS).

[2] Federico Tombari,et al. 3D Point Capsule Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Zhenan Sun,et al. Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization , 2018, NeurIPS.

[4] Cordelia Schmid,et al. A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Honglak Lee,et al. Learning Invariant Representations with Local Transformations , 2012, ICML.

[6] Hien Van Nguyen,et al. Fast CapsNet for Lung Cancer Screening , 2018, MICCAI.

[7] Yu Liu,et al. Quality Aware Network for Set to Set Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Misha Denil,et al. Learning to Learn without Gradient Descent by Gradient Descent , 2016, ICML.

[9] Ming Yang,et al. DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Aurko Roy,et al. Learning to Remember Rare Events , 2017, ICLR.

[11] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Anil K. Jain,et al. Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Xilin Chen,et al. Deformable face net for pose invariant face recognition , 2020, Pattern Recognit..

[14] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[15] Rama Chellappa,et al. Fisher vector encoded deep convolutional features for unconstrained face verification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[16] Bong-Nam Kang,et al. Attentional Feature-Pair Relation Networks for Accurate Face Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17] Cheng Li,et al. Pose-Robust Face Recognition via Deep Residual Equivariant Mapping , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18] Pascal Libuschewski,et al. Group Equivariant Capsule Networks , 2018, NeurIPS.

[19] Geoffrey E. Hinton,et al. Matrix capsules with EM routing , 2018, ICLR.

[20] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[21] Xiaoming Liu,et al. Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition , 2017, IEEE Transactions on Image Processing.

[22] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.

[23] Xiaogang Wang,et al. Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[24] Max Welling,et al. Group Equivariant Convolutional Networks , 2016, ICML.

[25] Joan Bruna,et al. Few-Shot Learning with Graph Neural Networks , 2017, ICLR.

[26] Yuxiao Hu,et al. MS-Celeb-1M: Challenge of Recognizing One Million Celebrities in the Real World , 2016, IMAWM.

[27] Stephan J. Garbin,et al. Harmonic Networks: Deep Translation and Rotation Equivariance , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Stefanos Zafeiriou,et al. Robust Statistical Face Frontalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[29] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[30] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.

[31] Elena Marchiori,et al. Location Sensitive Deep Convolutional Neural Networks for Segmentation of White Matter Hyperintensities , 2016, Scientific Reports.

[32] Stefan Roth,et al. Learning rotation-aware features: From invariant priors to equivariant descriptors , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33] Carlos D. Castillo,et al. Frontal to profile face verification in the wild , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[34] Yee Whye Teh,et al. Stacked Capsule Autoencoders , 2019, NeurIPS.

[35] Dacheng Tao,et al. Robust Face Recognition via Multimodal Deep Face Representation , 2015, IEEE Transactions on Multimedia.

[36] Bo Huang,et al. Toward End-to-End Face Recognition Through Alignment Learning , 2017, IEEE Signal Processing Letters.

[37] Angelo Cangelosi,et al. Head pose estimation in the wild using Convolutional Neural Networks and adaptive gradient methods , 2017, Pattern Recognit..

[38] Gérard G. Medioni,et al. Pose-Aware Face Recognition in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Fang Zhao,et al. Towards Pose Invariant Face Recognition in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40] Xiaoming Liu,et al. Disentangled Representation Learning GAN for Pose-Invariant Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41] Lijun Zhao,et al. Remote Sensing Image Scene Classification Using CNN-CapsNet , 2019, Remote. Sens..

[42] Geoffrey E. Hinton,et al. Dynamic Routing Between Capsules , 2017, NIPS.

[43] Max Welling,et al. Spherical CNNs , 2018, ICLR.

[44] Stéphane Mallat,et al. Deep roto-translation scattering for object classification , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Geoffrey E. Hinton,et al. Transforming Autoencoders , 2011 .

[46] Premkumar Natarajan,et al. CapsuleGAN: Generative Adversarial Capsule Network , 2018, ECCV Workshops.

[47] Ranga Rodrigo,et al. DeepCaps: Going Deeper With Capsule Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.

[49] Hong Yu,et al. Meta Networks , 2017, ICML.