论文信息 - Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition

Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition

This paper explores multi-task learning (MTL) for face recognition. First, we propose a multi-task convolutional neural network (CNN) for face recognition, where identity classification is the main task and pose, illumination, and expression (PIE) estimations are the side tasks. Second, we develop a dynamic-weighting scheme to automatically assign the loss weights to each side task, which solves the crucial problem of balancing between different tasks in MTL. Third, we propose a pose-directed multi-task CNN by grouping different poses to learn pose-specific identity features, simultaneously across all poses in a joint framework. Last but not least, we propose an energy-based weight analysis method to explore how CNN-based MTL works. We observe that the side tasks serve as regularizations to disentangle the PIE variations from the learnt identity features. Extensive experiments on the entire multi-PIE dataset demonstrate the effectiveness of the proposed approach. To the best of our knowledge, this is the first work using all data in multi-PIE for face recognition. Our approach is also applicable to in-the-wild data sets for pose-invariant face recognition and achieves comparable or better performance than state of the art on LFW, CFP, and IJB-A datasets.

Xiaoming Liu | Xi Yin | Xiaoming Liu | Xi Yin

[1] Xiaogang Wang,et al. Deep Learning Identity-Preserving Face Space , 2013, 2013 IEEE International Conference on Computer Vision.

[2] Hongliang Fei,et al. Structured Feature Selection and Task Relationship Inference for Multi-task Learning , 2011, ICDM.

[3] Anil K. Jain,et al. Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Takeo Kanade,et al. Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[5] Dimitris N. Metaxas,et al. Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[6] Thomas S. Huang,et al. Pose-robust face recognition via sparse representation , 2013, Pattern Recognit..

[7] Sébastien Marcel,et al. Continuously Reproducing Toolchains in Pattern Recognition and Machine Learning Experiments , 2017, ICML 2017.

[8] Jiayu Zhou,et al. Efficient multi-task feature learning with calibration , 2014, KDD.

[9] Du-Sik Park,et al. Rotating your face using multi-task deep neural network , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Xiaoou Tang,et al. Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[11] Michael J. Jones,et al. Fully automatic pose-invariant face recognition via 3D pose normalization , 2011, 2011 International Conference on Computer Vision.

[12] Dacheng Tao,et al. Multi-Task Pose-Invariant Face Recognition , 2015, IEEE Transactions on Image Processing.

[13] Shuicheng Yan,et al. Conditional Convolutional Neural Network for Modality-Aware Face Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14] Rama Chellappa,et al. Fisher vector encoded deep convolutional features for unconstrained face verification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[15] Lei Zhang,et al. Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person , 2013, 2013 IEEE International Conference on Computer Vision.

[16] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Jiayu Zhou,et al. Multi-Task Feature Interaction Learning , 2016, KDD.

[18] Massimiliano Pontil,et al. Convex multi-task feature learning , 2008, Machine Learning.

[19] Michael I. Jordan,et al. Multi-task feature selection , 2006 .

[20] Xiaogang Wang,et al. DeepID3: Face Recognition with Very Deep Neural Networks , 2015, ArXiv.

[21] Jeff A. Bilmes,et al. Deep Canonical Correlation Analysis , 2013, ICML.

[22] Matthew Turk,et al. A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[23] Yu Qiao,et al. A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[24] Tal Hassner,et al. Effective face frontalization in unconstrained images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Yi Li,et al. Bagging Based Efficient Kernel Fisher Discriminant Analysis for Face Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[26] Wen Gao,et al. Lighting Aware Preprocessing for Face Recognition across Varying Illumination , 2010, ECCV.

[27] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[28] Liming Chen,et al. 3D-Aided Face Recognition Robust to Expression and Pose Variations , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29] Gang Wang,et al. Multi-Task CNN Model for Attribute Prediction , 2015, IEEE Transactions on Multimedia.

[30] Martial Hebert,et al. Cross-Stitch Networks for Multi-task Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Kihyuk Sohn,et al. Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.

[32] S. Shan,et al. Maximizing intra-individual correlations for face recognition across pose differences , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[33] Gérard G. Medioni,et al. Pose-Aware Face Recognition in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Xiaoming Liu,et al. Disentangled Representation Learning GAN for Pose-Invariant Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Anil K. Jain,et al. Face Search at Scale , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Xiaogang Wang,et al. Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations , 2014, NIPS.

[37] Zhengyou Zhang,et al. Improving multiview face detection with multi-task deep convolutional neural networks , 2014, IEEE Winter Conference on Applications of Computer Vision.

[38] Shengcai Liao,et al. Learning Face Representation from Scratch , 2014, ArXiv.

[39] Ming Yang,et al. DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40] Amnon Shashua,et al. Learning a Metric Embedding for Face Recognition using the Multibatch Method , 2016, NIPS.

[41] Dacheng Tao,et al. A Comprehensive Survey on Pose-Invariant Face Recognition , 2015, ACM Trans. Intell. Syst. Technol..

[42] Carlos D. Castillo,et al. Triplet probabilistic embedding for face verification and clustering , 2016, 2016 IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS).

[43] Xiaogang Wang,et al. Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[44] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[45] Jiayu Zhou,et al. Interactive Multi-task Relationship Learning , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[46] Xiaoming Liu,et al. Coefficients Pose-Variant Input Recogni 8 on Engine Frontalized Output Generator FF-GAN D Discriminator Extreme Pose Input Frontalized Output , 2017 .

[47] Bhiksha Raj,et al. SphereFace: Deep Hypersphere Embedding for Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Carlos D. Castillo,et al. Frontal to profile face verification in the wild , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[49] Xiaoou Tang,et al. Learning Deep Representation for Face Alignment with Auxiliary Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50] Xiaogang Wang,et al. Pedestrian detection aided by deep learning semantic tasks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[52] Lior Wolf,et al. The Multiverse Loss for Robust Transfer Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53] Xiaoming Liu,et al. Pose-Invariant Face Alignment via CNN-Based Dense 3D Model Fitting , 2017, International Journal of Computer Vision.

[54] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[55] Jian Sun,et al. Joint Cascade Face Detection and Alignment , 2014, ECCV.

[56] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[57] Rama Chellappa,et al. Unconstrained face verification using deep CNN features , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[58] Xin Liu,et al. Morphable Displacement Field Based Image Matching for Face Recognition across Pose , 2012, ECCV.

[59] Xiaogang Wang,et al. Deeply learned face representations are sparse, selective, and robust , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Jieping Ye,et al. An accelerated gradient method for trace norm minimization , 2009, ICML '09.

[61] Dit-Yan Yeung,et al. A Convex Formulation for Learning Task Relationships in Multi-Task Learning , 2010, UAI.

[62] Andrew Zisserman,et al. Fisher Vector Faces in the Wild , 2013, BMVC.

[63] Xiaoming Liu,et al. Large-Pose Face Alignment via CNN-Based Dense 3D Model Fitting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64] Wen Gao,et al. Separability Oriented Preprocessing for Illumination-Insensitive Face Recognition , 2012, ECCV.

[65] Shiguang Shan,et al. Stacked Progressive Auto-Encoders (SPAE) for Face Recognition Across Poses , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[66] Xiangyu Zhu,et al. High-fidelity Pose and Expression Normalization for face recognition in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[67] Jian Sun,et al. Blessing of Dimensionality: High-Dimensional Feature and Its Efficient Compression for Face Verification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[68] Marwan Mattar,et al. Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .