A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition

Caricature is a simple and abstract description of a person using her/his exaggerated characteristics. Due to amplified facial variations in the caricatures and significant differences among caricature and real face modalities, building vision models for recognizing each other between these modalities is an extremely challenging task. In addition, it is not easy to collect abundant samples of real faces and corresponding caricatures for training vision models, which makes the recognition more difficult. In this paper, we propose a novel relation network via meta learning to address the problem of few-shot caricature face recognition. In particular, we present a deep relation network to capture and memorize the relation among different samples. To employ the prior knowledge, we combine learned deep and handcrafted features to form the hybrid-prior representation via joint meta learning. Final recognition is derived from our relation network by learning to compare between the hybrid-prior features of samples. Experimental results on three caricature datasets of WebCaricature, IIIT-CFW, and Caricature-207 demonstrate that our method performs better than many existing ones for few-shot caricature recognition.

[1]  Narayanan C. Krishnan,et al.  Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet) , 2018, ACM Multimedia.

[2]  C. V. Jawahar,et al.  IIIT-CFW: A Benchmark Database of Cartoon Faces in the Wild , 2016, ECCV Workshops.

[3]  Zhengming Ma,et al.  Face recognition based on the fusion of global and local HOG features of face images , 2014, IET Comput. Vis..

[4]  Gaoyun An,et al.  A Novel Facial Expression Recognition Approach Based on MoSift , 2015 .

[5]  Jiashu Zhang,et al.  Discriminant Locality Preserving Projections Based on L1-Norm Maximization , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Tayfun Akgül,et al.  Matching caricatures to photographs , 2015, Signal, Image and Video Processing.

[7]  Sajid Javed,et al.  Robust Subspace Learning: Robust PCA, Robust Subspace Tracking, and Robust Subspace Recovery , 2017, IEEE Signal Processing Magazine.

[8]  Georgios Tziritas,et al.  Face Detection Using Quantized Skin Color Regions Merging and Wavelet Packet Analysis , 1999, IEEE Trans. Multim..

[9]  Md Jan Nordin,et al.  Combining Local Binary Pattern and Principal Component Analysis on T-Zone face area for face recognition , 2011, 2011 International Conference on Pattern Analysis and Intelligence Robotics.

[10]  Stefano Berretti,et al.  Boosting 3D LBP-based face recognition by fusing shape and texture descriptors on the mesh , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[11]  Thanapant Raicharoen,et al.  Face recognition using feature extraction based on descriptive statistics of a face image , 2009, 2009 International Conference on Machine Learning and Cybernetics.

[12]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[13]  Xuelong Li,et al.  Fast Haar Transform Based Feature Extraction for Face Representation and Recognition , 2009, IEEE Transactions on Information Forensics and Security.

[14]  Yinghuan Shi,et al.  WebCaricature: a benchmark for caricature recognition , 2017, BMVC.

[15]  Tat-Seng Chua,et al.  An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization , 2009, ICML '09.

[16]  Anil K. Jain,et al.  Towards automated caricature recognition , 2012, 2012 5th IAPR International Conference on Biometrics (ICB).

[17]  Cheng Li,et al.  Pose-Robust Face Recognition via Deep Residual Equivariant Mapping , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Lei Zhang,et al.  Gabor Feature Based Sparse Representation for Face Recognition with Gabor Occlusion Dictionary , 2010, ECCV.

[19]  Gholamreza Anbarjafari,et al.  Pose Invariant Face Recognition Using Probability Distribution Functions in Different Color Channels , 2008, IEEE Signal Processing Letters.

[20]  Andrew Zisserman,et al.  Multicolumn Networks for Face Recognition , 2018, BMVC.

[21]  Xiang Zhang,et al.  Metric Learning from Relative Comparisons by Minimizing Squared Residual , 2012, 2012 IEEE 12th International Conference on Data Mining.

[22]  Tao Xiang,et al.  Learning to Compare: Relation Network for Few-Shot Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.