Position-Squeeze and Excitation Block for Facial Attribute Analysis

In this paper, we focus on multiple facial attribute recognition in a single Convolutional Neural Network (CNN). We propose a Position-Squeeze and Excitation (PSE) module, which incorporates the spatial information of different attributes into CNN training. By adding a lateral branch which computes a weight mask for each attribute, the PSE module can help the network learn features from where attributes naturally appear. Moreover, the module can be added as a branch to any classical convolutional neural network to perform end-to-end multi-attribute classification. Experiments show that, our solution has achieved high accuracy on both the CelebA dataset and the LFWA dataset.

[1]  Xiaoou Tang,et al.  Learning Social Relation Traits from Face Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Rama Chellappa,et al.  Segment-Based Methods for Facial Attribute Detection from Partial Faces , 2018, IEEE Transactions on Affective Computing.

[3]  Shiguang Shan,et al.  Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Shree K. Nayar,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Describable Visual Attributes for Face Verification and Image Search , 2022 .

[5]  Andreas Holzinger,et al.  Augmentor: An Image Augmentation Library for Machine Learning , 2017, J. Open Source Softw..

[6]  Joachim Denzler,et al.  ImageNet pre-trained models with batch normalization , 2016, ArXiv.

[7]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[8]  Kristen Grauman,et al.  Interactively building a discriminative vocabulary of nameable attributes , 2011, CVPR 2011.

[9]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Terrance E. Boult,et al.  Multi-attribute spaces: Calibration for attribute fusion and similarity search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Terrance E. Boult,et al.  AFFACT: Alignment-free facial attribute classification technique , 2016, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[12]  Rama Chellappa,et al.  A Deep Cascade Network for Unaligned Face Attribute Classification , 2017, AAAI.

[13]  Boqing Gong,et al.  Improving Facial Attribute Prediction Using Semantic Segmentation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Tal Hassner,et al.  Effective Unconstrained Face Recognition by Combining Multiple Descriptors and Learned Background Statistics , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Rama Chellappa,et al.  Attributes for Improved Attributes: A Multi-Task Network for Attribute Classification , 2016, ArXiv.