ASD: Towards Attribute Spatial Decomposition for Prior-Free Facial Attribute Recognition

—Representing the spatial properties of facial at- tributes is a vital challenge for facial attribute recognition (FAR). Recent advances have achieved the reliable performances for FAR, benefiting from the description of spatial properties via extra prior information. However, the extra prior information might not be always available, resulting in the restricted application scenario of the prior-based methods. Meanwhile, the spatial ambiguity of facial attributes caused by inherent spatial diversities of facial parts is ignored. To address these issues, we propose a prior-free method for attribute spatial decomposition (ASD), mitigating the spatial ambiguity of facial attributes without any extra prior information. Specifically, assignment-embedding module (AEM) is proposed to enable the procedure of ASD, which consists of two operations: attribute-to-location assignment and location-to-attribute embedding. The attribute- to-location assignment first decomposes the feature map based on latent factors, assigning the magnitude of attribute components on each spatial location. Then, the assigned attribute components from all locations to represent the global-level attribute embed- dings. Furthermore, correlation matrix minimization (CMM) is introduced to enlarge the discriminability of attribute embed- dings. Experimental results demonstrate the superiority of ASD compared with state-of-the-art prior-based methods, while the reliable performance of ASD for the case of limited training data is further validated. the blank of prior-free method in FAR task.

[1]  Tapan K. Gandhi,et al.  Landmark-Aware and Part-Based Ensemble Transfer Learning Network for Static Facial Expression Recognition from Images , 2023, IEEE Transactions on Artificial Intelligence.

[2]  Swagatam Das,et al.  Do Preprocessing and Class Imbalance Matter to the Deep Image Classifiers for COVID-19 Detection? An Explainable Analysis , 2023, IEEE Transactions on Artificial Intelligence.

[3]  Wei Wang,et al.  Character-Level Street View Text Spotting Based on Deep Multisegmentation Network for Smarter Autonomous Driving , 2022, IEEE Transactions on Artificial Intelligence.

[4]  Coen de Vente,et al.  Automated COVID-19 Grading With Convolutional Neural Networks in Computed Tomography Scans: A Systematic Comparison , 2021, IEEE Transactions on Artificial Intelligence.

[5]  C. L. P. Chen,et al.  Enhanced Broad Siamese Network for Facial Emotion Recognition in Human–Robot Interaction , 2021, IEEE Transactions on Artificial Intelligence.

[6]  Alireza Zaeemzadeh Face Image Retrieval with Attribute Manipulation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[7]  Chunhua Shen,et al.  Learning Spatial-Semantic Relationship for Facial Attribute Recognition with Limited Labeled Data , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Zhenghao Chen,et al.  Improving Facial Attribute Recognition by Group and Graph Learning , 2021, 2021 IEEE International Conference on Multimedia and Expo (ICME).

[9]  Feng Lu,et al.  PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation , 2021, AAAI.

[10]  Yongxiong Wang,et al.  An Efficient Convolutional Neural Network Model Based on Object-Level Attention Mechanism for Casting Defect Detection on Radiography Images , 2020, IEEE Transactions on Industrial Electronics.

[11]  Tat-Seng Chua,et al.  Generating Face Images With Attributes for Free , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Wei Wu,et al.  Hierarchical Feature Embedding for Attribute Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Peter Wonka,et al.  Disentangled Image Generation Through Structured Noise Injection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Vishal M. Patel,et al.  Multi-Scale Thermal to Visible Face Verification via Attribute Guided Synthesis , 2020, IEEE Transactions on Biometrics, Behavior, and Identity Science.

[15]  Itamar Friedman,et al.  TResNet: High Performance GPU-Dedicated Architecture , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[16]  Yan Yan,et al.  Deep Multi-Task Multi-Label CNN for Effective Facial Attribute Classification , 2020, IEEE Transactions on Affective Computing.

[17]  David Berthelot,et al.  FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence , 2020, NeurIPS.

[18]  Ying Wu,et al.  Recognizing Part Attributes With Insufficient Data , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Hassan Foroosh,et al.  Slim-CNN: A Light-Weight CNN for Face Attribute Prediction , 2019, 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020).

[20]  Zhe L. Lin,et al.  Semantic Component Decomposition for Face Attribute Manipulation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[22]  Feiyue Huang,et al.  Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition , 2018, IJCAI.

[23]  Zhongfei Zhang,et al.  Partially Shared Multi-task Convolutional Neural Network with Local Constraint for Face Attribute Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Shuicheng Yan,et al.  Landmark Free Face Attribute Prediction , 2018, IEEE Transactions on Image Processing.

[25]  Nikos Komodakis,et al.  Unsupervised Representation Learning by Predicting Image Rotations , 2018, ICLR.

[26]  Shiguang Shan,et al.  Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Shin Ishii,et al.  Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Rama Chellappa,et al.  Attributes for Improved Attributes: A Multi-Task Network Utilizing Implicit and Explicit Relationships for Facial Attribute Classification , 2017, AAAI.

[29]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Yu Qiao,et al.  Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Paolo Favaro,et al.  Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles , 2016, ECCV.

[32]  Terrance E. Boult,et al.  MOON: A Mixed Objective Optimization Network for the Recognition of Facial Attributes , 2016, ECCV.

[33]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Honglak Lee,et al.  Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[35]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[36]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[37]  Dahua Lin,et al.  Hidden Factor Analysis for Age Invariant Face Recognition , 2013, 2013 IEEE International Conference on Computer Vision.

[38]  Yan-Ying Chen,et al.  Scalable Face Image Retrieval Using Attribute-Enhanced Sparse Codewords , 2013, IEEE Transactions on Multimedia.

[39]  Shree K. Nayar,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Describable Visual Attributes for Face Verification and Image Search , 2022 .

[40]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.