Combating Uncertainty and Class Imbalance in Facial Expression Recognition

Recognition of facial expression is a challenge when it comes to computer vision. The primary reasons are class imbalance due to data collection and uncertainty due to inherent noise such as fuzzy facial expressions and inconsistent labels. However, current research has focused either on the problem of class imbalance or on the problem of uncertainty, ignoring the intersection of how to address these two problems. Therefore, in this paper, we propose a framework based on Resnet and Attention to solve the above problems. We design weight for each class. Through the penalty mechanism, our model will pay more attention to the learning of small samples during training, and the resulting decrease in model accuracy can be improved by a Convolutional Block Attention Module (CBAM). Meanwhile, our backbone network will also learn an uncertain feature for each sample. By mixing uncertain features between samples, the model can better learn those features that can be used for classification, thus suppressing uncertainty. Experiments show that our method surpasses most basic methods in terms of accuracy on facial expression data sets (e.g., AffectNet, RAF-DB), and it also solves the problem of class imbalance well.

[1]  Shutao Li,et al.  Facial Expression Recognition With Visual Transformers and Attentional Selective Fusion , 2021, IEEE Transactions on Affective Computing.

[2]  Linlin Shen,et al.  Triplet Loss With Multistage Outlier Suppression and Class-Pair Margins for Facial Expression Recognition , 2022, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Changsheng Xu,et al.  Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data , 2021, IEEE Transactions on Multimedia.

[4]  M. Mahoor,et al.  Ad-Corre: Adaptive Correlation-Based Loss for Facial Expression Recognition in the Wild , 2022, IEEE Access.

[5]  Tao Mei,et al.  Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Xiaojun Qi,et al.  Facial Expression Recognition in the Wild via Deep Attentive Center Loss , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7]  Weihong Deng,et al.  Relative Uncertainty Learning for Facial Expression Recognition , 2021, NeurIPS.

[8]  Jianfei Yang,et al.  Suppressing Uncertainties for Large-Scale Facial Expression Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Mohammad H. Mahoor,et al.  AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild , 2017, IEEE Transactions on Affective Computing.

[10]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[11]  Junping Du,et al.  Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Pascal Vincent,et al.  Disentangling Factors of Variation for Facial Expression Recognition , 2012, ECCV.

[13]  Takeo Kanade,et al.  Recognizing Action Units for Facial Expression Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..