Facial Expression Recognition in the Wild Using FAM, RRDB and Vision Transformer Based Convolutional Neural Networks

Facial expression recognition in the wild is a quite challenge task because facial images captured in natural settings are often affected by various factors such as irregular occlusions, inconsistent face angles and varying light levels. To overcome these factors, we presented a new triple-branch deep neural network model containing facial attention module, residual in residual dense block and vision transformer to deal with facial expression recognition problem. The facial attention module can help backbone network extract more useful features. Residual in residual dense block and vision transformer can further get more detailed features. In addition to the image problems caused by above factors, the ratio of number of images for different expressions in the dataset is also very different. We proposed a new loss function to tackle this uneven ration problem. Experimental results on two benchmark in-the-wild datasets show that our model is indeed helpful for images in the wild. We also created a new expressions dataset called Fairfaceplus, which is built by adding expression categories to the original label on FairFace dataset. The code of our proposed method will be made available on GitHub https://github.com/dreampledge/fairfaceplus.

[1]  Kuan-Hsien Liu,et al.  Fusion of Triple Attention to Residual in Residual Dense Block to Attention Based CNN for Facial Expression Recognition , 2022, 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[2]  Kuan-Hsien Liu,et al.  Fabric Defect Detection VIA Unsupervised Neural Networks , 2022, 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[3]  Kuan-Hsien Liu,et al.  Unsupervised UNet for Fabric Defect Detection , 2022, 2022 IEEE International Conference on Consumer Electronics - Taiwan.

[4]  Kuan-Hsien Liu,et al.  Clothing Recommendation Based on Deep Learning , 2022, 2022 IEEE International Conference on Consumer Electronics - Taiwan.

[5]  Kuan-Hsien Liu,et al.  Mix Attention Based Convolutional Neural Network for Clothing Brand Logo Recognition and Classification , 2021, IEEE International Conference on Systems, Man and Cybernetics.

[6]  Kuan-Hsien Liu,et al.  CBL: A Clothing Brand Logo Dataset and a New Method for Clothing Brand Recognition , 2021, 2020 28th European Signal Processing Conference (EUSIPCO).

[7]  Xiaojun Qi,et al.  Facial Expression Recognition in the Wild via Deep Attentive Center Loss , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8]  Jungseock Joo,et al.  FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[9]  D. Tao,et al.  A Survey on Vision Transformer , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  C. Thongchaisuratkrul,et al.  Development of Application and Face Recognition for Smart Home , 2020, 2020 International Conference on Power, Energy and Innovations (ICPEI).

[11]  Kuan-Hsien Liu,et al.  Identifying Poultry Farms from Satellite Images with Residual Dense U-Net , 2020, 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[12]  Kuan-Hsien Liu,et al.  Locating Waterfowl Farms from Satellite Images with Parallel Residual U-Net Architecture , 2020, 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[13]  Jianfei Yang,et al.  Suppressing Uncertainties for Large-Scale Facial Expression Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Chang Xu,et al.  Efficient Residual Dense Block Search for Image Super-Resolution , 2019, AAAI.

[15]  Yu Qiao,et al.  Frame Attention Networks for Facial Expression Recognition in Videos , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[16]  Kuan-Hsien Liu,et al.  A Structure-Based Human Facial Age Estimation Framework Under a Constrained Condition , 2019, IEEE Transactions on Image Processing.

[17]  Jianfei Yang,et al.  Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition , 2019, IEEE Transactions on Image Processing.

[18]  Shiguang Shan,et al.  Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism , 2019, IEEE Transactions on Image Processing.

[19]  Y. Fu,et al.  Residual Dense Network for Image Restoration , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Sasmoko,et al.  Face Detection and Recognition Based E-Learning for Students Authentication: Study Literature Review , 2018, 2018 International Conference on Information Management and Technology (ICIMTech).

[21]  Andrew Zisserman,et al.  Emotion Recognition in Speech using Cross-Modal Transfer in the Wild , 2018, ACM Multimedia.

[22]  Soo-Chang Pei,et al.  Spatio-Temporal Interactive Laws Feature Correlation Method to Video Quality Assessment , 2018, 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[23]  Tsung-Jung Liu,et al.  No-Reference Image Quality Assessment by Wide-Perceptual-Domain Scorer Ensemble Method , 2018, IEEE Transactions on Image Processing.

[24]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26]  Junping Du,et al.  Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Shuicheng Yan,et al.  Robust LSTM-Autoencoders for Face De-Occlusion in the Wild , 2016, IEEE Transactions on Image Processing.

[28]  Soo-Chang Pei,et al.  Age estimation via fusion of multiple binary age grouping systems , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[29]  Emad Barsoum,et al.  Training deep networks for facial expression recognition with crowd-sourced label distribution , 2016, ICMI.

[30]  Chu-Song Chen,et al.  MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction , 2016, ICMR.

[31]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Junmo Kim,et al.  Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33]  Shuicheng Yan,et al.  Age Estimation via Grouping and Decision Fusion , 2015, IEEE Transactions on Information Forensics and Security.

[34]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[35]  Shuicheng Yan,et al.  Age group classification via structured fusion of uncertainty-driven shape features and selected surface features , 2014, IEEE Winter Conference on Applications of Computer Vision.

[36]  王小勇 Access control system based on face recognition , 2011 .

[37]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[38]  Maja Pantic,et al.  Web-based database for facial expression analysis , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[39]  P. Ekman,et al.  Constants across cultures in the face and emotion. , 1971, Journal of personality and social psychology.

[40]  Hu Han,et al.  Semi-Supervised Natural Face De-Occlusion , 2021, IEEE Transactions on Information Forensics and Security.

[41]  Jiande Sun,et al.  Eye Recognition With Mixed Convolutional and Residual Network (MiCoRe-Net) , 2018, IEEE Access.