Deep Learning for Illumination Invariant Facial Expression Recognition

In this work we propose a novel method to address illumination invariance for racial expression recognition. We propose a Deep Convolutional Network (CNN) pre-trained as a Deep Stacked Convolutional Autoencoder (SCAE) in a greedy layer-wise unsupervised fashion. The SCAE model learns to encode racial expression images and produce a feature vector with relatively similar illumination, regardless of the luminance level of the input image. Moreover, we propose fine-tuning the stacked shallow autoencoders after each one of these is trained greedily, rather than just at the end, and show that this approach significantly improves the set of illumination invariant features learnt by the SCAE. Finally, we propose the use of a variant rectifier linear unit transfer function that helps the SCAE model reduce or increase the illumination of images with high or low luminance, and show that the lower and upper bounds greatly influence classification performance. The method proposed provides an increase in classification accuracy of 4% on the KDEF dataset and 8% on the CK+ dataset.

[1]  Kin-Man Lam,et al.  Illumination invariant face recognition , 2005, Pattern Recognit..

[2]  Thomas Hofmann,et al.  Greedy Layer-Wise Training of Deep Networks , 2007 .

[3]  Takeo Kanade,et al.  Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[4]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[5]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[6]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[7]  Paul E. Debevec,et al.  Effect of illumination on automatic expression recognition: A novel 3D relightable facial database , 2011, Face and Gesture 2011.

[8]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Mohammad Shukri Salman,et al.  Illumination invariant face recognition system , 2013, 2013 21st Signal Processing and Communications Applications Conference (SIU).

[10]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[11]  Shiguang Shan,et al.  Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[13]  Earnest Paul Ijjina,et al.  Illumination invariant face recognition using convolutional neural networks , 2015, 2015 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES).

[14]  D. Lundqvist,et al.  Karolinska Directed Emotional Faces , 2015 .

[15]  Ramesh Raskar,et al.  Deep video gesture recognition using illumination invariants , 2016, ArXiv.

[16]  Zankhana H. Shah,et al.  Facial Expression Recognition Invariant to Illumination Using ROI Based Local Binary Pattern , 2016 .

[17]  Xiaoyue Jiang,et al.  Local feature hierarchy for face recognition across pose and illumination , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[18]  Nanning Zheng,et al.  Pose-and-illumination-invariant face representation via a triplet-loss trained deep reconstruction model , 2017, Multimedia Tools and Applications.

[19]  Mark Elshaw,et al.  Stacked deep convolutional auto-encoders for emotion recognition from facial expressions , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[20]  Ayyaz Hussain,et al.  Illumination invariant facial expression recognition using selected merged binary patterns for real world images , 2018 .