论文信息 - Subspace learning for facial expression recognition: an overview and a new perspective

Subspace learning for facial expression recognition: an overview and a new perspective

For image recognition, an extensive number of subspace-learning methods have been proposed to overcome the high-dimensionality problem of the features being used. In this paper, we first give an overview of the most popular and state-of-the-art subspace-learning methods, and then, a novel manifold-learning method, named soft locality preserving map (SLPM), is presented. SLPM aims to control the level of spread of the different classes, which is closely connected to the generalizability of the learned subspace. We also do an overview of the extension of manifold learning methods to deep learning by formulating the loss functions for training, and further reformulate SLPM into a soft locality preserving (SLP) loss. These loss functions are applied as an additional regularization to the learning of deep neural networks. We evaluate these subspace-learning methods, as well as their deep-learning extensions, on facial expression recognition. Experiments on four commonly used databases show that SLPM effectively reduces the dimensionality of the feature vectors and enhances the discriminative power of the extracted features. Moreover, experimental results also demonstrate that the learned deep features regularized by SLP acquire a better discriminability and generalizability for facial expression recognition.

K. Lam | Xiangjian He | Cigdem Turan | Rui Zhao

[1] Sam Kwong,et al. Constrained Clustering With Dissimilarity Propagation-Guided Graph-Laplacian PCA , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[2] Kin-Man Lam,et al. Deep Multi-task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[3] Sam Kwong,et al. Pairwise Constraint Propagation With Dual Adversarial Manifold Regularization , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[4] Daniel P.K. Lun,et al. Enhancement of a CNN-Based Denoiser Based on Spatial and Spectral Analysis , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[5] Sam Kwong,et al. Clustering-Aware Graph Construction: A Joint Learning Perspective , 2019, IEEE Transactions on Signal and Information Processing over Networks.

[6] Shan Li,et al. Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition , 2019, IEEE Transactions on Image Processing.

[7] Kin-Man Lam,et al. Histogram-based local descriptors for facial expression recognition (FER): A comprehensive study , 2018, J. Vis. Commun. Image Represent..

[8] Hui Wang,et al. Deep Convolutional Neural Network for Facial Expression Recognition Using Facial Parts , 2017, 2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 3rd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[9] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[10] Zhiyuan Li,et al. Island Loss for Learning Discriminative Features in Facial Expression Recognition , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[11] Mahesh M. Goyani,et al. Recognition of Facial Expressions using Local Mean Binary Pattern , 2017, ELCVIA Electronic Letters on Computer Vision and Image Analysis.

[12] Ping Liu,et al. Identity-Aware Convolutional Neural Network for Facial Expression Recognition , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[13] Georgios Tzimiropoulos,et al. How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[14] Yangdong Ye,et al. Rank-based pooling for deep convolutional neural networks , 2016, Neural Networks.

[15] Yu Qiao,et al. A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[16] Jane You,et al. Data-driven facial animation via semi-supervised local patch alignment , 2016, Pattern Recognit..

[17] Jing-Yu Yang,et al. Semi-supervised linear discriminant analysis for dimension reduction and classification , 2016, Pattern Recognit..

[18] Lei Zhang,et al. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[19] Hassan Mansour,et al. Dimensionality reduction of visual features for efficient retrieval and classification , 2016, APSIPA Transactions on Signal and Information Processing.

[20] Duncan Fyfe Gillies,et al. Overfitting in linear feature extraction for classification of high-dimensional image data , 2016, Pattern Recognit..

[21] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Junmo Kim,et al. Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[23] Jian-Jiun Ding,et al. Facial expression recognition based on improved local binary pattern and class-regularized locality preserving projection , 2015, Signal Process..

[24] Ç. Erdem,et al. BAUM-2: a multilingual audio-visual affective face database , 2015, Multimedia Tools and Applications.

[25] Lianfa Bai,et al. A New Supervised Manifold Learning Algorithm , 2015, ICIG.

[26] Binbin Huang,et al. Sparse Autoencoder for Facial Expression Recognition , 2015, 2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom).

[27] Stefan Wermter,et al. Face expression recognition with a 2-channel Convolutional Neural Network , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[28] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] William J. Christmas,et al. When Face Recognition Meets with Deep Learning: An Evaluation of Convolutional Neural Networks for Face Recognition , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[31] Chiou-Ting Hsu,et al. Dual Subspace Nonnegative Graph Embedding for Identity-Independent Expression Recognition , 2015, IEEE Transactions on Information Forensics and Security.

[32] Shiguang Shan,et al. Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis , 2014, ACCV.

[33] Kin-Man Lam,et al. Region-based feature fusion for facial-expression recognition , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[34] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35] Jiwen Lu,et al. PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[36] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.

[37] Andreas Savakis,et al. Manifold based sparse representation for facial understanding in natural images , 2013, Image Vis. Comput..

[38] Jianzhong Wang,et al. An improved locality sensitive discriminant analysis approach for feature extraction , 2013, Multimedia Tools and Applications.

[39] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[40] Behrooz Kamgar-Parsi,et al. Toward Development of a Face Recognition System for Watchlist Surveillance , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41] Lei Zhang,et al. A multi-manifold discriminant analysis method for image feature extraction , 2011, Pattern Recognit..

[42] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[43] Oksam Chae,et al. Robust Facial Expression Recognition Based on Local Directional Pattern , 2010 .

[44] Y. V. Venkatesh,et al. A novel application of self-organizing network for facial expression recognition from radial encoded contours , 2009, Soft Comput..

[45] Shaogang Gong,et al. Facial expression recognition based on Local Binary Patterns: A comprehensive study , 2009, Image Vis. Comput..

[46] Chao Wang,et al. Feature extraction using constrained maximum variance mapping , 2008, Pattern Recognit..

[47] Ville Ojansivu,et al. Blur Insensitive Texture Classification Using Local Phase Quantization , 2008, ICISP.

[48] Haixian Wang,et al. Locality-Preserved Maximum Information Projection , 2008, IEEE Transactions on Neural Networks.

[49] Andrew Zisserman,et al. Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[50] Kun Zhou,et al. Locality Sensitive Discriminant Analysis , 2007, IJCAI.

[51] Bir Bhanu,et al. Evolutionary feature synthesis for facial expression recognition , 2006, Pattern Recognit. Lett..

[52] Shaogang Gong,et al. Appearance Manifold of Facial Expression , 2005, ICCV-HCI.

[53] Deng Cai,et al. Orthogonal locality preserving indexing , 2005, SIGIR '05.

[54] Evgeniy Gabrilovich,et al. Feature Generation for Text Categorization Using World Knowledge , 2005, IJCAI.

[55] Franck Davoine,et al. Facial expression recognition and synthesis based on an appearance model , 2004, Signal Process. Image Commun..

[56] Daoqiang Zhang,et al. Efficient and robust feature extraction by maximum margin criterion , 2003, IEEE Transactions on Neural Networks.

[57] Xiaofei He,et al. Locality Preserving Projections , 2003, NIPS.

[58] Changbo Hu,et al. Manifold of facial expression , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[59] A. Kak,et al. PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[60] Mikhail Belkin,et al. Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[61] L. Saul,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[62] J. Tenenbaum,et al. A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[63] Takeo Kanade,et al. Comprehensive database for facial expression analysis , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[64] Michael J. Lyons,et al. Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[65] Paul D. Gader,et al. Automatic Feature Generation for Handwritten Digit Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[66] Anil K. Jain,et al. Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[67] R. Fisher. THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[68] Andrew L. Maas. Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[69] W. Wong,et al. Supervised optimal locality preserving projection , 2012, Pattern Recognit..

[70] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[71] Stephen Lin,et al. Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[72] Matti Pietikäinen,et al. A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[73] H. Hotelling. Analysis of a complex of statistical variables into principal components. , 1933 .

[74] Matti Pietikäinen,et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, TPAMI-2008-09-0620 1 WLD: A Robust Local Image Descriptor , 2022 .