An ensemble convolutional echo state networks for facial expression recognition

Facial expressions recognition (FER) plays a much important role in various applications from human-computer interfaces to psychological tests. However, most methods are confronted with the quality of the face images, vanishing gradients problem, over-trained problem, difference of face images such as in age and ethnicity, mulitple parameters required tuning, and dubious class labels in the training data. These negative factors largely hurt the recognition performance. To alleviate these problems, this paper proposes an new approach named ensemble convolutional echo state network, which takes Echo State Network (ESN) as the base classifier for ensemble and Convolutional Network (CN) to transform the input face image for further feeding to ESN, where the random parameters and architectures are assigned to ensure the diversity of the ensemble and to avoid computing stochastic gradient. Based on the rich dynamics of ESN and rich variations of input face image finished by CN, the proposed approach has the great ability to deal with the real facial expression recognition and to be scaled to the larger training data. It has also only one parameter to be adjusted. Conducted experiments show that the method achieves significant improvement over current methods on person-independent facial expression recognition.

[1]  Surendra Ranganath,et al.  Cloud basis function neural network: A modified RBF network architecture for holistic facial expression recognition , 2008, Pattern Recognit..

[2]  Peter Tiño,et al.  Minimum Complexity Echo State Network , 2011, IEEE Transactions on Neural Networks.

[3]  Anastasios Delopoulos,et al.  Linear subspaces for facial expression recognition , 2014, Signal Process. Image Commun..

[4]  Ying Chen,et al.  Facial Expression Recognition via Non-Negative Least-Squares Sparse Coding , 2014, Inf..

[5]  Friedhelm Schwenker,et al.  Neural Network Ensembles in Reinforcement Learning , 2013, Neural Processing Letters.

[6]  Jake K. Aggarwal,et al.  Spontaneous facial expression recognition: A robust metric learning approach , 2014, Pattern Recognit..

[7]  Y-Lan Boureau,et al.  Learning Convolutional Feature Hierarchies for Visual Recognition , 2010, NIPS.

[8]  Günther Palm,et al.  Real-Time Emotion Recognition from Speech Using Echo State Networks , 2008, ANNPR.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Luiz Eduardo Soares de Oliveira,et al.  Fusion of feature sets and classifiers for facial expression recognition , 2013, Expert Syst. Appl..

[11]  Nicolás García-Pedrajas,et al.  Supervised subspace projections for constructing ensembles of classifiers , 2012, Inf. Sci..

[12]  Deepak Ghimire,et al.  Extreme Learning Machine Ensemble Using Bagging for Facial Expression Recognition , 2014, J. Inf. Process. Syst..

[13]  Yanhua Zhang,et al.  Multi-classifier Fusion Based Facial Expression Recognition Approach , 2014, KSII Trans. Internet Inf. Syst..

[14]  Bernard Fong,et al.  Affective Computing in Consumer Electronics , 2012, IEEE Trans. Affect. Comput..

[15]  Friedhelm Schwenker,et al.  Selective neural network ensembles in reinforcement learning: Taking the advantage of many agents , 2015, Neurocomputing.

[16]  Zhen Li,et al.  Recognizing Emotions From an Ensemble of Features , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[17]  Yann LeCun,et al.  Understanding Deep Architectures using a Recursive Convolutional Network , 2013, ICLR.

[18]  Fernando Fernández Martínez,et al.  Towards a robust affect recognition: Automatic facial expression recognition in 3D faces , 2015, Expert Syst. Appl..

[19]  Jang Myung Lee,et al.  Fuzzy Echo State Neural Networks and Funnel Dynamic Surface Control for Prescribed Performance of a Nonlinear Dynamic System , 2014, IEEE Transactions on Industrial Electronics.

[20]  Lijun Yin,et al.  Static and dynamic 3D facial expression recognition: A comprehensive survey , 2012, Image Vis. Comput..

[21]  Raúl Monge,et al.  Parallel Approach for Ensemble Learning with Locally Coupled Neural Networks , 2010, Neural Processing Letters.

[22]  Min Han,et al.  Subspace Echo State Network for Multivariate Time Series Prediction , 2012, ICONIP.