Sparse representations for facial expressions recognition via l1 optimization

In this paper, the principles of sparse signal representation theory are explored in order to perform facial expressions recognition from frontal views. Motivated by the success such methods have demonstrated in the face recognition problem, we formulate the feature extraction procedure in order to achieve facial expression recognition as an l1 optimization problem. We show that the straightforward application of these methods to expressive images imposes certain difficulties. The use of difference images (i.e., the images that are derived from the subtraction of the neutral image from the expressive one) for sparse facial expression representations is justified. The use of expressive facial grids for similar tasks is also studied. Finally, the robustness of the proposed representations under facial image occlusion is shown and the efficacy of the proposed method in a series of experiments is demonstrated.

[1]  E. Candes,et al.  11-magic : Recovery of sparse signals via convex programming , 2005 .

[2]  S. Palmer Hierarchical structure in perceptual representation , 1977, Cognitive Psychology.

[3]  M. Tarr,et al.  Visual Object Recognition , 1996, ISTCS.

[4]  Junzhou Huang,et al.  Simultaneous image transformation and sparse representation recovery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Zhihong Zeng,et al.  A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[7]  J.-Y. Bouguet,et al.  Pyramidal implementation of the lucas kanade feature tracker , 1999 .

[8]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[10]  D. Perrett,et al.  Recognition of objects and their component parts: responses of single units in the temporal cortex of the macaque. , 1994, Cerebral cortex.

[11]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  E. Rolls High-level vision: Object recognition and visual cognition, Shimon Ullman. MIT Press, Bradford (1996), ISBN 0 262 21013 4 , 1997 .

[13]  Ioannis Pitas,et al.  Facial Expression Recognition in Image Sequences Using Geometric Deformation Features and Support Vector Machines , 2007, IEEE Transactions on Image Processing.

[14]  Patrik O. Hoyer,et al.  Modeling Receptive Fields with Non-Negative Sparse Coding , 2002, Neurocomputing.

[15]  P. Ekman Emotion in the human face , 1982 .

[16]  Ioannis Pitas,et al.  Application of non-negative and local non negative matrix factorization to facial expression recognition , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[17]  I. Pitas,et al.  A new sparse image representation algorithm applied to facial expression recognition , 2004, Proceedings of the 2004 14th IEEE Signal Processing Society Workshop Machine Learning for Signal Processing, 2004..

[18]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[19]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[20]  Zihan Zhou,et al.  Demo: Robust face recognition via sparse representation , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[21]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.