Volume structured ordinal features with background similarity measure for video face recognition

It has been shown in different studies the benefits of using spatio-temporal information for video face recognition. However, most of the existing spatio-temporal representations do not capture the local discriminative information present in human faces. In this paper we introduce a new local spatio-temporal descriptor, based on structured ordinal features, for video face recognition. The proposed method not only encodes jointly the local spatial and temporal information, but also extracts the most discriminative facial dynamic information while trying to discard spatio-temporal features related to intra-personal variations. Besides, a similarity measure based on a set of background samples is proposed to be used with our descriptor, showing to boost its performance. Extensive experiments conducted on the recent but difficult YouTube Faces database demonstrate the good performance of our proposal, achieving state-of-the-art results.

[1]  Rama Chellappa,et al.  3D face reconstruction from video using a generic model , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[2]  A. O'Toole,et al.  Recognizing moving faces: a psychological and neural synthesis , 2002, Trends in Cognitive Sciences.

[3]  Tsuhan Chen,et al.  Eigenspace updating for non-stationary process and its application to face recognition , 2003, Pattern Recognit..

[4]  Tomaso A. Poggio,et al.  Face recognition: component-based versus global approaches , 2003, Comput. Vis. Image Underst..

[5]  Hujun Bao,et al.  Practical super-resolution from dynamic video sequences , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6]  Tieniu Tan,et al.  Face Recognition Using Ordinal Features , 2006, ICB.

[7]  Shengcai Liao,et al.  Structured Ordinal Features for Appearance-Based Object Representation , 2007, AMFG.

[8]  Shengcai Liao,et al.  Learning Multi-scale Block Local Binary Patterns for Face Recognition , 2007, ICB.

[9]  Tieniu Tan,et al.  Ordinal representations for biometrics recognition , 2007, 2007 15th European Signal Processing Conference.

[10]  Jean-Luc Dugelay,et al.  Person recognition using facial video information: A state of the art , 2009, J. Vis. Lang. Comput..

[11]  Tieniu Tan,et al.  Ordinal Measures for Iris Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Alan C. Bovik,et al.  No-reference image blur assessment using multiscale gradient , 2009, 2009 International Workshop on Quality of Multimedia Experience.

[13]  Cordelia Schmid,et al.  Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.

[14]  Tal Hassner,et al.  The One-Shot similarity kernel , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Yunhong Wang,et al.  Video-based Face Recognition: A Survey , 2009 .

[16]  Matti Pietikäinen,et al.  Combining appearance and motion for face and gender recognition from videos , 2009, Pattern Recognit..

[17]  Matti Pietikäinen,et al.  Computer Vision Using Local Binary Patterns , 2011, Computational Imaging and Vision.

[18]  Chao Wang,et al.  Video-Based Face Recognition: State of the Art , 2011, CCBR.

[19]  Brian C. Lovell,et al.  Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework , 2011, EURASIP J. Image Video Process..

[20]  Tal Hassner,et al.  Face recognition in unconstrained videos with matched background similarity , 2011, CVPR 2011.

[21]  Rama Chellappa,et al.  Dictionary-Based Face Recognition from Video , 2012, ECCV.

[22]  Patrick J. Flynn,et al.  Face Recognition from Video: a Review , 2012, Int. J. Pattern Recognit. Artif. Intell..