A semantic model for video based face recognition

Video-based face recognition has attracted a great deal of attention in recent years due to its wide applications. The challenge of video-based face recognition comes from several aspects. First, video data involves many frames, which increases data size and processing complexity. Second, key frames extracted from videos are usually of high intra-personal discrepancy due to variations in expressions, poses, and illuminations. In order to address these problems, we propose a novel semantic based subspace model to improve the performance of video based face recognition. The basic idea is to construct an appropriate low-dimensional subspace for each person, upon which a semantic model is built to classify the key frames of the person into specific class. After the semantic classification, the key frames belonging to the same classes, i.e. the same semantics, are used to train the linear classifiers for recognition. Extensive experiments on a large face video database (XM2VTS) clearly show that our approach obtains a significant performance improvement over the traditional approaches.

[1]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Vinod Chandran,et al.  Probabilistic Matching of Image Sets for Video-Based Face Recognition , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[3]  Zhifeng Li,et al.  Frame synchronization and multi-level subspace analysis for video based face recognition , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[4]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[5]  Matti Pietikäinen,et al.  Face Recognition with Local Binary Patterns , 2004, ECCV.

[6]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[8]  Ammad Ali,et al.  Face Recognition with Local Binary Patterns , 2012 .

[9]  Likun Huang,et al.  Face recognition based on image sets , 2014 .

[10]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[11]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[12]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[13]  Dahua Lin,et al.  Nonparametric Discriminant Analysis for Face Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Juyang Weng,et al.  Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Zhifeng Li,et al.  Audio-Guided Video-Based Face Recognition , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Rama Chellappa,et al.  Video-based face recognition via joint sparse representation , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[17]  Xiaogang Wang,et al.  A unified framework for subspace face recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  B. Moore Principal component analysis in linear systems: Controllability, observability, and model reduction , 1981 .