Video-Based Face Recognition Using Earth Mover's Distance

In this paper, we present a novel approach of using Earth Mover's Distance for video-based face recognition. General methods can be classified into sequential approach and batch approach. Batch approach is to compute a similarity function between two videos. There are two classical batch methods. The one is to compute the angle between subspaces, and the other is to find K-L divergence between probabilistic models. This paper considers a most straightforward method of using distance for matching. We propose a metric based on an average Euclidean distance between two videos as the classifier. This metric makes use of Earth Mover's Distance (EMD) as the underlying similarity measurement between two distributions of face images. To make the algorithm more effective, dimensionality reduction is needed. Fisher's Linear Discriminant analysis (FLDA) is used for linear transformation and making each class more separable. The set of features is then compressed with a signature, which is composed of numbers of points and their corresponding weights. During matching, the distance between two signatures is computed by EMD. Experimental results demonstrate the efficiency of EMD for video-based face recognition.

[1]  Leonidas J. Guibas,et al.  Adaptive Color-Image Embeddings for Database Navigation , 1998, ACCV.

[2]  Mads Nielsen,et al.  Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.

[3]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[4]  Jiří Matas,et al.  Computer Vision - ECCV 2004 , 2004, Lecture Notes in Computer Science.

[5]  V. Rich Personal communication , 1989, Nature.

[6]  Sven J. Dickinson,et al.  Many-to-Many Feature Matching Using Spherical Coding of Directed Graphs , 2004, ECCV.

[7]  Alex Pentland,et al.  Probabilistic Visual Learning for Object Representation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[9]  Rama Chellappa,et al.  Probabilistic Human Recognition from Video , 2002, ECCV.

[10]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[11]  Trevor Darrell,et al.  Face Recognition from Long-Term Observations , 2002, ECCV.

[12]  Tsuhan Chen,et al.  Video-based face recognition using adaptive hidden Markov models , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Ken-ichi Maeda,et al.  Face recognition using temporal image sequence , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[14]  Ting-Chuen Pong,et al.  Computer Vision — ACCV'98 , 1998, Lecture Notes in Computer Science.

[15]  Marian Stewart Bartlett,et al.  Independent component representations for face recognition , 1998, Electronic Imaging.

[16]  M. Fatih Demirci,et al.  Many-to-many graph matching via metric embedding , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  David J. Kriegman,et al.  Video-based face recognition using probabilistic appearance manifolds , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[19]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[20]  Leonidas J. Guibas,et al.  The Earth Mover's Distance under transformation sets , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.