Automatic head pose estimation using randomly projected dense SIFT descriptors

In this paper, we propose an automatic method for determining the head pose from a given face image. The face image is divided into a regular grid and a representation of the image is obtained by extracting dense SIFT descriptors from its grid points. Random Projection (RP) is then applied to reduce the dimension of the concatenated SIFT descriptor vector. Classification and regression using Support Vector Machine (SVM) are combined in order to obtain an accurate estimate of the head pose. The advantage of the proposed approach is that it does not require facial feature points such as eye corners, mouth corners and the nose tip to be extracted from the input face image as in many other methods. Experimental results are presented to demonstrate the effectiveness of the approach.

[1]  Jun Li,et al.  Boosting dense SIFT descriptors and shape contexts of face images for gender recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[2]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[3]  I. Jolliffe Principal Component Analysis , 2002 .

[4]  Sethuraman Panchanathan,et al.  Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  William T. Freeman,et al.  Example-based head tracking , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[6]  J. Crowley,et al.  Estimating Face orientation from Robust Detection of Salient Facial Structures , 2004 .

[7]  Shihong Lao,et al.  Sparse Bayesian Regression for Head Pose Estimation , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[8]  Matthew Turk,et al.  A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[9]  Harry Wechsler,et al.  Face pose discrimination using support vector machines (SVM) , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[10]  Jian-Gang Wang,et al.  EM enhancement of 3D head pose estimated by point at infinity , 2007, Image Vis. Comput..

[11]  Yun Fu,et al.  Head pose estimation: Classification or regression? , 2008, 2008 19th International Conference on Pattern Recognition.

[12]  Chinmay Hegde,et al.  Efficient Machine Learning Using Random Projections , 2007 .

[13]  Ying Wu,et al.  Query Driven Localized Linear Discriminant Models for Head Pose Estimation , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[14]  W. B. Johnson,et al.  Extensions of Lipschitz mappings into Hilbert space , 1984 .

[15]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[18]  Antonio Torralba,et al.  SIFT Flow: Dense Correspondence across Different Scenes , 2008, ECCV.

[19]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[20]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.