Estimating facial pose from a sparse representation [face recognition applications]

We present an approach to estimate the poses off human heads in natural scenes. The essential features for estimating the head pose are the positions of the prominent facial features relative to the position of the head. We have developed a high-dimensional, randomly sparse representation of a human face using a simplified facial feature model. The representation transforms a raw face image into a vector, representing how well the image matches a large number of randomly-posed and shaped head models. This transformation is designed to collect salient features of the face image that are useful to estimate the pose, while suppressing any irrelevant variations of face appearance. The relation between the sparse representation and the pose is learned using SVR (support vector regression). The sparse representation combined with SVR is shown to estimate the pose more quickly and accurately than SVR applied to raw images.

[1]  Gerald Sommer,et al.  Efficient Head Pose Estimation with Gabor Wavelet Networks , 2000, BMVC.

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  Shaogang Gong,et al.  Support vector regression and classification based multi-view face detection and recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[4]  Stan Z. Li,et al.  Learning Low Dimensional Invariant Signature of 3-D Object under Varying View and Illumination from 2-D Appearances , 2001, ICCV.

[5]  Azriel Rosenfeld,et al.  3D object tracking using shape-encoded particle propagation , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[6]  Shizuo Sakamoto,et al.  Geodesic illumination basis: compensating for illumination variations in any pose for face recognition , 2002, Object recognition supported by user interaction for service robots.

[7]  Azriel Rosenfeld,et al.  Optimal edge-based shape detection , 2002, IEEE Trans. Image Process..

[8]  Terence Sim,et al.  The CMU Pose, Illumination, and Expression (PIE) database , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[9]  P. Jonathon Phillips,et al.  Face recognition vendor test 2002 , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[10]  Thomas Vetter,et al.  Face Recognition Based on Fitting a 3D Morphable Model , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  P. Jonathon Phillips,et al.  Face Recognition Vendor Test 2002: Overview and Summary , 2003 .

[12]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..