Subspace learning for human head pose estimation

This paper proposes a fully automatic framework for static human head pose estimation. Given a 2D multi-view face image as input, the face region is first detected and cropped. The pose of the face is then classified into one of a set of pose categories. Based on the appearance of the face region, various subspace learning methods, including principal component analysis (PCA), linear discriminant analysis (LDA), locality preserving projection (LPP), and pose-specific subspace (PSS), are proposed for effective representation of face poses. Several factors, such as subject identity, illumination changes, and expression variations, are considered during the classification process. Experimental results on a large public database demonstrate the effectiveness of the proposed framework and recognition algorithms. Detailed performance comparisons and discussions are also provided to guide algorithm selection when designing practical face pose estimation systems for different scenarios.
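
As a rough illustration of the subspace stage of such a pipeline, the sketch below learns a PCA subspace from vectorized face crops and assigns a pose category by nearest class centroid in that subspace. It is a minimal sketch under stated assumptions, not the paper's implementation: the data are synthetic vectors standing in for cropped faces, the nearest-centroid classifier is an assumed choice, only PCA (not LDA, LPP, or PSS) is shown, and all function names are hypothetical.

```python
import numpy as np


def fit_pca(X, n_components):
    """Return the data mean and the top principal directions (as rows)."""
    mean = X.mean(axis=0)
    # SVD of the centered data; rows of vt are the principal directions.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def project(X, mean, components):
    """Project samples onto the learned subspace."""
    return (X - mean) @ components.T


def fit_centroids(Z, labels):
    """Compute one centroid per pose category in the subspace."""
    classes = np.unique(labels)
    centroids = np.stack([Z[labels == c].mean(axis=0) for c in classes])
    return classes, centroids


def predict(Z, classes, centroids):
    """Assign each sample to the pose category with the nearest centroid."""
    dists = np.linalg.norm(Z[:, None, :] - centroids[None, :, :], axis=2)
    return classes[dists.argmin(axis=1)]


def sample_faces(templates, n_per_pose, rng, noise=0.3):
    """Synthetic stand-in for cropped faces: one template per pose plus noise."""
    n_poses, dim = templates.shape
    X = np.concatenate(
        [t + noise * rng.normal(size=(n_per_pose, dim)) for t in templates]
    )
    y = np.repeat(np.arange(n_poses), n_per_pose)
    return X, y


rng = np.random.default_rng(0)
templates = rng.normal(size=(5, 64))  # 5 assumed pose categories, 64-dim vectors
X_train, y_train = sample_faces(templates, 40, rng)
X_test, y_test = sample_faces(templates, 10, rng)

mean, components = fit_pca(X_train, n_components=8)
classes, centroids = fit_centroids(project(X_train, mean, components), y_train)
pred = predict(project(X_test, mean, components), classes, centroids)
accuracy = float((pred == y_test).mean())
print(f"pose classification accuracy on synthetic data: {accuracy:.2f}")
```

With real data, `X_train` would hold the detected and cropped face regions flattened to vectors, and `fit_pca` could be swapped for another subspace method to compare representations.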
