Affine correspondence based head pose estimation for a sequence of images by using a 3D model

This work proposes a method of determining human head poses from a sequence of images. The main idea is to use some features in a 3D head model to generate a virtual fronto-parallel projection that satisfies conditions of affine approximation. Then the affine parameters between the virtual projection and input view are calculated. After that, rotation and translation parameters of the head are roughly estimated by a circle-ellipse correspondence technique based on the affine parameters. Finally, an iterative optimization algorithm is utilized further to refine the results. The accuracy is maintained by estimating reliability of the 2D-33D feature correspondences an weighting each factor of the optimization objective function. The system performance is also improved by applying a modified KLT technique to speed up the convergence during the face feature tracking process. Experimental results show that our method can accurately recover head poses in a wide range of head motion.

[1]  Shinn-Ying Ho,et al.  An analytic solution for the pose determination of human faces from a monocular image , 1998, Pattern Recognit. Lett..

[2]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[3]  Andrew Calway,et al.  Using affine correspondence to estimate 3-D facial pose , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[4]  Thomas S. Huang,et al.  3D head pose computation from 2D images: templates versus features , 1995, Proceedings., International Conference on Image Processing.

[5]  Jing Xiao,et al.  Robust full-motion recovery of head by dynamic templates and re-registration techniques , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[6]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[7]  Qian Chen,et al.  3D head pose estimation without feature tracking , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[8]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[9]  Tieniu Tan,et al.  Head pose estimation using Gabor eigenspace modeling , 2002, Proceedings. International Conference on Image Processing.

[10]  D. Rees,et al.  Three-dimensional pose from two-dimensional images: a novel approach using synergetic networks , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[11]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Trevor Darrell,et al.  Active face tracking and pose estimation in an interactive room , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Koichiro Deguchi,et al.  Head pose determination from one image using a generic model , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[14]  Ruigang Yang,et al.  Model-based head pose tracking with stereovision , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[15]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[16]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .