EM enhancement of 3D head pose estimated by point at infinity

Head pose estimation is a key task for visual surveillance, HCI and face recognition applications. In this paper, a new approach is proposed for estimating 3D head pose from a monocular image. The approach assumes the full perspective projection camera model. Our approach employs general prior knowledge of face structure and the corresponding geometrical constraints provided by the location of a certain vanishing point to determine the pose of human faces. To achieve this, eye-lines, formed from the far and near eye corners, and mouth-line of the mouth corners are assumed parallel in 3D space. Then the vanishing point of these parallel lines found by the intersection of the eye-line and mouth-line in the image can be used to infer the 3D orientation and location of the human face. In order to deal with the variance of the facial model parameters, e.g. ratio between the eye-line and the mouth-line, an EM framework is applied to update the parameters. We first compute the 3D pose using some initially learnt parameters (such as ratio and length) and then adapt the parameters statistically for individual persons and their facial expressions by minimizing the residual errors between the projection of the model features points and the actual features on the image. In doing so, we assume every facial feature point can be associated to each of features points in 3D model with some a posteriori probability. The expectation step of the EM algorithm provides an iterative framework for computing the a posterori probabilities using Gaussian mixtures defined over the parameters. The robustness analysis of the algorithm on synthetic data and some real images with known ground-truth are included.

[1]  Aaron Hertzmann,et al.  Learning Non-Rigid 3D Shape from 2D Motion , 2003, NIPS.

[2]  Tomaso A. Poggio,et al.  Hierarchical morphable models , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[3]  Fadi Dornaika,et al.  Object Pose: The Link between Weak Perspective, Paraperspective, and Full Perspective , 1997, International Journal of Computer Vision.

[4]  Roberto Cipolla,et al.  Estimating gaze from a single view of a face , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[5]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[6]  Jian-Gang Wang,et al.  Frontal-view face detection and facial feature extraction using color and morphological operations , 1999, Pattern Recognit. Lett..

[7]  William M. Wells,et al.  Statistical Approaches to Feature-Based Object Recognition , 2004, International Journal of Computer Vision.

[8]  Jian-Gang Wang,et al.  Morphology-based front-view facial contour detection , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.

[9]  Alex Pentland,et al.  LAFTER: lips and face real time tracker , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Philip David,et al.  SoftPOSIT: Simultaneous Pose and Correspondence Determination , 2002, ECCV.

[11]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[12]  Edwin R. Hancock,et al.  Estimating 3D Facial Pose using the EM Algorithm , 1998 .

[13]  David Salesin,et al.  Resynthesizing facial animation through 3D model-based tracking , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[14]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[15]  Shaogang Gong,et al.  Learning to Associate Faces across Views in Vector Space of Similarities to Prototypes , 1998, BMVC.

[16]  Agnès Desolneux,et al.  Vanishing Point Detection without Any A Priori Information , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Jake K. Aggarwal,et al.  Determining vanishing points from perspective images , 1984, Comput. Vis. Graph. Image Process..

[18]  Roberto Cipolla,et al.  Fast visual tracking by temporal consensus , 1996, Image Vis. Comput..

[19]  Andrew C. Gallagher A ground truth based vanishing point detection algorithm , 2002, Pattern Recognit..

[20]  Wen-Hsiang Tsai,et al.  Camera Calibration by Vanishing Lines for 3-D Computer Vision , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  J. G. Semple,et al.  Algebraic Projective Geometry , 1953 .

[22]  Vicki Bruce,et al.  Face Recognition: From Theory to Applications , 1999 .

[23]  Larry S. Davis,et al.  Computing 3-D head orientation from a monocular image sequence , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[24]  Alex Pentland,et al.  Parametrized structure from motion for 3D adaptive feedback tracking of faces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Isaac Weiss,et al.  Model-Based Recognition of 3D Objects from Single Images , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[27]  Timothy F. Cootes,et al.  Interpreting face images using active appearance models , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[28]  Thomas S. Huang,et al.  Capturing subtle facial motions in 3D face tracking , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[30]  Trevor Darrell,et al.  Pose estimation using 3D view-based eigenspaces , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[31]  Stephen M. Smith,et al.  SUSAN—A New Approach to Low Level Image Processing , 1997, International Journal of Computer Vision.

[32]  Andrew Zisserman,et al.  Multiple view geometry in computer visiond , 2001 .

[33]  Kenichi Kanatani,et al.  Geometric computation for machine vision , 1993 .

[34]  Larry S. Davis,et al.  Iterative Pose Estimation Using Coplanar Feature Points , 1996, Comput. Vis. Image Underst..

[35]  Pietro Parodi,et al.  3D Shape Reconstruction by Using Vanishing Points , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Robert M. Haralick,et al.  Determining camera parameters from the perspective projection of a rectangle , 1989, Pattern Recognit..