Robust Head Pose Estimation Using Textured Polygonal Model with Local Correlation Measure

This paper presents a robust head pose estimation algorithm. In contrast with other approaches, the proposed algorithm adopts a textured polygonal model generated from two orthogonal views for accurate head pose estimation. To achieve robust estimation under varying illumination, the local correlation coefficient is used as the similarity measure. Tracking is further improved by modeling head dynamics with a Kalman filter. Preliminary simulation results indicate that the proposed algorithm can reliably estimate the head pose under large rotation angles with varying illumination, and that the average estimation errors are all below 4 degrees.
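To illustrate why a local correlation coefficient tolerates illumination changes, the following is a minimal sketch, not the paper's implementation. It assumes grayscale images stored as NumPy arrays and averages the normalized correlation over non-overlapping blocks; the function name `local_correlation`, the block size, and the averaging rule are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def local_correlation(a, b, block=8, eps=1e-8):
    """Mean of per-block correlation coefficients between two grayscale images.

    Hypothetical helper: block size and averaging are illustrative choices.
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    if a.shape != b.shape:
        raise ValueError("images must have the same shape")
    h, w = a.shape
    scores = []
    for y in range(0, h - block + 1, block):
        for x in range(0, w - block + 1, block):
            pa = a[y:y + block, x:x + block]
            pb = b[y:y + block, x:x + block]
            # Subtracting the block mean removes a local brightness offset.
            da = pa - pa.mean()
            db = pb - pb.mean()
            # Dividing by the norms removes a local (positive) contrast gain.
            denom = np.sqrt((da * da).sum() * (db * db).sum())
            if denom < eps:
                continue  # skip textureless blocks, where correlation is undefined
            scores.append((da * db).sum() / denom)
    return float(np.mean(scores)) if scores else 0.0
```

Because each block is normalized separately, a brightness offset and a positive contrast gain that are roughly constant within a block leave the score unchanged, whereas a plain sum of squared differences would not be. In a model-based tracker of the kind described above, such a score would be evaluated between the rendered textured model and the input frame for each candidate pose.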
