Creating 3D virtual heads from video sequences: a recursive approach by combining EKF and DFFD

An automatic system is presented for creating a virtual head that is compatible with the MPEG-4 facial object specification. At the initialization stage, color classification and a valley detection filter are used to locate the face and its facial definition points (FDPs). The extracted FDPs are tracked by normalized correlation, and their trajectories are fed into an extended Kalman filter (EKF) to recover the camera geometry, the facial orientation, and the depth of selected FDPs. Based on the recovered point-wise 3D structure, Dirichlet free-form deformations (DFFD) are applied to deform a generic 3D model. Once a virtual head has been created, it can be used to track FDPs through large out-of-plane rotations, and the head model is updated continuously as the depth information is refined. A complete texture map is created by mixing frontal and rotated views of the face according to the recovered face orientation.
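The tracking step can be illustrated with a minimal sketch of normalized-correlation matching. It is not the system's actual tracker: the function names, the patch size, and the `search_radius` parameter are all assumptions introduced here. The sketch searches a small window around a feature point's previous position and scores each candidate patch by zero-mean normalized correlation.

```python
import numpy as np

def normalized_correlation(patch, template):
    """Zero-mean normalized correlation of two equally sized gray patches."""
    p = patch.astype(np.float64) - patch.mean()
    t = template.astype(np.float64) - template.mean()
    denom = np.sqrt((p * p).sum() * (t * t).sum())
    # A flat patch has no texture to correlate against; treat it as no match.
    return 0.0 if denom == 0 else float((p * t).sum() / denom)

def track_point(frame, template, prev_xy, search_radius=8):
    """Find the patch center (x, y) near prev_xy that best matches template.

    Hypothetical helper: exhaustive search over a square window, skipping
    candidates whose patch would fall outside the frame.
    """
    h, w = template.shape
    half_h, half_w = h // 2, w // 2
    px, py = prev_xy
    best_score, best_xy = -np.inf, prev_xy
    for y in range(py - search_radius, py + search_radius + 1):
        for x in range(px - search_radius, px + search_radius + 1):
            top, left = y - half_h, x - half_w
            if top < 0 or left < 0:
                continue
            if top + h > frame.shape[0] or left + w > frame.shape[1]:
                continue
            candidate = frame[top:top + h, left:left + w]
            score = normalized_correlation(candidate, template)
            if score > best_score:
                best_score, best_xy = score, (x, y)
    return best_xy, best_score
```

Running `track_point` once per feature point per frame produces the 2D trajectories that a filter such as an EKF would consume as measurements. Because the score is invariant to gain and offset changes in brightness, it tolerates moderate lighting variation between frames.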
