Time-Varying Image Processing for 3D Model-Based Video Coding

Publisher Summary This chapter presents a robust and novel approach to real-time feature tracking using a 3D model-based framework. Image understanding and motion analysis of video sequences are two aspects of computer vision that connect and relate to a number of applications ranging from video annotation to video coding to human computer interfaces. In most scenarios, objects are not rigid because their motion, with respect to the scene, is combined with deformations and changes in light conditions. A framework for automatic detection and tracking of moving objects in such complex conditions is very important in time-varying imagery, and would provide a tool for further analysis and understanding of the higher level aspects of the observed scene. A small set of facial features are tracked successfully over a large range of head motion. Nonrigid points are also tracked successfully after compensating for the computed global motion. The combination of the 3D model, head pose estimation, and texture mapping avoids the error accumulation problem and allows better localization of the features. Results confirm that the system is robust, accurate, and can be implemented in real-time. This framework finds applications ranging from model-based video coding and facial expression analysis to face recognition using video sequences.

[1]  A. Murat Tekalp,et al.  2-D mesh tracking for synthetic transfiguration , 1995, Proceedings., International Conference on Image Processing.

[2]  David G. Stork,et al.  Speechreading: an overview of image processing, feature extraction, sensory integration and pattern recognition techniques , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[3]  Thomas S. Huang,et al.  3D head pose computation from 2D images: templates versus features , 1995, Proceedings., International Conference on Image Processing.

[4]  Kiyoharu Aizawa,et al.  Model-based image coding advanced video coding techniques for very low bit-rate applications , 1995, Proc. IEEE.

[5]  Patrick Bouthemy,et al.  Region-Based Tracking Using Affine Motion Models in Long Image Sequences , 1994 .

[6]  Thomas S. Huang,et al.  Maximum likelihood face detection , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[7]  Alex Pentland,et al.  A vision system for observing and extracting facial action parameters , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Daniel P. Huttenlocher,et al.  Tracking non-rigid objects in complex scenes , 1993, 1993 (4th) International Conference on Computer Vision.

[9]  A. Murat Tekalp,et al.  Two-dimensional object-based coding using a content-based mesh and affine motion parameterization , 1995, Proceedings., International Conference on Image Processing.

[10]  Rama Chellappa,et al.  Dynamic feature point tracking in an image sequence , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[11]  Charles Kervrann,et al.  Robust tracking of stochastic deformable models in long image sequences , 1994, Proceedings of 1st International Conference on Image Processing.

[12]  Frederic Fol Leymarie,et al.  Tracking Deformable Objects in the Plane Using an Active Contour Model , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Yao Wang,et al.  Active mesh-a feature seeking and tracking image sequence representation scheme , 1994, IEEE Trans. Image Process..

[14]  Ishwar K. Sethi,et al.  Finding Trajectories of Feature Points in a Monocular Image Sequence , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Thomas S. Huang,et al.  Head Pose Computation for Very Low Bit-rate Video Coding , 1995, CAIP.