3D motion estimation of video objects using a priori data and 2D apparent motion

This paper addresses the estimation of the three-dimensional motion of a speaker appearing in a real video sequence. It relies on a new head-tracking method in which the head position is modeled by a parametric elliptic contour, fitted by minimizing a suitable objective function. The ellipse is tracked with a Kalman-based approach, and its successive positions give the head's apparent (2D) motion. Finally, the three-dimensional translational motion is determined from the estimated apparent displacement and a priori knowledge of the head size.
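As an illustration of the last step, the following minimal sketch recovers a 3D translation from two tracked ellipses. It is not the paper's formulation, which the abstract does not give. It assumes a pinhole camera with a known focal length in pixels, image coordinates measured from the principal point, and a known physical head width. The names `Ellipse`, `head_position`, `translation`, `focal_px` and `head_width_m` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Ellipse:
    """Tracked head ellipse (hypothetical container, not from the paper)."""
    cx: float        # centre x in pixels, relative to the principal point
    cy: float        # centre y in pixels, relative to the principal point
    width_px: float  # apparent head width (full minor axis) in pixels


def head_position(e: Ellipse, focal_px: float, head_width_m: float):
    """Back-project the ellipse centre to 3D under a pinhole model.

    Assumption: the apparent width scales as focal_px * head_width_m / depth,
    so the known head width fixes the depth.
    """
    z = focal_px * head_width_m / e.width_px
    x = e.cx * z / focal_px
    y = e.cy * z / focal_px
    return (x, y, z)


def translation(prev: Ellipse, curr: Ellipse, focal_px: float, head_width_m: float):
    """3D translation of the head between two frames, in metres."""
    p0 = head_position(prev, focal_px, head_width_m)
    p1 = head_position(curr, focal_px, head_width_m)
    return tuple(b - a for a, b in zip(p0, p1))
```

In this sketch, a growing ellipse is read as motion toward the camera, while a shift of the ellipse centre at constant size is read as motion parallel to the image plane. Head rotation, which also changes the apparent width, is ignored.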
