Automatic fitting and tracking of facial features in head-and-shoulders sequences

Model-based video coding requires the application of both image processing and machine vision techniques for proper fitting of the semantic model and its subsequent tracking throughout the rest of the sequence of a certain type (e.g. 'head-and-shoulders' or 'head-only'). A method of automatic semantic wire-frame fitting and tracking based on principal component analysis using an independent reference data-base of facial images is presented. The method has been tested on widely used 'head-and-shoulders' video sequences with very good results. It was possible to accurately retrieve the position of the desired facial features in all cases. The position of the facial features in initial frames was subsequently used in automatic tracking. Experimental results are also presented.

[1]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[2]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[3]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[4]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[5]  Arnaud E. Jacquin,et al.  Image coding based on a fractal theory of iterated contractive image transformations , 1992, IEEE Trans. Image Process..

[6]  Allen Gersho,et al.  On the structure of vector quantizers , 1982, IEEE Trans. Inf. Theory.

[7]  Peter M. Grant,et al.  Facial features motion analysis for wire-frame tracking in model-based moving image coding , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Robert Forchheimer,et al.  Image coding-from waveforms in animation , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9]  Peter Grant,et al.  Facial feature model fitting in semantic-based scene analysis , 1997 .