Face-to-face video teleconferencing is very important for real time communication. Current teleconferencing applications use standard video codecs, such as MPEG 1/2/4, for the compression of face video. Either a high bandwidth is required for high quality video transmission, or the transmitted face video is blurred at low bitrates. We present a system for real-time coding of face video at low bit-rate. There are two main contributions. First, we improve the technique of long term memory prediction by selecting frames into the database in an optimal way. A new frame is selected into the database only when it is significantly different from those frames which are already in the database. In this way, the database can cover a wider range of images. Second, we incorporate prior knowledge about faces into the long term memory prediction framework. The prior knowledge includes: (1) facial motions are repetitive such that most of them can be reconstructed from multiple reference frames; (2) different components of the face and the background can tolerate different levels of error because of different perceptual importance. Experiments show that, at similar PSNR, the proposed system works much faster and achieves better visual quality than the standard H.264/JVT codec.
[1]
Hans Peter Graf,et al.
Photo-Realistic Talking-Heads from Image Samples
,
2000,
IEEE Trans. Multim..
[2]
Itu-T.
Video coding for low bitrate communication
,
1996
.
[3]
Kiyoharu Aizawa,et al.
MODEL-BASED VIDEO CODING
,
1996
.
[4]
Pertti Roivainen,et al.
3-D Motion Estimation in Model-Based Facial Image Coding
,
1993,
IEEE Trans. Pattern Anal. Mach. Intell..
[5]
Jörn Ostermann.
Object-based analysis-synthesis coding (OBASC) based on the source model of moving flexible 3-D objects
,
1994,
IEEE Trans. Image Process..
[6]
Kiyoharu Aizawa,et al.
Model-based image coding
,
1994,
Other Conferences.
[7]
Thomas S. Huang,et al.
Nonrigid Motion Modeling and Analysis in Video Sequences for Realistic Facial Animation
,
1999
.
[8]
Peter Eisert,et al.
Model-aided coding: a new approach to incorporate facial animation into motion-compensated video coding
,
2000,
IEEE Trans. Circuits Syst. Video Technol..