Synthetic and hybrid imaging in the HUMANOID and VIDAS projects

The research activity in natural/synthetic image processing and representation reported in this paper, initiated under the Esprit project HUMANOID and currently being continued under the ACTS project VIDAS, concerns the application of virtual-reality methodologies to interpersonal audio/video communication. The 3D videophone scene is modeled in video (the talker's face) and in audio (the talker's speech) so that natural data can be efficiently mixed with synthetic data and mapped onto deformable parameterized structures. Robust image analysis/synthesis tools are necessary to extract the visual primitives associated with the talker's face and to adapt them to suitable modeling structures (wire-frames). Image and speech analysis performed at the transmitter provides the audio/video parameters, which are encoded and then used at the receiver to synthesize the corresponding facial expressions together with synchronized lip movements.
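The transmitter/receiver parameter pipeline described above can be sketched in miniature as follows. The parameter names, value ranges, and 8-bit quantization below are illustrative assumptions, not the actual HUMANOID/VIDAS parameter set: the point is only that model-based coding transmits a few compact deformation parameters per frame rather than pixel data, and the receiver applies them to its local wire-frame model.

```python
from dataclasses import dataclass
import struct

# Hypothetical facial animation parameters; the projects' real wire-frame
# deformation parameter set is far richer than these three values.
@dataclass
class FaceParams:
    jaw_open: float      # 0.0 (closed) .. 1.0 (fully open)
    lip_stretch: float   # -1.0 (pursed) .. 1.0 (stretched)
    brow_raise: float    # -1.0 (lowered) .. 1.0 (raised)

def encode(p: FaceParams) -> bytes:
    """Transmitter side: quantize each parameter to 8 bits and pack."""
    def q(x: float, lo: float, hi: float) -> int:
        return max(0, min(255, round((x - lo) / (hi - lo) * 255)))
    return struct.pack("BBB",
                       q(p.jaw_open, 0.0, 1.0),
                       q(p.lip_stretch, -1.0, 1.0),
                       q(p.brow_raise, -1.0, 1.0))

def decode(b: bytes) -> FaceParams:
    """Receiver side: unpack and dequantize to drive the local wire-frame."""
    def dq(v: int, lo: float, hi: float) -> float:
        return lo + v / 255 * (hi - lo)
    j, l, r = struct.unpack("BBB", b)
    return FaceParams(dq(j, 0.0, 1.0), dq(l, -1.0, 1.0), dq(r, -1.0, 1.0))

# One frame of parameters costs 3 bytes instead of a full video frame.
sent = FaceParams(jaw_open=0.4, lip_stretch=0.2, brow_raise=-0.1)
received = decode(encode(sent))
```

In an actual system the decoded parameters would deform the vertices of the receiver's face model each frame, with the speech channel driving the lip parameters for synchronization.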