论文信息 - Near-videorealistic synthetic talking faces: implementation and evaluation - 字舞流文

Near-videorealistic synthetic talking faces: implementation and evaluation

Gavin C. Cawley | Barry-John Theobald | Iain A. Matthews | J. Andrew Bangham | I. Matthews | G. Cawley | B. Theobald | J. Bangham

[1] B. Barsky,et al. An Introduction to Splines for Use in Computer Graphics and Geometric Modeling , 1987 .

[2] Matthew Brand,et al. Voice puppetry , 1999, SIGGRAPH.

[3] D. Stork,et al. Speechreading by Man and Machine: Models, Systems, and Applications , 1996 .

[4] Christoph Bregler,et al. Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[5] C. Benoit,et al. On the assessment of synthetic speech , 1992 .

[6] Hans Peter Graf,et al. Sample-based synthesis of photo-realistic talking heads , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).

[7] Frédéric H. Pighin,et al. Synthesizing realistic facial expressions from photographs , 1998, SIGGRAPH Courses.

[8] Frederic I. Parke,et al. A parametric model for human faces. , 1974 .

[9] P. Kricos. Differences in Visual Intelligibility Across Talkers , 1996 .

[10] Gavin C. Cawley,et al. Near-videorealistic synthetic visual speech using non-rigid appearance models , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[11] Timothy F. Cootes,et al. Active Appearance Models , 1998, ECCV.

[12] Gérard Bailly,et al. Audiovisual Speech Synthesis , 2003, Int. J. Speech Technol..

[13] Levent M. Arslan,et al. 3-D Face Point Trajectory Synthesis Using An Automatically Derived Visual Phoneme Similarity Matrix , 1998, AVSP.

[14] D. Massaro,et al. Perceiving Talking Faces , 1995 .

[15] Tony Ezzat,et al. Trainable videorealistic speech animation , 2002, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[16] Hans Peter Graf,et al. Triphone based unit selection for concatenative visual speech synthesis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17] G. Meek. Mathematical statistics with applications , 1973 .

[18] Steve Young,et al. The HTK book , 1995 .

[19] Simon Baker,et al. Equivalence and efficiency of image alignment algorithms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20] E. Owens,et al. Visemes observed by hearing-impaired and normal-hearing adult viewers. , 1985, Journal of speech and hearing research.

[21] S. Lesner. Differences in visual intelligibility across talkers , 1982 .

[22] Gérard Bailly,et al. Talking Machines: Theories, Models, and Designs , 1992 .

[23] H. McGurk,et al. Hearing lips and seeing voices , 1976, Nature.

[24] Keith Waters,et al. A muscle model for animation three-dimensional facial expression , 1987, SIGGRAPH.