Near-videorealistic synthetic talking faces: implementation and evaluation
Gavin C. Cawley | Barry-John Theobald | Iain A. Matthews | J. Andrew Bangham
[1] B. Barsky,et al. An Introduction to Splines for Use in Computer Graphics and Geometric Modeling , 1987 .
[2] Matthew Brand,et al. Voice puppetry , 1999, SIGGRAPH.
[3] D. Stork,et al. Speechreading by Man and Machine: Models, Systems, and Applications , 1996 .
[4] Christoph Bregler,et al. Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.
[5] C. Benoit,et al. On the assessment of synthetic speech , 1992 .
[6] Hans Peter Graf,et al. Sample-based synthesis of photo-realistic talking heads , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).
[7] Frédéric H. Pighin,et al. Synthesizing realistic facial expressions from photographs , 1998, SIGGRAPH Courses.
[8] Frederic I. Parke,et al. A parametric model for human faces , 1974 .
[9] P. Kricos. Differences in Visual Intelligibility Across Talkers , 1996 .
[10] Gavin C. Cawley,et al. Near-videorealistic synthetic visual speech using non-rigid appearance models , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
[11] Timothy F. Cootes,et al. Active Appearance Models , 1998, ECCV.
[12] Gérard Bailly,et al. Audiovisual Speech Synthesis , 2003, Int. J. Speech Technol.
[13] Levent M. Arslan,et al. 3-D Face Point Trajectory Synthesis Using An Automatically Derived Visual Phoneme Similarity Matrix , 1998, AVSP.
[14] D. Massaro,et al. Perceiving Talking Faces , 1995 .
[15] Tony Ezzat,et al. Trainable videorealistic speech animation , 2002, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings.
[16] Hans Peter Graf,et al. Triphone based unit selection for concatenative visual speech synthesis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[17] G. Meek. Mathematical statistics with applications , 1973 .
[18] Steve Young,et al. The HTK book , 1995 .
[19] Simon Baker,et al. Equivalence and efficiency of image alignment algorithms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[20] E. Owens,et al. Visemes observed by hearing-impaired and normal-hearing adult viewers , 1985, Journal of speech and hearing research.
[21] S. Lesner. Differences in visual intelligibility across talkers , 1982 .
[22] Gérard Bailly,et al. Talking Machines: Theories, Models, and Designs , 1992 .
[23] H. McGurk,et al. Hearing lips and seeing voices , 1976, Nature.
[24] Keith Waters,et al. A muscle model for animating three-dimensional facial expression , 1987, SIGGRAPH.