Near-videorealistic synthetic talking faces: implementation and evaluation

[1]  B. Barsky,et al.  An Introduction to Splines for Use in Computer Graphics and Geometric Modeling , 1987 .

[2]  Matthew Brand,et al.  Voice puppetry , 1999, SIGGRAPH.

[3]  D. Stork,et al.  Speechreading by Man and Machine: Models, Systems, and Applications , 1996 .

[4]  Christoph Bregler,et al.  Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[5]  C. Benoit,et al.  On the assessment of synthetic speech , 1992 .

[6]  Hans Peter Graf,et al.  Sample-based synthesis of photo-realistic talking heads , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).

[7]  Frédéric H. Pighin,et al.  Synthesizing realistic facial expressions from photographs , 1998, SIGGRAPH Courses.

[8]  Frederic I. Parke,et al.  A parametric model for human faces. , 1974 .

[9]  P. Kricos Differences in Visual Intelligibility Across Talkers , 1996 .

[10]  Gavin C. Cawley,et al.  Near-videorealistic synthetic visual speech using non-rigid appearance models , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[11]  Timothy F. Cootes,et al.  Active Appearance Models , 1998, ECCV.

[12]  Gérard Bailly,et al.  Audiovisual Speech Synthesis , 2003, Int. J. Speech Technol..

[13]  Levent M. Arslan,et al.  3-D Face Point Trajectory Synthesis Using An Automatically Derived Visual Phoneme Similarity Matrix , 1998, AVSP.

[14]  D. Massaro,et al.  Perceiving Talking Faces , 1995 .

[15]  Tony Ezzat,et al.  Trainable videorealistic speech animation , 2002, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[16]  Hans Peter Graf,et al.  Triphone based unit selection for concatenative visual speech synthesis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  G. Meek Mathematical statistics with applications , 1973 .

[18]  Steve Young,et al.  The HTK book , 1995 .

[19]  Simon Baker,et al.  Equivalence and efficiency of image alignment algorithms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  E. Owens,et al.  Visemes observed by hearing-impaired and normal-hearing adult viewers. , 1985, Journal of speech and hearing research.

[21]  S. Lesner Differences in visual intelligibility across talkers , 1982 .

[22]  Gérard Bailly,et al.  Talking Machines: Theories, Models, and Designs , 1992 .

[23]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[24]  Keith Waters,et al.  A muscle model for animation three-dimensional facial expression , 1987, SIGGRAPH.