Talking Heads

This paper describes an interactive presentation that introduces the Talking Heads website, which was originally proposed at the AVSP'97 meeting in Rhodes, Greece. Talking Heads is an effort to bring together information from a wide range of sources. The site provides interactive access to multimodal material in both its original form and as summarized by us. In addition, the authors have provided historical information, supporting essays and tutorials, interviews, etc., that try to contextualize and make coherent this rapidly developing area. Both the website and the interactive presentation are described.

[1]  Thomas Baer,et al.  An articulatory synthesizer for perceptual research , 1978 .

[2]  D J Ostry,et al.  An examination of the degrees of freedom of human jaw motion in speech and mastication. , 1997, Journal of speech, language, and hearing research : JSLHR.

[3]  Tony Ezzat,et al.  MikeTalk: a talking facial display based on morphing visemes , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).

[4]  Eric Vatikiotis-Bateson,et al.  Measuring and Modeling Speech Production , 1998 .

[5]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[6]  平山亮 会議報告-Speechreading by Humans and Machines; Models Systems and Applications , 1997 .

[7]  C. Benoît,et al.  A set of French visemes for visual speech synthesis , 1994 .

[8]  Lionel Revéret From raw images of the lips to articulatory parameters: a viseme-based prediction , 1997, EUROSPEECH.

[9]  F S COOPER,et al.  The interconversion of audible and visible patterns as a basis for research in the perception of speech. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Alex Pentland,et al.  3D modeling and tracking of human lip motions , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[11]  N. Michael Brooke Talking Heads and Speech Recognisers That Can See: The Computer Processing of Visual Speech Signals , 1996 .

[12]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[13]  D. Ostry,et al.  The equilibrium point hypothesis and its application to speech motor control. , 1996, Journal of speech and hearing research.

[14]  D. Massaro Speech Perception By Ear and Eye: A Paradigm for Psychological Inquiry , 1989 .

[15]  H. K. Dunn The Calculation of Vowel Resonances, and an Electrical Vocal Tract , 1950 .

[16]  R. Wilhelms-Tricarico Physiological modeling of speech production: methods for modeling soft-tissue articulators. , 1995, The Journal of the Acoustical Society of America.

[17]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[18]  Daniel Thalmann,et al.  Digital actors for interactive television , 1995 .

[19]  N. Magnenat-Thalmann,et al.  Synthetic actors in computer-generated 3D films , 1990 .

[20]  Frederic I. Parke,et al.  A model for human faces that allows speech synchronized animation , 1974, SIGGRAPH '74.

[21]  Parke,et al.  Parameterized Models for Facial Animation , 1982, IEEE Computer Graphics and Applications.

[22]  Dominic W. Massaro,et al.  Synthesis of visible speech , 1990 .

[23]  Daniel Thalmann,et al.  Navigation for digital actors based on synthetic vision, memory, and learning , 1995, Comput. Graph..

[24]  Q Summerfield,et al.  Use of Visual Information for Phonetic Perception , 1979, Phonetica.

[25]  D. Massaro Bimodal Speech Perception: A Progress Report , 1996 .

[26]  M H Cohen,et al.  Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. , 1992, The Journal of the Acoustical Society of America.

[27]  D. Ostry,et al.  Control of jaw orientation and position in mastication and speech. , 1994, Journal of neurophysiology.

[28]  Gérard Bailly,et al.  Talking Machines: Theories, Models, and Designs , 1992 .

[29]  E. Vatikiotis-Bateson,et al.  Kinematics-Based Synthesis of Realistic Talking Faces , 1998, AVSP.

[30]  Daniel Thalmann,et al.  Complex models for animating synthetic actors , 1991, IEEE Computer Graphics and Applications.

[31]  Hani Yehia,et al.  Quantitative association of vocal-tract and facial behavior , 1998, Speech Commun..