Visual analysis of lip coarticulation in VCV utterances

This paper presents an investigation of the visual variation on the bilabial plosive consonant /p/ in three coarticulation contexts. The aim is to provide detailed ensemble analysis to assist coarticulation modelling in visual speech synthesis. The underlying dynamics of labeled visual speech units, represented as lip shape, from symmetric VCV utterances, is investigated. Variation in lip dynamics is quantitively and qualitatively analyzed. This analysis shows that there are statistically significant differences in both the lip shape and trajectory during coarticulation.

[1]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Tomaso Poggio,et al.  Trainable Videorealistic Speech Animation , 2004, FGR.

[3]  Raymond G. Daniloff,et al.  On defining coarticulation , 1973 .

[4]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .

[5]  S. Ohman Numerical model of coarticulation. , 1967, The Journal of the Acoustical Society of America.

[6]  Jianwu Dang,et al.  Investigation and modeling of coarticulation during speech , 2005, INTERSPEECH.

[7]  Patricia A. Keating,et al.  Papers in Laboratory Phonology: The window model of coarticulation: articulatory evidence , 1990 .

[8]  Hans Peter Graf,et al.  Sample-based synthesis of photo-realistic talking heads , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).

[9]  A Löfqvist,et al.  Interarticulator programming in VCV sequences: lip and tongue movements. , 1999, The Journal of the Acoustical Society of America.

[10]  William L. Henke,et al.  Dynamic articulatory model of speech production using computer simulation. , 1966 .

[11]  Christoph Bregler,et al.  Video rewrite: visual speech synthesis from video , 1997, AVSP.

[12]  Nadia Magnenat-Thalmann,et al.  Visyllable Based Speech Animation , 2003, Comput. Graph. Forum.

[13]  Michael M. Cohen,et al.  Modeling Coarticulation in Synthetic Visual Speech , 1993 .

[14]  Anders Löfqvist,et al.  Speech as Audible Gestures , 1990 .

[15]  S. Öhman Coarticulation in VCV Utterances: Spectrographic Measurements , 1966 .