Paper information - Parameterisation of 3D speech lip movements

Parameterisation of 3D speech lip movements

In this paper we describe a parameterisation of lip movements that preserves the dynamic structure inherent in the task of producing speech sounds. A stereo capture system is used to reconstruct 3D models of a speaker producing sentences from the TIMIT corpus. This data is mapped into a space that maintains the relationships between samples and their temporal derivatives. By incorporating dynamic information within the parameterisation of lip movements, we can model both the cyclical structure and the causal nature of speech movements, as described by an underlying visual speech manifold. It is believed that such a structure will be appropriate to various areas of speech modelling, in particular the synthesis of speech lip movements.
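The abstract does not say how the mapping is built, so the following is only a minimal sketch of the general idea of embedding samples together with their temporal derivatives. It assumes each captured frame is flattened into one row of an array, estimates derivatives by finite differences, and uses plain PCA as a stand-in projection; the paper's actual embedding may well differ. The function name `embed_with_derivatives`, the parameters `fps` and `deriv_weight`, and the synthetic data are all hypothetical.

```python
import numpy as np


def embed_with_derivatives(frames, fps=60.0, n_components=3, deriv_weight=1.0):
    """Project frames plus their temporal derivatives onto a low-dimensional space.

    frames: (T, D) array, one flattened lip shape per row (assumed layout).
    Returns the (T, n_components) coordinates and the (n_components, 2*D) basis.
    """
    frames = np.asarray(frames, dtype=float)
    # First temporal derivative by central differences (one-sided at the ends).
    velocity = np.gradient(frames, 1.0 / fps, axis=0)
    # Augment every sample with its weighted derivative so that the embedding
    # sees both where the lips are and how they are moving.
    augmented = np.hstack([frames, deriv_weight * velocity])
    centred = augmented - augmented.mean(axis=0)
    # PCA via SVD: rows of vt are orthonormal principal directions.
    # This linear projection is an illustrative substitute, not the paper's method.
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    basis = vt[:n_components]
    return centred @ basis.T, basis


# Synthetic stand-in for captured data: two coordinates oscillating at 1 Hz.
t = np.linspace(0.0, 2.0, 120, endpoint=False)
demo_frames = np.stack([np.sin(2 * np.pi * t), 0.5 * np.sin(2 * np.pi * t)], axis=1)
coords, basis = embed_with_derivatives(demo_frames, fps=60.0, n_components=2)
```

For a periodic signal like this one, position and velocity are out of phase, so the embedded trajectory traces a loop rather than retracing a line. That is one simple way to picture the cyclical structure the abstract refers to.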

Adrian Hilton | James D. Edge | Philip J. B. Jackson

[1] Barry-John Theobald, et al. A real-time speech-driven talking head using active appearance models, 2007, AVSP.

[2] Timothy F. Cootes, et al. Active Appearance Models, 2001, IEEE Trans. Pattern Anal. Mach. Intell.

[3] Michael M. Cohen, et al. Modeling Coarticulation in Synthetic Visual Speech, 1993.

[4] Sam T. Roweis, et al. EM Algorithms for PCA and SPCA, 1997, NIPS.

[5] S. T. Roweis, et al. Nonlinear dimensionality reduction by locally linear embedding, 2000, Science.

[6] Frédéric H. Pighin, et al. Unsupervised learning for speech motion editing, 2003, SCA '03.

[7] J. Tenenbaum, et al. A global geometric framework for nonlinear dimensionality reduction, 2000, Science.

[8] Nadia Magnenat-Thalmann, et al. Visyllable Based Speech Animation, 2003, Comput. Graph. Forum.

[9] Aaron Hertzmann, et al. Style-based inverse kinematics, 2004, ACM Trans. Graph.

[10] David J. Fleet, et al. Gaussian Process Dynamical Models, 2005, NIPS.

[11] Thomas Vetter, et al. A morphable model for the synthesis of 3D faces, 1999, SIGGRAPH.

[12] Aaron Hertzmann, et al. Style-based inverse kinematics, 2004, SIGGRAPH 2004.

[13] J. Gower. Some distance properties of latent root and vector methods used in multivariate analysis, 1966.