Paper information - MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters

In this paper, we illustrate the use of the MAGE performative speech synthesizer by applying it to the conversion of facial features, measured in real time with FaceOSC, into speech synthesis parameters such as vocal tract shape or intonation. MAGE is a new software library for using HMM-based speech synthesis in reactive programming environments. It uses a rewritten version of the HTS engine that computes speech audio samples on a two-label window instead of the whole sentence. This windowed computation is what makes the real-time mapping of facial attributes to synthesis parameters possible.

Thierry Dutoit | Maria Astrinaki | Nicolas D'Alessandro
