论文信息 - Automatic transcription of intonation using an identified prosodic alphabet

Automatic transcription of intonation using an identified prosodic alphabet

A solution is proposed for rapidly adapting prosodic models to a new voice or a new application. First, a prosodic alphabet that is supported by linguistic knowledge is identified at the acoustic level. The observation of the realisation of prosodic events on the acoustic corpus allows classes of breaks, F0 shapes and accents to be constructed and automatic transcription rules to be written. Then the transcribed corpus is used in the estimation of the parameters of a prosodic model for French. The good F0 contours and duration generated with the prosodic model verify the agreement of the identified alphabets to describe prosodic phenomena. Finally, the prosodic model is integrated in the CNET standard French Text-to-Speech Synthesis system. The quality of the generated prosody is considered by naïve listeners as equivalent to the handcrafted system. This result verifies the appropriateness of the alphabet as prosodic descriptors.

Stéphanie de Tournemire | S. D. Tournemire

[1] Olivier Boeffard Dosierre. Segmentation automatique d'unites acoustiques pour la synthese de la parole , 1993 .

[2] Stéphanie de Tournemire. Identification et génération automatique de contours prosodiques pour la synthèse vocale à partir du texte en français , 1998 .

[3] Piet Mertens,et al. L'intonation du français. De la description linguistique à la reconnaissance automatique , 1987 .

[4] Mari Ostendorf,et al. Automatic labeling of prosodic patterns , 1994, IEEE Trans. Speech Audio Process..

[5] Julia Hirschberg,et al. Automatic classification of intonational phrase boundaries , 1992 .

[6] Françoise Emerard,et al. Linguistic and prosodic processing for a text-to-speech synthesis system , 1989, EUROSPEECH.

[7] Katarina Bartkova,et al. A model of segmental duration for speech synthesis in French , 1987, Speech Commun..