Paper Information - Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis

The paper deals with the process of designing a phonetically and prosodically rich speech corpus for unit selection speech synthesis. Attention is given mainly to the recording and verification stages of the process. To ensure the highest possible quality and consistency of the recordings, a special recording environment consisting of recording session management and a "pluggable" chain of checking modules was designed and utilised. Other stages, namely text collection (including both phonetically and prosodically balanced sentence selection) and careful annotation on both the orthographic and phonetic levels, are also mentioned.

Daniel Tihelka | Jindrich Matousek | Jan Romportl

[1] Daniel Tihelka, et al. Corpus recording and checking on the recorded data, 2007.

[2] Mark Liberman, et al. Transcriber: Development and use of a tool for assisting speech corpora production, 2001, Speech Commun.

[3] Jindrich Matousek, et al. Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis, 2007, TSD.

[4] Jindrich Matousek, et al. On building phonetically and prosodically rich speech corpus for text-to-speech synthesis, 2006, Computational Intelligence.

[5] Jindrich Matousek, et al. Design of speech corpus for text-to-speech synthesis, 2001, INTERSPEECH.

[6] Jan Romportl. Structural Data-Driven Prosody Model for TTS Synthesis, 2006.

[7] Daniel Tihelka, et al. Current State of Czech Text-to-Speech System ARTIC, 2006, TSD.