Paper information - Combining non-uniform unit selection with diphone based synthesis

Combining non-uniform unit selection with diphone based synthesis

This paper describes the unit selection algorithm of a speech synthesis system. The algorithm selects the k-best paths over units from a relational unit database and uses words and diphones as its basic unit types. It is part of a customisable text-to-speech system designed for generating new prompts from a recorded speech corpus, in which the user can interactively optimise the results of the unit selection algorithm. The algorithm combines the advantages of non-uniform unit selection algorithms and diphone-inventory-based speech synthesis.
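As an illustration of the idea in the abstract, the following is a minimal sketch of k-best path selection over a lattice that mixes word units and diphone units. It is not the authors' implementation: it uses a simple list-Viterbi style dynamic programme rather than any algorithm named in the paper, and the `Unit` class, the `k_best_paths` function, the cost values, and the constant join cost of 1.0 are all hypothetical assumptions made for this example.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """Hypothetical candidate unit covering diphone slots [start, end)."""
    start: int
    end: int
    label: str
    cost: float  # assumed per-unit (target) cost


def k_best_paths(units, n_slots, k, join_cost=lambda prev, unit: 1.0):
    """Return up to k cheapest unit sequences covering slots 0..n_slots.

    Each result is a (total_cost, tuple_of_units) pair, cheapest first.
    The join cost is assumed to depend only on adjacent units.
    """
    best = {}  # unit -> up to k cheapest (cost, path) pairs ending in that unit
    # Sorting by start guarantees every predecessor is processed first,
    # because a predecessor ends where this unit starts and starts earlier.
    for unit in sorted(units, key=lambda u: u.start):
        if unit.start == 0:
            candidates = [(unit.cost, (unit,))]
        else:
            candidates = [
                (cost + join_cost(path[-1], unit) + unit.cost, path + (unit,))
                for prev, paths in best.items()
                if prev.end == unit.start
                for cost, path in paths
            ]
        best[unit] = heapq.nsmallest(k, candidates, key=lambda cp: cp[0])
    finals = [cp for u, paths in best.items() if u.end == n_slots for cp in paths]
    return heapq.nsmallest(k, finals, key=lambda cp: cp[0])


# Toy lattice: three diphone slots, one word unit spanning the first two.
units = [
    Unit(0, 1, "d0", 1.0),
    Unit(1, 2, "d1", 1.0),
    Unit(1, 2, "d1_alt", 1.5),
    Unit(2, 3, "d2", 1.0),
    Unit(0, 2, "word", 0.5),
]
results = k_best_paths(units, n_slots=3, k=2)
for total, path in results:
    print(total, [u.label for u in path])
```

In this toy setup the longer word unit is preferred because it needs fewer concatenation points, while the all-diphone path remains available as a fallback. Returning several ranked paths rather than a single one is what would let a user choose among alternatives interactively.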

Erhard Rank | Michael Pucher | Friedrich Neubarth | Georg Niklfeld | Qi Guan

[1] David Eppstein et al. Finding the k Shortest Paths, 1999, SIAM J. Comput.

[2] Esther Klabbers et al. Creation of speech corpora for the multilingual Bonn Open Synthesis System, 2001, SSW.

[3] Paul Taylor et al. Festival Speech Synthesis System, 1998.

[4] Paul Taylor et al. Speech synthesis by phonological structure matching, 1999, EUROSPEECH.

[5] Ann K. Syrdal et al. Preselection of candidate units in a unit selection-based text-to-speech synthesis system, 2000, INTERSPEECH.

[6] Alan W. Black et al. Unit selection in a concatenative speech synthesis system using a large speech database, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7] Erhard Rank. Concatenative Speech Synthesis Using SRELP, 2002.

[8] Raymond N. J. Veldhuis et al. Reducing audible spectral discontinuities, 2001, IEEE Trans. Speech Audio Process.

[9] Mark Huckvale et al. Improvements in Speech Synthesis, 2001.