Paper information - Speech quality improvement in TTS system using ABS/OLA sinusoidal model

Speech quality improvement in TTS system using ABS/OLA sinusoidal model

In this paper, we propose a novel unit concatenation and synthesis method using the ABS/OLA sinusoidal model. Phase succession is used in unit synthesis, assuming that the pitch onset time of the first frame in a given unit is the frame center. In unit concatenation, phase succession and interpolation of the sinusoid amplitudes over several frames around the concatenation point are used. Applying this method to a Text-to-Speech (TTS) system, we obtained speech samples that were more intelligible and natural than those produced by the conventional method.

Yung-Hwan Oh | Jae-Hyun Bae | Heo-Jin Byeon

[1] T. Quatieri, et al. Phase modelling and its application to sinusoidal transform coding, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] Mark A. Clements, et al. Speech concatenation and synthesis using an overlap-add sinusoidal model, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3] Thomas F. Quatieri, et al. Speech analysis/Synthesis based on a sinusoidal representation, 1986, IEEE Trans. Acoust. Speech Signal Process.

[4] Mark J. T. Smith, et al. Analysis-by-Synthesis/Overlap-Add Sinusoidal Modeling Applied to the Analysis and Synthesis of Musical Tones, 1992.