Prosody and the Selection of Source Units for Concatenative Synthesis
[1] N. Iwahashi, et al. Speech Segment Selection for Concatenative Synthesis Based on Spectral Distortion Minimization, 1993.
[2] K. D. Jong. The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation, 1995.
[3] S. Nakajima, et al. Automatic generation of synthesis units based on context oriented clustering, 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.
[4] Björn Lindblom, et al. Explaining Phonetic Variation: A Sketch of the H&H Theory, 1990.
[5] A. Marchal, et al. Speech production and speech modelling, 1990.
[6] Biing-Hwang Juang, et al. Fundamentals of speech recognition, 1993, Prentice Hall signal processing series.
[7] Gérard Bailly, et al. Talking Machines: Theories, Models, and Designs, 1992.
[8] W. Nick Campbell, et al. Prosodic encoding of English speech, 1992, ICSLP.
[9] Shinya Nakajima. Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering, 1994, Speech Commun.
[10] Yoshinori Sagisaka, et al. ATR μ-talk speech synthesis system, 1992, ICSLP.
[11] J. Sundberg, et al. Spectral correlates of glottal voice source waveform characteristics, 1989, Journal of speech and hearing research.