A trial of communicative prosody generation based on control characteristic of one word utterance observed in real conversational speech
[1] Keiichi Tokuda, et al. Hidden Markov models based on multi-space probability distribution for pitch pattern modeling, 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[2] Y. Sagisaka, et al. On the prediction of global F0 shape for Japanese text-to-speech, 1990, International Conference on Acoustics, Speech, and Signal Processing.
[3] Hideki Kawahara, et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, 1999, Speech Commun.
[4] Christof Traber, et al. SVOX: the implementation of a text-to-speech system for German, 1995.
[5] M. D. Riley. Tree-based modeling of segmental durations, 1992.
[6] Yoshinori Sagisaka, et al. F0 control characterization by perceptual impressions on speaking attitudes using multiple dimensional scaling analysis, 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
[7] Keikichi Hirose, et al. Analysis of voice fundamental frequency contours for declarative sentences of Japanese, 1984.
[8] Yoshinori Sagisaka, et al. Communicative speech synthesis using constituent word attributes, 2005, INTERSPEECH.