Iterative unit selection with unnatural prosody detection
[1] Yong Zhao, et al. Measuring Target Cost in Unit Selection with KL-Divergence Between Context-Dependent HMMs, 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[2] Keikichi Hirose, et al. Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and television announcers, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Justin Fackrell, et al. The application of interactive speech unit selection in TTS systems, 2003, INTERSPEECH.
[4] Robert E. Donovan, et al. The IBM trainable speech synthesis system, 1998, ICSLP.
[5] Alan W. Black, et al. Unit selection in a concatenative speech synthesis system using a large speech database, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[6] Yong Zhao, et al. Microsoft Mulan - a bilingual TTS system, 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings (ICASSP '03).
[7] Sin-Horng Chen, et al. An RNN-based prosodic information synthesizer for Mandarin text-to-speech, 1998, IEEE Trans. Speech Audio Process.
[8] Yong Zhao, et al. Modeling stylized invariance and local variability of prosody in text-to-speech synthesis, 2006, Speech Commun.
[9] Yannis Stylianou, et al. Perceptual and objective detection of discontinuities in concatenative speech synthesis, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings.