Custom-tailoring TTS voice font - keeping the naturalness when reducing database size
Yong Zhao | Hu Peng | Min Chu | Eric Chang
[1] Hu Peng, et al. Selecting non-uniform units from a very large corpus for concatenative speech synthesizer, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[2] Alan W. Black, et al. Unit selection in a concatenative speech synthesis system using a large speech database, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[3] Keikichi Hirose, et al. Pruning of redundant synthesis instances based on weighted vector quantization, 2001, INTERSPEECH.
[4] Alex Acero, et al. Automatic generation of synthesis units for trainable text-to-speech systems, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[5] Yong Zhao, et al. Microsoft Mulan - a bilingual TTS system, 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
[6] Hu Peng, et al. An objective measure for estimating MOS of synthesized speech, 2001, INTERSPEECH.
[7] Paul Taylor, et al. Automatically clustering similar units for unit selection in speech synthesis, 1997, EUROSPEECH.
[8] Alex Acero, et al. Whistler: a trainable text-to-speech system, 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.