Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone
Ryuki Tachibana | Masafumi Nishimura | Noboru Babaguchi | Gakuto Kurata | Tohru Nagano
[1] Mahesh Viswanathan, et al. Recent improvements to the IBM trainable speech synthesis system, 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
[2] Philip C. Woodland, et al. Improvements in an HMM-based speech synthesiser, 1995, EUROSPEECH.
[3] Ryuki Tachibana, et al. Automatic Accent Labeling for a Text-to-Speech System, 2007.
[4] Mark Hasegawa-Johnson, et al. An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model, 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[5] Alex Acero, et al. Recent improvements on Microsoft's trainable text-to-speech system-Whistler, 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[6] Masafumi Nishimura, et al. A stochastic approach to phoneme and accent estimation, 2005, INTERSPEECH.
[7] Alan W. Black, et al. Impact of durational outlier removal from unit selection catalogs, 2004, SSW.
[8] Jordi Adell, et al. Database Pruning for Unsupervised Building of Text-To-Speech Voices, 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.