HMM-Based singing voice synthesis and its application to Japanese and English
暂无分享,去创建一个
Yoshihiko Nankaku | Keiichi Tokuda | Kazuhiro Nakamura | Keiichiro Oura | Keiichiro Oura | K. Tokuda | Yoshihiko Nankaku | Kazuhiro Nakamura
[1] K. Shichiri. Eigenvoice for HMM-based speech synthesis , 2002 .
[2] Naonori Ueda,et al. Deterministic annealing EM algorithm , 1998, Neural Networks.
[3] K. Tokuda,et al. Speech parameter generation from HMM using dynamic features , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[4] Hideki Kenmochi,et al. VOCALOID - commercial singing synthesizer based on sample concatenation , 2007, INTERSPEECH.
[5] Yoshihiko Nankaku,et al. Pitch adaptive training for hmm-based singing voice synthesis , 2014, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Keiichi Tokuda,et al. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.
[7] Heiga Zen,et al. An HMM-based singing voice synthesis system , 2006, INTERSPEECH.
[8] Heiga Zen,et al. Hidden Semi-Markov Model Based Speech Synthesis System , 2006 .
[9] Keiichi Tokuda,et al. Speech synthesis using HMMs with dynamic features , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[10] Keiichi Tokuda,et al. Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[11] Satoshi Imai,et al. Cepstral analysis synthesis on the mel frequency scale , 1983, ICASSP.
[12] Keiichi Tokuda,et al. Hidden Markov models based on multi-space probability distribution for pitch pattern modeling , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[13] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..
[14] Keiichi Tokuda,et al. Speaker interpolation in HMM-based speech synthesis system , 1997, EUROSPEECH.
[15] Koichi Shinoda,et al. MDL-based context-dependent subword modeling for speech recognition , 2000 .