Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems
暂无分享,去创建一个
[1] Mari Ostendorf,et al. HMM topology design using maximum likelihood successive state splitting , 1997, Comput. Speech Lang..
[2] Alan W. Black,et al. CHATR: a generic speech synthesis system , 1994, COLING.
[3] Hideki Kawahara,et al. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..
[4] Heiga Zen,et al. A Hidden Semi-Markov Model-Based Speech Synthesis System , 2007, IEICE Trans. Inf. Syst..
[5] Shigeki Sagayama,et al. A successive state splitting algorithm for efficient allophone modeling , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[6] Koichi Shinoda,et al. MDL-based context-dependent subword modeling for speech recognition , 2000 .
[7] Philip C. Woodland,et al. Automatic speech synthesiser parameter estimation using HMMs , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[8] Steve Young,et al. Benchmark DARPA RM results using the HTK portable HMM toolkit , 1992 .
[9] Jj Odell,et al. The Use of Context in Large Vocabulary Speech Recognition , 1995 .
[10] Kai-Fu Lee,et al. Context-independent phonetic hidden Markov models for speaker-independent continuous speech recognition , 1990 .
[11] K. Tokuda,et al. Spectral estimation of speech by mel‐generalized cepstral analysis , 1993 .
[12] Alan W. Black,et al. Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[13] Mei-Yuh Hwang,et al. Predicting unseen triphones with senones , 1996, IEEE Trans. Speech Audio Process..
[14] Shingo Kuroiwa,et al. Tree-based clustering for gaussian mixture HMMs , 2002, Systems and Computers in Japan.
[15] Shigeru Katagiri,et al. ATR Japanese speech database as a tool of speech recognition and synthesis , 1990, Speech Commun..
[16] Keiichi Tokuda,et al. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.
[17] Mark J. F. Gales,et al. Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..
[18] Wu Chou,et al. Decision tree state tying based on penalized Bayesian information criterion , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[19] B. Juang,et al. Context-dependent Phonetic Hidden Markov Models for Speaker-independent Continuous Speech Recognition , 2008 .
[20] Keiichi Tokuda,et al. A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis , 2007, IEICE Trans. Inf. Syst..