论文信息 - An Introduction to HMM-Based Speech Synthesis - 字舞流文

An Introduction to HMM-Based Speech Synthesis

Junichi Yamagishi | J. Yamagishi

[1] Keiichi Tokuda,et al. Speaker adaptation of pitch and spectrum for HMM-based speech synthesis , 2002, Systems and Computers in Japan.

[2] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[3] S. J. Young,et al. Tree-based state tying for high accuracy acoustic modelling , 1994 .

[4] Keiichi Tokuda,et al. Generalized cepstral analysis of speech - unified approach to LPC and cepstral method , 1990, ICSLP.

[5] Satoshi Nakamura,et al. Voice conversion through vector quantization , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[6] K. Tokuda,et al. Spectral estimation of speech by mel‐generalized cepstral analysis , 1993 .

[7] Yannis Stylianou,et al. A system for voice conversion based on probabilistic classification and a harmonic plus noise model , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[8] Biing-Hwang Juang,et al. Hidden Markov Models for Speech Recognition , 1991 .

[9] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10] Keiichi Tokuda,et al. Duration modeling for HMM-based speech synthesis , 1998, ICSLP.

[11] Takao Kobayashi,et al. Complex Chebyshev approximation for IIR digital filters using an iterative WLS technique , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[12] K. Tokuda,et al. Speech parameter generation from HMM using dynamic features , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[13] Keiichi Tokuda,et al. Speaker adaptation for HMM-based speech synthesis system using MLLR , 1998, SSW.

[14] Keiichi Tokuda,et al. An adaptive algorithm for mel-cepstral analysis of speech , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15] S. Imai,et al. Mel Log Spectrum Approximation (MLSA) filter for speech synthesis , 1983 .

[16] Keiichi Tokuda,et al. Text-to-speech synthesis with arbitrary speaker's voice from average voice , 2001, INTERSPEECH.

[17] Keiichi Tokuda,et al. Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[18] Keiichi Tokuda,et al. Speech synthesis using HMMs with dynamic features , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[19] F. Itakura,et al. A statistical method for estimation of speech spectral density and formant frequencies , 1970 .

[20] Ronald W. Schafer,et al. Digital Processing of Speech Signals , 1978 .

[21] Keiichi Tokuda,et al. Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[22] Keiichi Tokuda,et al. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.

[23] Keiichi Tokuda,et al. Hidden Markov models based on multi-space probability distribution for pitch pattern modeling , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[24] Keiichi Tokuda,et al. Multi-Space Probability Distribution HMM , 2002 .

[25] Norio Higuchi,et al. Spectral mapping for voice conversion using speaker selection and vector field smoothing , 1995, EUROSPEECH.

[26] Koichi Shinoda,et al. MDL-based context-dependent subword modeling for speech recognition , 2000 .