Power spectral density based channel equalization of large speech database for concatenative TTS system
暂无分享,去创建一个
Yu Shi | Hu Peng | Min Chu | Eric Chang
[1] José Carlos Príncipe,et al. Nonlinear dynamic modeling of the voiced excitation for improved speech synthesis , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[2] Michael W. Macon,et al. Spectral modification for concatenative speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[3] P. Welch. The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .
[4] Hu Peng,et al. Selecting non-uniform units from a very large corpus for concatenative speech synthesizer , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[5] Yannis Stylianou. Assessment and correction of voice quality variabilities in large speech databases for concatenative speech synthesis , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).