Speech Unit Selection Based on Matching Pursuit
暂无分享,去创建一个
This paper introduces a new method based on Matching Pursuit for speech unit selection. We used the Matching Pursuit transform parameters with a comparison algorithm to find the best match for a selected unit in a Text-To-Speech system based on concatenation. We chose Gabor atoms. Also Wigner-Ville distribution implemented for the time-frequency presentation of the transform and we used image processing approach to compare these time-frequency presentations of the acoustic units. On a database of 42 units 92% accuracy was obtained.
[1] Michael W. Macon,et al. A perceptual evaluation of distance measures for concatenative speech synthesis , 1998, ICSLP.
[2] Raymond N. J. Veldhuis,et al. Reducing audible spectral discontinuities , 2001, IEEE Trans. Speech Audio Process..
[3] Hisashi Kawai,et al. Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC , 2002, INTERSPEECH.
[4] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..