Paper information - Speaker normalization algorithms for very-low-rate speech coding

Speaker normalization algorithms for very-low-rate speech coding

We use speaker normalization to vocode the speech of a new input speaker with a speaker-dependent segment vocoder operating at a very low bit rate, below 300 b/s. The normalization consists of a spectral transformation, applied to the spectral parameter vector of the reference speaker, that should improve the match between the reference and input speakers. The optimal spectral transformation is determined by an iterative algorithm that is guaranteed to converge to a local optimum; that is, the normalization algorithm drives the quantization error of the segment vocoder to a local minimum. We demonstrate the general algorithm by deriving a linear least-squares solution for the spectral transformation. We present results on several male and female speakers.
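The iterative scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes frame-level spectral vectors rather than the segments used by the actual vocoder, an affine transform `A, b` as the linear least-squares form, and squared Euclidean distance as the quantization error. The names `quantization_error` and `fit_normalization` are hypothetical. Each pass alternates between matching input vectors to the transformed reference codebook and re-solving for the transform by least squares, so the error cannot increase from one pass to the next.

```python
import numpy as np

def quantization_error(inputs, codebook):
    """Return nearest-codeword indices and the mean squared quantization error."""
    # Squared Euclidean distance from every input vector to every codeword.
    dists = ((inputs[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = dists.argmin(axis=1)
    return nearest, dists[np.arange(len(inputs)), nearest].mean()

def fit_normalization(inputs, reference_codebook, n_iter=20):
    """Alternate nearest-codeword matching and least-squares transform updates.

    Assumed model (illustrative only): input_vector ~ A @ reference_vector + b.
    """
    dim = reference_codebook.shape[1]
    A, b = np.eye(dim), np.zeros(dim)  # start from the identity transform
    nearest, err = quantization_error(inputs, reference_codebook @ A.T + b)
    errors = [err]
    for _ in range(n_iter):
        # With the matching held fixed, solve for the affine transform that
        # maps each matched reference vector onto its input vector.
        matched = reference_codebook[nearest]
        design = np.hstack([matched, np.ones((len(matched), 1))])
        W, *_ = np.linalg.lstsq(design, inputs, rcond=None)
        A, b = W[:-1].T, W[-1]
        # With the transform held fixed, re-match inputs to the codebook.
        nearest, err = quantization_error(inputs, reference_codebook @ A.T + b)
        errors.append(err)
    return A, b, errors

# Synthetic usage example: the "input speaker" is a noisy affine image of the
# reference codebook, which is purely an assumption for demonstration.
rng = np.random.default_rng(0)
codebook = rng.normal(size=(16, 4))
true_A = np.eye(4) + 0.1 * rng.normal(size=(4, 4))
true_b = 0.2 * rng.normal(size=4)
frames = codebook[rng.integers(0, 16, size=200)] @ true_A.T + true_b
frames += 0.01 * rng.normal(size=frames.shape)
A, b, errors = fit_normalization(frames, codebook)
```

The list `errors` records the mean quantization error after each pass. Because both steps are exact minimizations over one set of variables with the other held fixed, the sequence is non-increasing, which is the usual argument for convergence to a local optimum in alternating schemes of this kind.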

Salim E. Roucos | Alexander MacLeod Wilgus

[1] Richard M. Schwartz, et al. A segment vocoder at 150 b/s, 1983, ICASSP.

[2] R. J. Golibersuch. Automatic prediction of linear frequency warp for speech recognition, 1983, ICASSP.

[3] S. Roucos, et al. Segment quantization for very-low-rate speech coding, 1982, ICASSP.