论文信息 - Speech spectrum representation and coding using multigrams with distance

Speech spectrum representation and coding using multigrams with distance

The multigrams allow us to split a string of symbols into a stream of variable length sequences. The direct application of this method to vector-quantized speech spectra fails, we develop an extension of the method called modified multigrams or multigrams with distance. The algorithm for modified multigram dictionary training as well as experimental results are presented. We found a significant improvement of rate/distortion ratio in comparison to vector quantization with small codebooks. For precise spectrum representation, this method is less suitable and we see its application rather in speech segmentation or in very low bit rate coding.

Gérard Chollet | Jan Cernocký | Geneviève Baudoin

[1] Philip A. Chou,et al. Variable dimension vector quantization of linear predictive coefficients of speech , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[2] Frédéric Bimbot,et al. Language modeling by variable length sequences: theoretical formulation and evaluation of multigrams , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[3] R. Pieraccini,et al. Variable-length sequence modeling: multigrams , 1995, IEEE Signal Processing Letters.