论文信息 - Shape-gain matrix quantizers for LPC speech

Shape-gain matrix quantizers for LPC speech

It has been recently demonstrated that the principles of vector quantization for LPC speech can be simply extended to encompass matrices of LPC vectors with significant savings in bit rate. Unfortunately, however, such locally optimal matrix quantizers have prohibitively high complexity and memory requirements when implemented in a speech vocoder at bit rates giving acceptable quality speech. One approach to solving the problem is to separately code gain and shape in the matrix quantizer. This paper generalizes the principles of shape-gain vector quantizer design for LPC speech to matrix quantization and investigates the properties of the resulting quantizers. In particular, we present a design which combines shape matrices consisting of N shape vectors with K-dimensional gain vectors, where N and K are small integers, in practice, with K \geq N . Experimental results show that with K, N \geq 3 , significant reductions in bit rate over locally optimal vector quantizers are obtained for comparable performance. Simulations indicate that a shape-gain matrix quantizer, using a 10 bit shape codebook and an 8 bit codebook with K = N = 3 operating at 6 bits/frame for the LPC model, gives speech quality comparable to a locally optimal vector quantizer at 9 bits/frame. The matrix quantizer has somewhat greater than 5.7 times the memory requirement of the above vector quantizer, but less than 2.1 times the complexity. Subjective tests show that the speech from this matrix quantizer is intelligible to native speakers of English.

Robert M. Gray | Chieh Tsao

[1] Robert M. Gray,et al. Rate-distortion speech coding with a minimum discrimination information distortion measure , 1981, IEEE Trans. Inf. Theory.

[2] Richard M. Schwartz,et al. A segment vocoder at 150 b/s , 1983, ICASSP.

[3] John E. Markel,et al. Linear Prediction of Speech , 1976, Communication and Cybernetics.

[4] Biing-Hwang Juang,et al. An 800 bit/s vector quantization LPC vocoder , 1982 .

[5] D. Burton. Applying matrix quantization to isolated word recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] R. Gallager. Information Theory and Reliable Communication , 1968 .

[7] Robert M. Gray,et al. An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[8] R. Gray,et al. Distortion measures for speech processing , 1980 .

[9] Robert M. Gray,et al. Global convergence and empirical consistency of the generalized Lloyd algorithm , 1986, IEEE Trans. Inf. Theory.

[10] Robert M. Gray,et al. Matrix quantizer design for LPC speech using the generalized Llyod algorithm , 1985, IEEE Trans. Acoust. Speech Signal Process..

[11] S. Roucos,et al. Segment quantization for very-low-rate speech coding , 1982, ICASSP.

[12] Robert M. Gray,et al. Locally Optimal Block Quantizer Design , 1980, Inf. Control..

[13] Robert M. Gray,et al. An Algorithm for the Design of Labeled-Transition Finite-State Vector Quantizers , 1985, IEEE Trans. Commun..

[14] R. Gray,et al. Product code vector quantizers for waveform and voice coding , 1984 .

[15] R. Gray,et al. Speech coding based upon vector quantization , 1980, ICASSP.

[16] D. Wong,et al. Very low data rate speech compression with LPC vector and matrix quantization , 1983, ICASSP.

[17] Robert M. Gray,et al. Finite-state vector quantization for waveform coding , 1985, IEEE Trans. Inf. Theory.

[18] Richard M. Schwartz,et al. A variable-order Markov chain for coding of speech spectra , 1982, ICASSP.