On adaptive vector transform quantization for speech coding

Adaptive vector transform quantization (AVTQ) as a coding system is discussed. The optimal bit assignment is derived based on vector quantization asymptotic theory for different PDFs (probability density functions) of the transform coefficients. Strategies for shaping the quantization noise spectrum and for adapting the bit assignment to the changes in the speech statistics are discussed. A good estimate of the efficiency of any coding system is given by the system coding gain over scalar PCM (pulse code modulation). Based on the optimal bit allocation, the coding gain of the vector transform quantization (VTQ) system operating on a stationary input signal is derived. The VTQ coding gain demonstrates a significant advantage of vector quantization over scalar quantization within the framework of transform coding. System simulation results are presented for a first-order Gauss-Markov process and for typical speech waveforms. The results of fixed and adaptive systems are compared for speech input. Also, the AVTQ results are compared to known scalar speech coding systems. >

[1]  P. Schultheiss,et al.  Block Quantization of Correlated Gaussian Random Variables , 1963 .

[2]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[3]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[4]  Ning He,et al.  A frequency domain waveform speech compression system based on product vector quantizers , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Robert M. Gray,et al.  Finite-state vector quantization for waveform coding , 1985, IEEE Trans. Inf. Theory.

[6]  P. Noll,et al.  Adaptive transform coding of speech signals , 1977 .

[7]  Adrian Segall Bit allocation and encoding for vector sources , 1976, IEEE Trans. Inf. Theory.

[8]  Barry J. Bunin Rate-distortion functions for Gaussian Markov processes , 1969 .

[9]  Huseyin Abut,et al.  Low-rate speech encoding using vector quantization and subband coding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Tor A. Ramstad,et al.  Fully vector-quantized subband coding with adaptive codebook allocation , 1984, ICASSP.

[11]  Kiyoharu Aizawa,et al.  Adaptive discrete cosine transform coding with vector quantization for color images , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Jack May,et al.  Fourier Transform Vector Quantization for Speech Coding , 1987, IEEE Trans. Commun..

[13]  Allen Gersho,et al.  Vector Predictive Coding of Speech at 16 kbits/s , 1985, IEEE Trans. Commun..

[14]  Paul A. Wintz,et al.  Waveform error control in PCM telemetry , 1968, IEEE Trans. Inf. Theory.

[15]  Ronald E. Crochiere,et al.  Frequency domain coding of speech , 1979 .

[16]  V. Cuperman,et al.  Vector quantization: A pattern-matching technique for speech coding , 1983, IEEE Communications Magazine.

[17]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.