Efficient vector quantization of LPC parameters at 24 bits/frame

For low bit rate speech coding applications, it is important to quantize the LPC parameters accurately using as few bits as possible. Though vector quantizers are more efficient than scalar quantizers, their use for accurate quantization of linear predictive coding (LPC) information (using 24-26 bits/frame) is impeded by their prohibitively high complexity. A split vector quantization approach is used here to overcome the complexity problem. An LPC vector consisting of 10 line spectral frequencies (LSFs) is divided into two parts, and each part is quantized separately using vector quantization. Using the localized spectral sensitivity property of the LSF parameters, a weighted LSF distance measure is proposed. With this distance measure, it is shown that the split vector quantizer can quantize LPC information in 24 bits/frame with an average spectral distortion of 1 dB and less than 2% of the frames having spectral distortion greater than 2 dB. The effect of channel errors on the performance of this quantizer is also investigated and results are reported. >

[1]  Allen Gersho,et al.  Phonetically-based vector excitation coding of speech at 3.6 kbps , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[2]  B. S. Atal,et al.  PREDICTIVE CODING OF SPEECH USING ANALYSIS-BY-SYNTHESIS TECHNIQUES , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers, 1990..

[3]  K. K. Paliwal A perception‐based LSP distance measure for speech recognition , 1988 .

[4]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[5]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[6]  Takehiro Moriya,et al.  Speech coder using phase equalization and vector quantization , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Richard V. Cox,et al.  Spectral quantization and interpolation for CELP coders , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[8]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[9]  Joseph P. Campbell,et al.  An expandable error-protected 4800 bps CELP coder (US Federal Standard 4800 bps voice coder) , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[10]  Biing-Hwang Juang,et al.  Optimal quantization of LSP parameters , 1993, IEEE Trans. Speech Audio Process..

[11]  Allen H. Levesque,et al.  Error-control techniques for digital communication , 1985 .

[12]  Thomas P. Barnwell,et al.  A low bit rate segment vocoder based on line spectrum pairs , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Nariman Farvardin,et al.  A study of vector quantization for noisy channels , 1990, IEEE Trans. Inf. Theory.

[14]  B. S. Atal,et al.  High-quality digital speech at 4 kb/s , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.

[15]  K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990 .

[16]  J. Makhoul,et al.  Quantization properties of transmission parameters in linear predictive systems , 1975 .

[17]  Allen Gersho,et al.  A fast codebook search algorithm for nearest-neighbor pattern matching , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Bishnu S. Atal,et al.  Beyond Multipulse and CELP Towards High Quality Speech at 4 Kb/s , 1991 .

[19]  Allen Gersho,et al.  Fast search algorithms for vector quantization and pattern matching , 1984, ICASSP.

[20]  Bishnu S. Atal Predictive Coding of Speech at Low Bit Rates , 1982, IEEE Trans. Commun..

[21]  A. Gray,et al.  Distortion performance of vector quantization for LPC voice coding , 1982 .

[22]  Allen Gersho,et al.  Pseudo-Gray coding , 1990, IEEE Trans. Commun..

[23]  L. Fransen,et al.  Application of line-spectrum pairs to low-bit-rate speech encoders , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Kuldip K. Paliwal,et al.  Fast K-dimensional tree algorithms for nearest neighbor search with application to vector quantization encoding , 1992, IEEE Trans. Signal Process..

[25]  Fumitada Itakura,et al.  Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP - , 1986, Speech Commun..

[26]  Frank K. Soong,et al.  Optimal quantization of LSP parameters using delayed decisions , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[27]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[28]  A. Gray,et al.  Quantization and bit allocation in speech processing , 1976 .

[29]  N. Farvardin,et al.  Quantizer design in LSP speech analysis and synthesis , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[30]  Bishnu S. Atal,et al.  Improving performance of multi-pulse LPC coders at low bit rates , 1984, ICASSP.

[31]  Roar Hagen,et al.  Low bit-rate spectral coding in CELP, a new LSP-method , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[32]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[33]  Yair Shoham Cascaded likelihood vector coding of the LPC information , 1989, International Conference on Acoustics, Speech, and Signal Processing,.