Efficient vector quantization of LPC parameters at 24 bits/frame

Though vector quantizers are more efficient than scalar quantizers, their use for fine quantization of linear predictive coding (LPC) information (using 24-26 b/frame) is impeded due to their prohibitively high complexity. In the present work, a split vector quantization approach is used to overcome the complexity problem. The LPC vector, consisting of ten line spectral frequencies (LSFs), is divided into two parts and each part is quantized separately using vector quantization. Using the localized spectral sensitivity property of the LSF parameters, a weighted LSF distance measure is proposed. Using this distance measure, it is shown that the split vector quantizer can quantize LPC information in 24 b/frame with 1-dB average spectral distortion and <2% outlier frames (having spectral distortion greater than 2 dB).<<ETX>>

[1]  Richard V. Cox,et al.  Spectral quantization and interpolation for CELP coders , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[2]  Biing-Hwang Juang,et al.  Optimal quantization of LSP parameters , 1993, IEEE Trans. Speech Audio Process..

[3]  Allen Gersho,et al.  Pseudo-Gray coding , 1990, IEEE Trans. Commun..

[4]  Takehiro Moriya,et al.  Speech coder using phase equalization and vector quantization , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Allen Gersho,et al.  Fast search algorithms for vector quantization and pattern matching , 1984, ICASSP.

[6]  Joseph P. Campbell,et al.  An expandable error-protected 4800 bps CELP coder (US Federal Standard 4800 bps voice coder) , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7]  Frank K. Soong,et al.  Optimal quantization of LSP parameters using delayed decisions , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[8]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[9]  A. Gray,et al.  Quantization and bit allocation in speech processing , 1976 .

[10]  N. Farvardin,et al.  Quantizer design in LSP speech analysis and synthesis , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[11]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[12]  A. Gray,et al.  Distortion performance of vector quantization for LPC voice coding , 1982 .

[13]  Yair Shoham Cascaded likelihood vector coding of the LPC information , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[14]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[15]  L. Fransen,et al.  Application of line-spectrum pairs to low-bit-rate speech encoders , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  Kuldip K. Paliwal,et al.  Fast K-dimensional tree algorithms for nearest neighbor search with application to vector quantization encoding , 1992, IEEE Trans. Signal Process..

[17]  Nariman Farvardin,et al.  A study of vector quantization for noisy channels , 1990, IEEE Trans. Inf. Theory.

[18]  Dominique Massaloux,et al.  OF SPEECH SIGNALS , 1989 .

[19]  Bishnu S. Atal,et al.  Improving performance of multi-pulse LPC coders at low bit rates , 1984, ICASSP.

[20]  Roar Hagen,et al.  Low bit-rate spectral coding in CELP, a new LSP-method , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[21]  Fumitada Itakura,et al.  Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP - , 1986, Speech Commun..

[22]  Allen H. Levesque,et al.  Error-control techniques for digital communication , 1985 .

[23]  Allen Gersho,et al.  Phonetically-based vector excitation coding of speech at 3.6 kbps , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[24]  J. Makhoul,et al.  Quantization properties of transmission parameters in linear predictive systems , 1975 .

[25]  B. S. Atal,et al.  PREDICTIVE CODING OF SPEECH USING ANALYSIS-BY-SYNTHESIS TECHNIQUES , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers, 1990..

[26]  Allen Gersho,et al.  A fast codebook search algorithm for nearest-neighbor pattern matching , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[27]  Bishnu S. Atal,et al.  Predictive Coding of Speech at Low Bit Rates , 1982, IEEE Trans. Commun..

[28]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[29]  K. K. Paliwal A perception‐based LSP distance measure for speech recognition , 1988 .

[30]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[31]  Thomas P. Barnwell,et al.  A low bit rate segment vocoder based on line spectrum pairs , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  B. S. Atal,et al.  High-quality digital speech at 4 kb/s , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.