LSP quantization by a union of locally trained codebooks

We present a fixed-rate encoding scheme for the line spectrum pair (LSP) representation of an LPC filter, based on Gaussian mixture (GM) modeling. For each mixture component, we construct a local codebook as a union of product quantizers. Each local codebook is trained independently on synthetic data, using a clustering scheme similar to the generalized Lloyd algorithm (GLA). Each training iteration is fast because encoding has low complexity, and the algorithm converges in a few iterations. The overall codebook is the combination of the local codebooks; it inherits their high performance while keeping complexity moderate. We report numerical results for the average spectral distortion (SD) of the proposed encoder and benchmark them against a lower bound derived from high-rate theory. We achieve an average SD (full-band measure) of 1 dB at 23 b/frame for speech signals sampled at 8 kHz with LPC of order 10. By tolerating additional complexity, we reach an SD within 0.01 dB of the lower bound.
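The structure described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the mixture means and covariances are already known, uses a single product quantizer per component (rather than a union of them), decorrelates with the component's eigenvectors, and measures error with plain squared error instead of spectral distortion. The names `lloyd_1d`, `train_local_codebook`, and `encode` are hypothetical.

```python
import numpy as np

def lloyd_1d(samples, levels, iters=20):
    """Train a scalar codebook with Lloyd iterations (the 1-D case of GLA)."""
    # Initialise the levels at evenly spaced quantiles of the training data.
    codebook = np.quantile(samples, (np.arange(levels) + 0.5) / levels)
    for _ in range(iters):
        # Nearest-neighbour partition, then centroid update.
        idx = np.argmin(np.abs(samples[:, None] - codebook[None, :]), axis=1)
        for k in range(levels):
            members = samples[idx == k]
            if members.size:
                codebook[k] = members.mean()
    return np.sort(codebook)

def train_local_codebook(mean, cov, levels_per_dim, n_synth=5000, seed=0):
    """Train one product quantizer for one Gaussian component.

    Assumption: training data is synthetic, drawn from the component itself,
    and each decorrelated coordinate gets its own scalar codebook.
    """
    rng = np.random.default_rng(seed)
    _, basis = np.linalg.eigh(cov)            # columns are eigenvectors
    synth = rng.multivariate_normal(mean, cov, size=n_synth)
    coeffs = (synth - mean) @ basis           # decorrelated coordinates
    scalars = [lloyd_1d(coeffs[:, d], levels_per_dim[d])
               for d in range(len(mean))]
    return {"mean": mean, "basis": basis, "scalars": scalars}

def encode(x, local_codebooks):
    """Quantize x with every local codebook and keep the best candidate."""
    best = None
    for m, cb in enumerate(local_codebooks):
        c = (x - cb["mean"]) @ cb["basis"]
        idx = [int(np.argmin(np.abs(s - c[d])))
               for d, s in enumerate(cb["scalars"])]
        q = np.array([s[i] for s, i in zip(cb["scalars"], idx)])
        xhat = cb["mean"] + cb["basis"] @ q
        err = float(np.sum((x - xhat) ** 2))  # squared error, not SD
        if best is None or err < best[0]:
            best = (err, m, idx, xhat)
    return best[1], best[2], best[3]

# Toy usage: two well-separated 2-D components, 8 levels per dimension.
codebooks = [
    train_local_codebook(np.array([0.0, 0.0]), np.eye(2), [8, 8]),
    train_local_codebook(np.array([10.0, 10.0]), np.eye(2), [8, 8]),
]
component, indices, reconstruction = encode(np.array([10.2, 9.7]), codebooks)
```

In this sketch the encoder output is the pair (component, per-dimension indices). Under the stated assumptions, a fixed-rate code would index the union of all local codebooks, so the rate is set by the total number of entries across components.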
