Low complexity wideband LSF quantization using GMM of uncorrelated Gaussian mixtures

We develop a Gaussian mixture model (GMM) based vector quantization (VQ) method for coding wideband speech line spectrum frequency (LSF) parameters at low complexity. The PDF of LSF source vector is modeled using the Gaussian mixture (GM) density with higher number of uncorrelated Gaussian mixtures and an optimum scalar quantizer (SQ) is designed for each Gaussian mixture. The reduction of quantization complexity is achieved using the relevant subset of available optimum SQs. For an input vector, the subset of quantizers is chosen using nearest neighbor criteria. The developed method is compared with the recent VQ methods and shown to provide high quality rate-distortion (R/D) performance at lower complexity. In addition, the developed method also provides the advantages of bitrate scalability and rate-independent complexity.

[1]  Thippur V. Sreenivas,et al.  Normalized two stage SVQ for minimum complexity wide-band LSF quantization , 2007, INTERSPEECH.

[2]  Kuldip K. Paliwal,et al.  A comparative study of LPC parameter representations and quantisation schemes for wideband speech coding , 2007, Digit. Signal Process..

[3]  Kuldip K. Paliwal,et al.  Multi-frame GMM-based block quantisation of line spectral frequencies for wideband speech coding , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[5]  Thippur V. Sreenivas,et al.  Predicting VQ Performance Bound for LSF Coding , 2008, IEEE Signal Processing Letters.

[6]  Bhaskar D. Rao,et al.  Low-Complexity Source Coding Using Gaussian Mixture Models, Lattice Vector Quantization, and Recursive Coding with Application to Speech Spectrum Quantization , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Kuldip K. Paliwal,et al.  Efficient product code vector quantisation using the switched split vector quantiser , 2007, Digit. Signal Process..

[8]  Bhaskar D. Rao,et al.  PDF optimized parametric vector quantization of speech line spectral frequencies , 2003, IEEE Trans. Speech Audio Process..

[9]  L. Hanzo,et al.  Speech spectral quantizers for wideband speech coding , 2001, Eur. Trans. Telecommun..

[10]  Kuldip K. Paliwal,et al.  Switched split vector quantisation of line spectral frequencies for wideband speech coding , 2005, INTERSPEECH.

[11]  Bhaskar D. Rao,et al.  Theoretical analysis of the high-rate vector quantization of LPC parameters , 1995, IEEE Trans. Speech Audio Process..

[12]  Kuldip K. Paliwal,et al.  Multi-frame GMM-based block quantisation of line spectral frequencies , 2005, Speech Commun..

[13]  Thippur V. Sreenivas,et al.  Two stage transform vector quantization of LSFs for wideband speech coding , 2006, INTERSPEECH.

[14]  K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990 .

[15]  Thomas R. Fischer,et al.  Low-complexity predictive trellis-coded quantization of speech line spectral frequencies , 2004, IEEE Transactions on Signal Processing.