论文信息 - Efficient encoding of speech LSP parameters using the discrete cosine transformation

Efficient encoding of speech LSP parameters using the discrete cosine transformation

The intraframe and interframe correlation properties are used to develop two efficient encoding algorithms for speech line spectrum pair (LSP) parameters. The first algorithm (2-D DCT), which requires relatively large coding delays, is based on two-dimensional (time and frequency) discrete cosine transform coding techniques; the second algorithm (DCT-DPCM), which does not need any coding delay, uses one-dimensional discrete cosine transform in the frequency domain and DPCM (differential pulse-code modulation) in the time domain. The performances of these systems for different bit rates and delays are studied, and appropriate comparisons are made. It is shown that an average spectral distortion of approximately 1 dB/sup 2/ can be achieved with 21 and 25 bits/frame using the 2-D DCT and DCT-DPCM schemes, respectively. This is a noticeable improvement over the previously reported bit rates of 32 bits/frame and above.<<ETX>>

Rajiv Laroia | Nariman Farvardin

[1] Nariman Farvardin,et al. Optimal block cosine transform image coding for noisy channels , 1990, IEEE Trans. Commun..

[2] Nariman Farvardin,et al. Quantizer design in LSP speech analysis-synthesis , 1988, IEEE J. Sel. Areas Commun..

[3] Biing-Hwang Juang,et al. Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[4] Thomas P. Barnwell,et al. A low bit rate segment vocoder based on line spectrum pairs , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] John C. Hardwick,et al. A 4.8 kbps multi-band excitation speech coder , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.