Embedded WI coding between 2.0 and 4.8 kbit/s

This paper describes an embedded speech coder based on waveform interpolation (WI) techniques. Because the quantization of the line spectral frequency (LSF) parameters is fairly orthogonal to the quantization of the excitation information, designing an embedded system with WI is much easier than with other approaches. The excitation signal consists of a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW). By allocating bits to these components hierarchically, the proposed system operates at bit rates of 2.0, 2.4, 3.0, 4.0 and 4.8 kbit/s. Listening tests indicate that the performance of the new system is comparable to that of an optimized fixed-rate WI coder, and that quality degrades gracefully as the bit rate decreases.
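The embedded property can be illustrated with a minimal sketch: each frame is stored as a core layer followed by enhancement layers, so a lower-rate stream is obtained by truncating the frame. Only the five bit rates come from the abstract. The 20 ms frame length, the function names, and the idea that each layer is a contiguous bit range are assumptions made for illustration; the paper's actual allocation among LSF, SEW and REW parameters is not given here.

```python
# Bit rates listed in the abstract (kbit/s).
RATES_KBPS = [2.0, 2.4, 3.0, 4.0, 4.8]

# ASSUMPTION: 20 ms frames. The abstract does not state the frame length.
FRAME_MS = 20


def bits_per_frame(rate_kbps, frame_ms=FRAME_MS):
    """Bits available per frame at a given rate (kbit/s * ms = bits)."""
    return round(rate_kbps * frame_ms)


def layer_sizes(rates=RATES_KBPS, frame_ms=FRAME_MS):
    """Size of the core layer followed by each enhancement layer, in bits."""
    totals = [bits_per_frame(r, frame_ms) for r in rates]
    return [totals[0]] + [hi - lo for lo, hi in zip(totals, totals[1:])]


def truncate_frame(frame_bits, rate_kbps, frame_ms=FRAME_MS):
    """Keep only the leading bits of a full-rate frame for a lower rate.

    In an embedded stream the lower-rate frame is a prefix of the
    higher-rate frame, so no re-encoding is needed (hypothetical layout).
    """
    if rate_kbps not in RATES_KBPS:
        raise ValueError(f"unsupported rate: {rate_kbps}")
    return frame_bits[:bits_per_frame(rate_kbps, frame_ms)]


if __name__ == "__main__":
    print(layer_sizes())
    full = [0] * bits_per_frame(4.8)
    print(len(truncate_frame(full, 2.4)))
```

Under the assumed 20 ms frame, the full 4.8 kbit/s frame holds 96 bits and the 2.0 kbit/s core holds 40 bits. A network node could then lower the rate by dropping trailing layers, without access to the encoder.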
