A 8–32 KBIT/S Scalable Wideband Speech and Audio Coding Candidate for ITU-T G729EV Standardization

This paper describes a 8-32 kbit/s scalable speech and audio coder submitted as a candidate for the ITU-T G729-based embedded variable bitrate (G729EV) standardization. The coder is built upon a 3-stage coding structure consisting of: narrowband cascade CELP coding at 8 and 12 kbit/s, bandwidth extension based on wideband linear-predictive coding (WB-LPC) at 14 kbit/s, and MDCT coding in a WB-LPC weighted signal domain from 14 to 32 kbit/s. ITU-T test results showed that this coder passed all the requirements of the G729EV qualification phase

[1]  Hervé Taddei,et al.  A Scalable Three Bit Rate (8, 14.2, and 24 kbit/s) Audio Coder , 1999 .

[2]  Balázs Kövesi,et al.  A scalable speech and audio coding scheme with continuous bitrate flexibility , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Koji Yoshida,et al.  Predictive VQ for bandwidth scalable LSP quantization [speech coding applications] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4]  Kazunori Ozawa,et al.  A bitrate and bandwidth scalable CELP coder , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  Rosario Drogo de Iacovo,et al.  Embedded CELP coding for variable bit-rate between 6.4 and 9.6 kbit/s , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.