Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1

Recommendation G.729.1 is a new ITU-T standard which was approved in May 2006. This recommendation describes a hierarchical speech and audio coding algorithm built on top of a narrowband core codec. One challenge in the codec design is the generation of a wideband signal with a very limited additional bit rate (less than 2 kb/s). In this paper, we describe the respective codec layer, which extends the transmitted acoustic bandwidth from the narrowband frequency range (50 Hz-4 kHz) to the wideband frequency range (50 Hz-7 kHz). The underlying algorithm uses a fairly coarse parametric description of the temporal and spectral energy envelopes of the high frequency band (4-7 kHz). This parameter set is quantized with a bit rate of 1.65 kb/s. At the decoder side, the high-frequency components are regenerated by appropriately shaping a synthetically generated ldquoexcitation signal.rdquo Apart from the algorithmic description and a discussion, we state a complexity evaluation as well as some listening test results.

[1]  Peter Vary,et al.  Digital Speech Transmission: Enhancement, Coding and Error Concealment , 2006 .

[2]  Peter Jax,et al.  A Celp-Wavelet Scalable Wideband Speech Coder , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[3]  Peter Jax,et al.  A Qualified ITU-T G.729EV Codec Candidate for Hierarchical Speech and Audio Coding , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[4]  Yang Gao,et al.  ITU-T G.729.1: AN 8-32 Kbit/S Scalable Coder Interoperable with G.729 for Wideband Telephony and Voice Over IP , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[5]  Peter Jax,et al.  An Embedded Scalable Wideband Codec Based on the GSM EFR Codec , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[6]  Xavier Maitre,et al.  7 kHz audio coding within 64 kbit/s , 1988, IEEE J. Sel. Areas Commun..

[7]  Yannis Stylianou,et al.  Conditional Vector Quantization for Speech Coding , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Peter Vary,et al.  Backwards Compatible Wideband Telephony in Mobile Networks: CELP Watermarking and Bandwidth Extension , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[9]  Jürgen W. Paulus,et al.  16 kbit/s wideband speech coding based on unequal subbands , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[10]  Alan McCree,et al.  A 14 kb/s wideband speech coder with a parametric highband model , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[11]  Andreas Johannes Gerrits,et al.  Hi-BIN: an alternative approach to wideband speech coding , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[12]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[13]  Roch Lefebvre,et al.  Bandwidth extension of narrowband speech for low bit-rate wideband coding , 2000, 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421).

[14]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[15]  Peter Jax,et al.  , and Vary : G . 729 “ Add-On ” ITG-Fachtagung Sprachkommunikation 2006 A Scalable Wideband “ Add-On ” for the G . 729 Speech Codec , 2006 .

[16]  Peter Vary,et al.  Trends and perspectives in wideband speech coding , 2000, Signal Process..

[17]  Ronaldus Maria Aarts,et al.  Bandwidth Extension for Speech , 2005 .

[18]  Peter Jax,et al.  Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding? , 2006, IEEE Communications Magazine.

[19]  Peter Vary,et al.  Artificial bandwidth extension without side information for ITU-t g.729.1 , 2007, INTERSPEECH.

[20]  Per Ekstrand BANDWIDTH EXTENSION OF AUDIO SIGNALS BY SPECTRAL BAND REPLICATION , 2002 .