Wideband speech coding has been gaining popularity in recent years. Due to its higher sampling rate, wideband speech inherently requires higher processing power than telephone-bandwidth speech when the same coding algorithm is used. However, today many wideband coding applications demand a complexity that is even lower than that of most state-of-the-art telephone-bandwidth coders. To meet this complexity challenge, we created two wideband coders that are fundamentally different from and simpler than popular narrowband coders. The first one is a transform coder based on the modified discrete cosine transform (MDCT), LPC spectral fit, and pitch harmonic fit. The second algorithm is called “Transform Predictive Coding”, or TPC. It uses short-term and long-term prediction to remove the redundancy in speech. The prediction residual is quantized in the frequency domain based on a calculated noise masking threshold. In its current form, the TPC coder uses only open-loop quantization and therefore has a low complexity. We estimate that its complexity is only about half of that of the ITU-T 16 kb/s G.728 LD-CELP narrowband coder. The speech quality of TPC is almost transparent at 32 kb/s, very good at 24 kb/s, and acceptable at 16 kb/s. Work is in progress to further improve the speech quality at 16 kb/s.
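The TPC signal path described above (short-term prediction, long-term prediction, then open-loop quantization of the residual in the frequency domain) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the LPC order, the lag search range, the use of a plain FFT as the transform, and the single fixed step size standing in for the noise masking threshold are all assumptions made for illustration only.

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation method with Levinson-Durbin; returns a[0..order], a[0] = 1."""
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-9  # small bias guards against an all-zero frame
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a

def short_term_residual(frame, a):
    """Inverse-filter the frame with A(z) = sum_k a[k] z^-k."""
    return np.convolve(frame, a)[:len(frame)]

def long_term_predict(res, min_lag, max_lag):
    """Open-loop single-tap pitch predictor: returns (lag, gain, residual)."""
    best_lag, best_score = min_lag, -np.inf
    for lag in range(min_lag, max_lag + 1):
        cur, past = res[lag:], res[:-lag]
        energy = np.dot(past, past)
        if energy <= 0.0:
            continue
        score = np.dot(cur, past) ** 2 / energy  # energy removed at this lag
        if score > best_score:
            best_lag, best_score = lag, score
    cur, past = res[best_lag:], res[:-best_lag]
    gain = np.dot(cur, past) / (np.dot(past, past) + 1e-12)
    return best_lag, gain, cur - gain * past

def quantize_spectrum(res, step):
    """Uniform open-loop quantization of FFT coefficients.

    Assumption: one fixed step replaces the per-band step sizes that a
    noise masking threshold would supply in a real coder.
    """
    spec = np.fft.rfft(res)
    idx_re = np.round(spec.real / step)
    idx_im = np.round(spec.imag / step)
    recon = np.fft.irfft((idx_re + 1j * idx_im) * step, n=len(res))
    return idx_re, idx_im, recon

# Synthetic "voiced" frame: pulse train (period 80) through a one-pole filter.
rng = np.random.default_rng(0)
n = 512
exc = 0.01 * rng.standard_normal(n)
exc[::80] += 1.0
x = np.zeros(n)
for i in range(n):
    x[i] = exc[i] + (0.9 * x[i - 1] if i > 0 else 0.0)

a = lpc_coefficients(x, order=16)
e = short_term_residual(x, a)
lag, gain, d = long_term_predict(e, min_lag=32, max_lag=120)
idx_re, idx_im, d_hat = quantize_spectrum(d, step=0.05)
```

Each stage removes one kind of redundancy before quantization: the short-term predictor flattens the spectral envelope, and the long-term predictor removes pitch periodicity. Because the quantizer runs open-loop, no analysis-by-synthesis search is needed, which is consistent with the low complexity claimed in the abstract.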