Dual-Mode Wideband Speech Compression

Many bandwidth extension techniques attempt to predict the high-band frequencies based on features extracted from the lower band. Recent work suggests that such methods are limiting because the correlation between the low band and the high band is insufficient for adequate representation. As a result, additional high-band information must be sent to the decoder. In this paper, we propose a dual mode wideband speech coding algorithm based on the principles of bandwidth extension. The principal contributions include a mode selection algorithm based on greedy algorithm that maximizes the loudness criteria, and a bandwidth extension algorithm based on a constrained MMSE estimator. Results reveal that the proposed system improves the quality of narrowband speech while performing at a lower bit rate.

[1]  Visar Berisha,et al.  A Scalable Bandwidth Extension Algorithm , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[2]  Peter Jax,et al.  An upper bound on the quality of artificial bandwidth extension of narrowband speech signals , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Julien Epps,et al.  Wideband Extension of Narrowband Speech for Enhancement and Coding , 2000 .

[4]  Visar Berisha,et al.  Enhancing vocoder performance for music signals , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[5]  Andreas Spanias,et al.  Speech coding: a tutorial review , 1994, Proc. IEEE.

[6]  Peter Jax,et al.  Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7]  Visar Berisha,et al.  Bandwidth Extension of Audio Based on Partial Loudness Criteria , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[8]  W. Bastiaan Kleijn,et al.  On the mutual information between frequency bands in speech , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  W. Bastiaan Kleijn,et al.  Avoiding over-estimation in bandwidth extension of telephony speech , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[10]  Peter J. Patrick Enhancement of band-limited speech signals , 1983 .

[11]  Visar Berisha,et al.  Enhancing the Quality of Coded Audio Using Perceptual Criteria , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[12]  Alan McCree,et al.  A robust narrowband to wideband extension system featuring enhanced codebook mapping , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[13]  W. Bastiaan Kleijn,et al.  Gaussian mixture model based mutual information estimation between frequency bands in speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.