Speech bandwidth extension using magnitude spectrum data hiding

Public telephone systems transmit speech across a limited frequency range, about 300–3400 Hz, called narrowband (NB) which results in a significant reduction of quality and intelligibility of speech. This paper proposes a fully backward compatible novel method for bandwidth extension of NB speech. The method uses magnitude spectrum data hiding technique to provide a perceptually better wideband speech signal. Code excited linear prediction (CELP) parameters are extracted from the down sampled frequency shifted version of the high frequency components of speech signal existing above NB, which are spread by using pseudo-noise codes, and are embedded in the low-amplitude high-frequency regions of the magnitude spectrum of NB speech signal. The embedded information is extracted at the receiving end to reconstruct the wideband speech signal. Theoretical and simulation analyses show that the proposed method is robust to quantization and channel noises. The log spectral distortion test clearly show that the reconstructed wideband signal gives a much better performance in terms of speech quality when compared to some of the existing speech bandwidth extension methods employing data hiding.

[1]  NTTアドバンステクノロジ株式会社 Multi-lingual speech database for telephnometry 1994 , 1994 .

[2]  Henry Leung,et al.  Artificial bandwidth extension of telephony speech by data hiding , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[3]  T. Rabie,et al.  Magnitude Spectrum Speech Hiding , 2007, 2007 IEEE International Conference on Signal Processing and Communications.

[4]  Ted Painter,et al.  Audio Signal Processing and Coding , 2007 .

[5]  Acoustic and phonological factors in vowel identification. , 1979 .

[6]  Peter Jax,et al.  Artificial bandwidth extension of speech supported by watermark-transmitted side information , 2005, INTERSPEECH.

[7]  Gary J. Saulnier,et al.  Perspectives in spread spectrum , 1998 .

[8]  Peter Jax,et al.  Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding? , 2006, IEEE Communications Magazine.

[9]  Peter Jax,et al.  An upper bound on the quality of artificial bandwidth extension of narrowband speech signals , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Henry Leung,et al.  Speech Bandwidth Extension by Data Hiding and Phonetic Classification , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[11]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.