A Band Extension Technique for G.711 Speech Using Steganography

This study investigates a band extension technique for speech data encoded with G.711, the most common codec for digital speech communications system such as VoIP. The proposed technique employs steganography for the transmission of the side information required for the band extension. Due to the steganography, the proposed technique is able to enhance the speech quality without an increase of the amount of data transmission. From the results of a subjective experiment, it is indicated that the proposed technique may potentially be useful for improving the speech quality, compared with the conventional technique.

[1]  Andreas Johannes Gerrits,et al.  Hi-BIN: an alternative approach to wideband speech coding , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[2]  Alan McCree,et al.  A 14 kb/s wideband speech coder with a parametric highband model , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[3]  Udo Zoelzer,et al.  DAFX: Digital Audio Effects , 2011 .

[4]  Peter Kabal,et al.  Classified highband excitation for bandwidth extension of telephony signals , 2005, 2005 13th European Signal Processing Conference.

[5]  P. Strevens Iii , 1985 .

[6]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[7]  Ronaldus Maria Aarts,et al.  Improving percieved bass and reconstruction of high frequencies for band limited signals , 2002 .

[8]  E. Paulus,et al.  Speech Signal Processing , 1997, The Electrical Engineering Handbook - Six Volume Set.

[9]  Thomas P. Barnwell,et al.  MCCREE AND BARNWELL MIXED EXCITAmON LPC VOCODER MODEL LPC SYNTHESIS FILTER 243 SYNTHESIZED SPEECH-PERIODIC PULSE TRAIN-1 PERIODIC POSITION JITTER PULSE 4 , 2004 .