A multipulse FEC scheme based on amplitude estimation for CELP codecs over packet networks

This paper presents a forward error correction (FEC) technique based on a multipulse representation of the excitation for codeexcited linear prediction (CELP) speech transmission under packet loss conditions. In this approach, the encoder sends the position of a pulse that it is used for the resynchronization of the adaptive codebook, so that propagation errors can be prevented. At the decoder, the amplitude of the resynchronization pulse is estimated by means of minimum mean square error (MMSE) estimation based on Gaussian mixture models (GMMs) of the received parameters and the pulse amplitude. The proposal is tested employing PESQ scores and AMR 12.2 kbps, a wellknown CELP codec. The results show that, with a very small additional information (350 bps), this technique achieves a noticeable improvement over the results obtained by the packet loss concealment included in the legacy codec.

[1]  Ángel M. Gómez,et al.  A Multipulse-Based Forward Error Correction Technique for Robust CELP-Coded Speech Transmission Over Erasure Channels , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Rainer Martin,et al.  Estimation of missing LSF parameters using Gaussian mixture models , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Jian Wang,et al.  Parameter interpolation to enhance the frame erasure robustness of CELP coders in packet networks , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[4]  Manohar N. Murthi,et al.  On packet loss concealment artifacts and their implications for packet labeling in voice over IP , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[5]  Jan Skoglund,et al.  iLBC - a linear predictive coder with robustness to packet losses , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[6]  Bishnu S. Atal,et al.  Amplitude optimization and pitch prediction in multipulse coders , 1989, IEEE Trans. Acoust. Speech Signal Process..

[7]  Yannis Stylianou,et al.  Coding with side information techniques for LSF reconstruction in voice over IP , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  Ángel M. Gómez,et al.  A scalable coding scheme based on interframe dependency limitation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Per Hedelin,et al.  Model based spectrum prediction , 2000, 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421).

[10]  Philippe Gournay,et al.  A study of design compromises for speech coders in packet networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.