A scalable coding scheme based on interframe dependency limitation

While VoIP (voice over IP) is gaining importance in comparison with other types of telephony, packet loss remains as the main source of degradation in VoIP systems. Traditional speech codecs, such as those based on the CELP (code excited linear prediction) paradigm, can achieve low bit-rates at the cost of introducing interframe dependencies. As a result, the effect of a packet loss burst is propagated to the frames correctly received after the burst. iLBC (internet low bit-rate codec) alleviates this problem by removing the interframe dependencies at the cost of a higher bit-rate. In this paper we propose a combination of iLBC with an ACELP (algebraic CELP) codec in which a variable number of ACELP-coded frames is inserted between every two iLBC-coded frames. The experimental results show that the combined codec can achieve a performance close to that of iLBC at different loss conditions but with a smaller bit-rate. Also, scalability is achieved by modifying the number of inserted ACELP-coded frames.

[1]  Jerry D. Gibson,et al.  A multiple description speech coder based on AMR-WB for mobile ad hoc networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Ahmet M. Kondoz,et al.  Multiple Description Coding for Voice over IP using Sinusoidal Speech Coding , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[4]  Philippe Gournay,et al.  A study of design compromises for speech coders in packet networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  W. Bastiaan Kleijn,et al.  Internet Low Bit Rate Codec (iLBC) , 2004, RFC.

[6]  Philippe Gournay,et al.  Improved packet loss recovery using late frames for prediction-based speech coders , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7]  Manohar N. Murthi,et al.  Towards iLBC speech coding at lower rates through a new formulation of the start state search , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  Jan Skoglund,et al.  iLBC - a linear predictive coder with robustness to packet losses , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[9]  Wenyu Jiang,et al.  Modeling of Packet Loss and Delay and Their Effect on Real-Time Multimedia Service Quality , 2000 .

[10]  Manohar N. Murthi,et al.  On Variable Rate Frame Independent Predictive Speech Coding: Re-Engineering ILBC , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Roch Lefebvre,et al.  Efficient Frame Erasure Concealment in Predictive Speech Codecs using Glottal Pulse Resynchronisation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[12]  Peter Vary,et al.  A candidate proposal for a 3GPP adaptive multi-rate wideband speech codec , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).