A Novel Error Mitigation Scheme Based on Replacement Vectors and FEC Codes for Speech Recovery in Loss-Prone Channels

In this paper, we propose an error mitigation scheme that combines two approaches: a replacement super vector technique, which provides replacements to reconstruct both the LPC coefficients and the excitation signal across bursts of lost packets, and a Forward Error Correction (FEC) technique, which minimizes error propagation after the last lost frame. The FEC code is embedded into the bitstream, which avoids any bitrate increase and keeps the codec working in a compliant way on clean transmissions. The success of our recovery technique relies heavily on the quantization of the speech parameters (LPC coefficients and excitation signal), especially for the excitation signal, where a modified version of the well-known Linde-Buzo-Gray (LBG) algorithm is applied. The performance of our proposal is evaluated on the AMR codec in terms of speech quality using the PESQ algorithm. Our proposal achieves a noticeable improvement over the standard AMR codec under adverse channel conditions, without incurring high computational cost or delay at the decoding stage and without consuming any additional bitrate.
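As a point of reference for the quantization step, the following is a minimal sketch of the standard LBG codebook design with binary splitting. It is not the modified version used in this work, whose details are not given here. The function names (`lbg`, `nearest`, `centroid`) and the parameters `eps`, `tol`, and `max_iter` are illustrative assumptions, and the squared Euclidean distance is assumed as the distortion measure.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(codebook, vector):
    """Index of the codeword closest to `vector`."""
    return min(range(len(codebook)),
               key=lambda i: squared_distance(codebook[i], vector))


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def lbg(training, size, eps=0.01, tol=1e-6, max_iter=100):
    """Standard LBG with binary splitting (illustrative, not the paper's variant).

    The codebook doubles at every split, so `size` is assumed to be a
    power of two; otherwise the result has the next power of two entries.
    """
    codebook = [centroid(training)]
    while len(codebook) < size:
        # Split every codeword into two copies perturbed by +eps and -eps.
        codebook = [[c + s for c in cw] for cw in codebook for s in (eps, -eps)]
        previous = float("inf")
        for _ in range(max_iter):
            # Nearest-neighbour partition of the training set.
            cells = [[] for _ in codebook]
            distortion = 0.0
            for v in training:
                i = nearest(codebook, v)
                cells[i].append(v)
                distortion += squared_distance(codebook[i], v)
            distortion /= len(training)
            # Centroid update; an empty cell keeps its previous codeword.
            codebook = [centroid(cell) if cell else cw
                        for cell, cw in zip(cells, codebook)]
            # Stop when the relative drop in mean distortion is small.
            if previous - distortion <= tol * max(distortion, 1e-12):
                break
            previous = distortion
    return codebook
```

Calling `lbg(training, 2)` on a list of equal-length vectors returns a two-entry codebook; each training vector can then be mapped to its codeword index with `nearest(codebook, vector)`.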
