Estimation of missing LSF parameters using Gaussian mixture models

Speech transmission over packet networks has to cope with packet delays and packet losses. When a packet loss occurs the missing information must be estimated. We focus on restoring the spectral parameters of a speech coder. A novel approach to estimating missing line spectral frequency (LSF) parameters using Gaussian mixture models (GMM) is proposed. We present the estimation algorithm and study its performance when one or several LSF parameters are lost. We show that a GMM of a relatively low order is sufficient to achieve a substantial improvement in the parameter SNR. Therefore, the new estimation procedure requires much less memory than histogram based estimation methods.

[1]  Peter Vary,et al.  Robust speech decoding: a universal approach to bit error concealment , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[3]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[4]  Phil D. Green,et al.  Some solution to the missing feature problem in data classification, with application to noise robust ASR , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  Peter Vary,et al.  Softbit speech decoding: a new approach to error concealment , 2001, IEEE Trans. Speech Audio Process..

[6]  Peter Vary,et al.  Error concealment by near optimum MMSE-estimation of source codec parameters , 2000, 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421).

[7]  Jan Skoglund,et al.  Vector quantization based on Gaussian mixture models , 2000, IEEE Trans. Speech Audio Process..