论文信息 - A source model mitigation technique for distributed speech recognition over lossy packet channels

A source model mitigation technique for distributed speech recognition over lossy packet channels

In this paper, we develop a new mitigation technique for a distributed speech recognition system over IP. We have designed and tested several methods to improve the interpolation used in the Aurora DSR ETSI standard without any significant increase of computational cost at the decoder. These methods make use of the information contained in the data-source, because, in IP networks, unlike in cellular networks, no information is received during packet losses. When a packet loss occurs, the lost information can be reconstructed through estimations from the N nearest received packets. Due to the enormous amount of combinations from previous and next received speech vector sequences, we have developed a methodology that drastically reduces the amount of required estimations.

[1] Carmen Peláez-Moreno,et al. Recognizing voice over IP: a robust front-end for speech recognition on the world wide web , 2001, IEEE Trans. Multim..

[2] Jean-Chrysotome Bolot. End-to-end packet delay and loss behavior in the internet , 1993, SIGCOMM 1993.

[3] José L. Pérez-Córdoba,et al. Low complexity channel error mitigation for distributed speech recognition over wireless channels , 2003, IEEE International Conference on Communications, 2003. ICC '03..

[4] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[5] Hugo Van hamme,et al. Investigation of speech recognition over IP channels , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] V. Hardman,et al. A survey of packet loss recovery techniques for streaming audio , 1998, IEEE Network.

[7] José L. Pérez-Córdoba,et al. HMM-based channel error mitigation and its application to distributed speech recognition , 2003, Speech Commun..