论文信息 - Model adaptation method for recognition of speech with missing frames.

Model adaptation method for recognition of speech with missing frames.

In distributed speech recognition (DSR), data packets may be lost over error prone channels. A commonly used approach to rectify this is to reconstruct a full frame rate data sequence for recognition using linear interpolation. In this study, an error-concealment decoding method that dynamically adapts the transition probabilities of hidden Markov models to match the frame loss observation sequence is proposed. Experimental results show that a DSR system using the proposed method can achieve the same level of accuracy as a data reconstruction method, is more robust against heavy frame loss, and significantly reduces the computation time.

Fu-Rong Jean | Lee-Min Lee

[1] John H. L. Hansen,et al. Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[2] Lee-Min Lee. Adaptation of hidden Markov models for half frame rate observations , 2010 .

[3] Man-Hung Siu,et al. A Robust Viterbi Algorithm Against Impulsive Noise With Application to Speech Recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4] E. Gilbert. Capacity of a burst-noise channel , 1960 .

[5] Fu-Rong Jean,et al. Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate , 2013, IEEE Transactions on Cybernetics.

[6] Paul Dalsgaard,et al. Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8] Paul Dalsgaard,et al. Automatic speech recognition over error-prone wireless networks , 2005, Speech Commun..

[9] Abeer Alwan,et al. Low-bitrate distributed speech recognition for packet-based and wireless communication , 2002, IEEE Trans. Speech Audio Process..

[10] José L. Pérez-Córdoba,et al. HMM-based channel error mitigation and its application to distributed speech recognition , 2003, Speech Commun..