Model adaptation method for recognition of speech with missing frames.

In distributed speech recognition (DSR), data packets may be lost over error prone channels. A commonly used approach to rectify this is to reconstruct a full frame rate data sequence for recognition using linear interpolation. In this study, an error-concealment decoding method that dynamically adapts the transition probabilities of hidden Markov models to match the frame loss observation sequence is proposed. Experimental results show that a DSR system using the proposed method can achieve the same level of accuracy as a data reconstruction method, is more robust against heavy frame loss, and significantly reduces the computation time.

[1]  John H. L. Hansen,et al.  Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Lee-Min Lee Adaptation of hidden Markov models for half frame rate observations , 2010 .

[3]  Man-Hung Siu,et al.  A Robust Viterbi Algorithm Against Impulsive Noise With Application to Speech Recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  E. Gilbert Capacity of a burst-noise channel , 1960 .

[5]  Fu-Rong Jean,et al.  Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate , 2013, IEEE Transactions on Cybernetics.

[6]  Paul Dalsgaard,et al.  Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Paul Dalsgaard,et al.  Automatic speech recognition over error-prone wireless networks , 2005, Speech Commun..

[9]  Abeer Alwan,et al.  Low-bitrate distributed speech recognition for packet-based and wireless communication , 2002, IEEE Trans. Speech Audio Process..

[10]  José L. Pérez-Córdoba,et al.  HMM-based channel error mitigation and its application to distributed speech recognition , 2003, Speech Commun..