Hidden Markov model environmental compensation for automatic speech recognition on hand-held mobile devices

This paper is concerned with applying hidden Markov model compensation techniques for improving the performance of automatic speech recognition (ASR) based services on hand-held mobile devices. The implementation and evaluation of an ASR based task for a mobile, hand–held device is presented, along with a set of compensation techniques that are used to compensate speaker independent hidden Markov models with respect to environmental and transducer variability. A technique for combined environment/transducer compensation is shown to significantly reduce the effects of environmental mismatch. The overall performance degradation with respect to clean conditions was reduced from 41.7 percent to 10.4 percent for speech spoken through a far–field microphone in an office environment, and from 79.2 percent to 39.8 percent for the same transducer in a noisy cafeteria environment.

[1]  Mark J. F. Gales,et al.  Cepstral parameter compensation for HMM recognition in noise , 1993, Speech Commun..

[2]  Douglas A. Reynolds,et al.  Integrated models of signal and background with application to speaker identification in noise , 1994, IEEE Trans. Speech Audio Process..

[3]  Roger K. Moore,et al.  Hidden Markov model decomposition of speech and noise , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4]  Sadaoki Furui,et al.  Adaptation method based on HMM composition and EM algorithm , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  Mark J. F. Gales,et al.  Robust continuous speech recognition using parallel model combination , 1996, IEEE Trans. Speech Audio Process..