A Speech Recognition System Based on Dynamic Characterization of Background Noise

Robustness of automatic speech recognition (ASR) systems in realistic conditions of background noise is the essential conditions for their ample diffusion. As we know, the systems which exist at present suffer, however, a notable decrease in performance in the presence of background noise. In this article we propose an ASR system based on a dynamic characterization of background noise. In particular, the system makes a dynamic choice of HMM model related to the different types of noise and corresponding to different signal to noise ratios (SNR). The system was implemented and the tests performed using the AURORA2 database. The results were compared with a nonadaptive classification system in the presence of clean conditions and 4 different types of background noise and with 6 different SNRs. The proposed ASR system was found to be particularly adapted to applications functioning in extremely noisy contexts

[1]  David Pearce,et al.  The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[2]  Jean-Claude Junqua,et al.  Robustness in Automatic Speech Recognition: Fundamentals and Applications , 1995 .

[3]  Peter Kabal,et al.  Frame level noise classification in mobile environments , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[4]  S. Casale,et al.  Adaptive robust speech processing based on acoustic noise estimation and classification , 2005, Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005..

[5]  Olivier Siohan,et al.  Sequential estimation with optimal forgetting for robust speech recognition , 2004, IEEE Transactions on Speech and Audio Processing.

[6]  Yifan Gong,et al.  Speech recognition in noisy environments: A survey , 1995, Speech Commun..

[7]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[8]  Jean-Claude Junqua,et al.  Robustness in Automatic Speech Recognition , 1996 .

[9]  Abeer Alwan,et al.  Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR , 2005, IEEE Transactions on Speech and Audio Processing.

[10]  R. G. Leonard,et al.  A database for speaker-independent digit recognition , 1984, ICASSP.