Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment

In this work hands-free continuous speech recognition based on microphone arrays is investigated. A set of experiments was carried out using arrays having different numbers of omnidirectional microphones as well as different configurations. Both real and simulated array signals, generated by means of the image method, were used. An enhanced input to a recognizer based on Hidden Markov Models was obtained by a time delay compensation module providing a beamformed signal. HMM adaptation was used to realign the recognizer acoustic modeling to the given acoustic condition.
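
The time delay compensation step can be illustrated with a minimal delay-and-sum sketch. The abstract does not specify how delays are estimated or applied, so everything here is an assumption made for illustration: the function names `estimate_delay` and `delay_and_sum`, the use of plain time-domain cross-correlation, integer-sample delays, and circular shifts via `np.roll`. The actual module may well differ.

```python
import numpy as np

def estimate_delay(ref, sig, max_lag):
    """Return the integer lag (in samples) by which `sig` trails `ref`.

    Assumption: the lag is found by maximizing a circular cross-correlation
    over the range [-max_lag, max_lag].
    """
    lags = np.arange(-max_lag, max_lag + 1)
    # Advance `sig` by each candidate lag and score its match against `ref`.
    scores = [np.dot(ref, np.roll(sig, -lag)) for lag in lags]
    return int(lags[int(np.argmax(scores))])

def delay_and_sum(channels, max_lag=32):
    """Align every channel to channel 0, then average the aligned channels."""
    channels = np.asarray(channels, dtype=float)
    ref = channels[0]
    # np.roll wraps samples around the ends; acceptable for a sketch only.
    aligned = [np.roll(ch, -estimate_delay(ref, ch, max_lag)) for ch in channels]
    return np.mean(aligned, axis=0)

# Toy usage: four noisy copies of one source with different arrival delays.
rng = np.random.default_rng(0)
clean = rng.standard_normal(2000)
true_delays = [0, 3, 7, 12]
mics = [np.roll(clean, d) + 0.5 * rng.standard_normal(2000) for d in true_delays]
beamformed = delay_and_sum(mics)
```

Because the source component adds coherently after alignment while the independent noise terms average out, the beamformed signal in this toy setup is closer to the clean source than any single channel. Real array signals would also need fractional delays and proper edge handling.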
