Spectral subtraction for reverberation reduction applied to automatic speech recognition

In real conditions, reverberation severely compromises the accuracy of automatic speech recognition (ASR) systems. The use and the assessment of spectral subtraction for reducing the reverberation effect in ASR systems is discussed. For such, a general procedure is presented. The influence of some important dereverberation parameters, such as the reverberation time, on the recognition rate is evaluated. By using the referred approach, the speech recognition rate is considerably improved.

[1]  Laurent Couvreur,et al.  Blind Model Selection for Automatic Speech Recognition in Reverberant Environments , 2004, J. VLSI Signal Process..

[2]  Guy J. Brown,et al.  Missing data speech recognition in reverberant conditions , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Douglas D. O'Shaughnessy,et al.  Interacting with computers by voice: automatic speech recognition and synthesis , 2003, Proc. IEEE.

[4]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[5]  E.A.P. Habets,et al.  Single-Channel Speech Dereverberation based on Spectral Subtraction , 2004 .

[6]  Les E. Atlas,et al.  Strategies for improving audible quality and speech recognition accuracy of reverberant speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7]  Douglas L. Jones,et al.  Blind estimation of reverberation time. , 2003, The Journal of the Acoustical Society of America.

[8]  J. Boucher,et al.  A New Method Based on Spectral Subtraction for Speech , 2001 .

[9]  Nelson Morgan,et al.  Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments , 1998 .

[10]  Maurizio Omologo,et al.  Environmental conditions and acoustic transduction in hands-free speech recognition , 1998, Speech Commun..

[11]  Henrique S. Malvar,et al.  Speech dereverberation via maximum-kurtosis subband adaptive filtering , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  Rui Seara,et al.  A single-microphone approach for speech signal dereverberation , 2005, 2005 13th European Signal Processing Conference.

[13]  Jacob Benesty,et al.  New insights into the noise reduction Wiener filter , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  P. Renevey Speech recognition in noisy conditions using missing feature approach , 2000 .

[15]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..