A New Method Based on Spectral Subtraction for Speech Dereverberation

Summary: A new monaural method for suppressing late room reverberation in speech signals, based on spectral subtraction, is presented. Reverberation suppression differs from classical speech de-noising in that the "reverberation noise" is non-stationary. This paper introduces a novel estimator of the non-stationary reverberation-noise power spectrum, based on a statistical model of late reverberation. The algorithm is tested on real reverberated signals. Results for different room impulse responses (RIRs), with reverberation times ranging from 0.34 s to 1.7 s, consistently show significant noise reduction with little signal distortion. Moreover, when used as a front end to an automatic speech recognition system, the algorithm markedly improves recognition scores in various reverberant environments.
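The summary does not give the estimator's formula, so the following is only an illustrative sketch of the general idea and not the paper's algorithm. It assumes an exponentially decaying model of late reverberation with decay rate delta = 3 ln(10) / T60, predicts the late-reverberation power in each frame from the observed power a fixed delay earlier, and turns that prediction into a spectral-subtraction gain. The function name, the 50 ms delay, and the gain floor are all hypothetical choices made for the example.

```python
import numpy as np

def late_reverb_gain(power, t60, hop_s, delay_s=0.05, floor=0.1):
    """Spectral-subtraction gain against a predicted late-reverberation power.

    power   : (frames, bins) short-time power spectrogram of the reverberant signal
    t60     : reverberation time in seconds (assumed known or estimated elsewhere)
    hop_s   : hop between successive frames, in seconds
    delay_s : assumed boundary between early and late reverberation (hypothetical)
    floor   : lower bound on the amplitude gain, to limit distortion (hypothetical)
    """
    # Assumed model: energy decays as exp(-2 * delta * t), reaching -60 dB at t60.
    delta = 3.0 * np.log(10.0) / t60
    d = max(1, int(round(delay_s / hop_s)))  # delay expressed in frames

    # Predicted late-reverberation power: attenuated copy of the past power.
    reverb = np.zeros_like(power)
    reverb[d:] = np.exp(-2.0 * delta * d * hop_s) * power[:-d]

    # Power-domain subtraction converted to an amplitude gain, with a floor.
    ratio = reverb / np.maximum(power, 1e-12)
    return np.sqrt(np.maximum(1.0 - ratio, floor ** 2))
```

The returned gain would be multiplied with the magnitude of each short-time spectrum before resynthesis. Because the predicted reverberation power follows the signal's own past power, the subtracted quantity varies from frame to frame, which is the sense in which the "noise" is non-stationary.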
