论文信息 - A New Method Based on Spectral Subtraction for Speech Dereverberation

A New Method Based on Spectral Subtraction for Speech Dereverberation

Summary A new monaural method for the suppression of late room reverberation from speech signals, based on spectral subtraction, is presented. The problem of reverberation suppression differs from classical speech de-noising in that the “reverberation noise” is non stationary. In this paper, the use of a novel estimator of the non-stationary reverberationnoise power spectrum, based on a statistical model of late reverberation, is presented. The algorithm is tested on real reverberated signals. The performances for different RIRs with ranging from 0.34 s to 1.7 s consistently show significant noise reduction with little signal distortion. Moreover, when used as a front end to an automatic speech recognition system, the algorithm brings about dramatic improvements in terms of automatic speech recognition scores in various reverberant environments.

[1] F. Itakura,et al. Dereverberation of Speech Signals Based on Sub-Band Envelope Estimation , 1991 .

[2] Beghdad Ayad. Systemes combines d'annulation d'echo acoustique et de reduction de bruit pour les terminaux mains-libres , 1997 .

[3] Sven Fischer,et al. Suppression of coherent and incoherent noise using a microphone array , 1994 .

[4] Sridha Sridharan,et al. Speech-seeking microphone array with multi-stage processing , 1995, EUROSPEECH.

[5] R. Zelinski,et al. A microphone array with adaptive post-filtering for noise reduction in reverberant rooms , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[6] Sridha Sridharan,et al. Position-Independent Enhancement of Reverberant Speech , 1997 .

[7] W Soede,et al. Development of a directional hearing instrument based on array technology. , 1993, The Journal of the Acoustical Society of America.

[8] B Kollmeier,et al. Binaural noise-reduction hearing aid scheme with real-time processing in the frequency domain. , 1993, Scandinavian audiology. Supplementum.

[9] James A. Moorer,et al. About This Reverberation Business , 1978 .

[10] P. Jeffrey Bloom. Evaluation of a dereverberation process by normal and impaired listeners , 1980, ICASSP.

[11] Claude Marro. Traitements de dereverberation et de debruitage pour le signal de parole dans des contextes de communication interactive , 1996 .

[12] P. Jeffrey Bloom,et al. Evaluation of two-input speech dereverberation techniques , 1982, ICASSP.

[13] Richard M. Schwartz,et al. Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[14] John Mourjopoulos,et al. Modelling and enhancement of reverberant speech using an envelope convolution method , 1983, ICASSP.

[15] J.B. Allen,et al. A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[16] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[17] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[18] Markus Bodden. A concept for a cocktail-party-processor , 1990, ICSLP.

[19] R. H. Bolt,et al. Theory of Speech masking by reverberation , 1949 .

[20] Gerald A. Studebaker,et al. Acoustical Factors Affecting Hearing Aid Performance , 1992 .

[21] Olivier Cappé. Techniques de reduction de bruit pour la restauration d'enregistrements musicaux , 1993 .

[22] David R. Cole,et al. Speaker recognition in reverberant enclosures , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[23] P. W. Smith,et al. Comments on “New Method of Measuring Reverberation Time” [M. R. Schroeder, J. Acoust. Soc. Am. 37, 409–412 (1965)] , 1965 .

[24] Katia Lebart. Speech dereverberation applied to automatic speech recognition and hearing aids , 1999 .

[25] M. Bodden. Modeling human sound-source localization and the cocktail-party-effect , 1993 .

[26] B Kollmeier,et al. Real-time multiband dynamic compression and noise reduction for binaural hearing aids. , 1993, Journal of rehabilitation research and development.

[27] James L. Flanagan,et al. Robust distant-talking speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[28] J. Polack. La transmission de l'energie sonore dans les salles , 1988 .

[29] Jont B. Allen,et al. Invertibility of a room impulse response , 1979 .

[30] Yannick Mahieux,et al. Performance of adaptive dereverberation techniques using directivity controlled arrays , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[31] A. Nabelek,et al. Reverberant overlap- and self-masking in consonant identification. , 1989, The Journal of the Acoustical Society of America.

[32] Sven Fischer,et al. Beamforming microphone arrays for speech acquisition in noisy environments , 1996, Speech Commun..

[33] S F Bahgat,et al. Envelope expansion methods for speech enhancement. , 1991, The Journal of the Acoustical Society of America.

[34] Robert F. Kubichek,et al. Standards and technology issues in objective voice quality assessment , 1991, Digit. Signal Process..

[35] J Verschuure,et al. Directional hearing aid based on array technology. , 1995, Scandinavian audiology. Supplementum.