Single-Microphone Early and Late Reverberation Suppression in Noisy Speech

This paper presents a single-microphone approach to the enhancement of noisy reverberant speech via inverse filtering and spectral processing. An efficient algorithm is used to blindly estimate the inverse filter of the Room Impulse Response (RIR). This filter is used to attenuate the early reverberation. A simple technique to blindly determine the filter length is presented. A two-step spectral subtraction method is proposed to efficiently reduce the effects of background noise and the residual reverberation on the equalized impulse response. In general, the equalized impulse response has two detrimental effects, late impulses and pre-echoes. For the late impulses, an efficient spectral subtraction algorithm is developed which introduces only minor musical noise. Then a new algorithm is introduced which reduces the remaining pre-echo effects. The performance of this two-stage method is examined in different reverberant conditions including real environments. It is also evaluated with white Gaussian and recorded babble noise. The results obtained demonstrate that the proposed blind method is superior in terms of reducing early and late reverberation effects and noise compared to well known single-microphone techniques in the literature.

[1]  Richard Heusdens,et al.  Single-microphone late-reverberation suppression in noisy speech by exploiting long-term correlation in the DFT domain , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Masashi Unoki,et al.  A method based on the MTF concept for dereverberating the power envelope from the reverberant signal , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  Peter Vary,et al.  A binaural room impulse response database for the evaluation of dereverberation algorithms , 2009, 2009 16th International Conference on Digital Signal Processing.

[4]  Rainer Martin,et al.  Bias compensation methods for minimum statistics noise power spectral density estimation , 2006, Signal Process..

[5]  DeLiang Wang,et al.  A Supervised Learning Approach to Monaural Segregation of Reverberant Speech , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[7]  Rainer Martin,et al.  NOISE POWER SPECTRAL DENSITY ESTIMATION ON HIGHLY CORRELATED DATA , 2006 .

[8]  Eap Emanuël Habets Single- and multi-microphone speech dereverberation using spectral enhancement , 2007 .

[9]  Li Deng,et al.  Speech Denoising and Dereverberation Using Probabilistic Models , 2000, NIPS.

[10]  Peter Vary,et al.  Low Delay Noise Reduction and Dereverberation for Hearing Aids , 2009, EURASIP J. Adv. Signal Process..

[11]  Sven Nordholm,et al.  Spectral subtraction using reduced delay convolution and adaptive averaging , 2001, IEEE Trans. Speech Audio Process..

[12]  H. Sheikhzadeh,et al.  Single-Microphone LP Residual Skewness-Based Inverse Filtering of the Room Impulse Response , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Jacob Benesty,et al.  Springer handbook of speech processing , 2007, Springer Handbooks.

[14]  T. Aaron Gulliver,et al.  Speech-Model Based Accurate Blind Reverberation Time Estimation Using an LPC Filter , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Bayya Yegnanarayana,et al.  Enhancement of reverberant speech using LP residual signal , 2000, IEEE Trans. Speech Audio Process..

[16]  Philipos C. Loizou,et al.  A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Tomohiro Nakatani,et al.  Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Tomohiro Nakatani,et al.  Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[19]  Rodney A. Kennedy,et al.  Equalization in an acoustic reverberant environment: robustness results , 2000, IEEE Trans. Speech Audio Process..

[20]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[21]  P. Paajarvi,et al.  Blind equalization of PPM signals using third-order moments , 2007, 2007 IEEE 8th Workshop on Signal Processing Advances in Wireless Communications.

[22]  Peter Kabal,et al.  Reverberant speech enhancement using cepstral processing , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[23]  DeLiang Wang,et al.  A two-stage algorithm for one-microphone reverberant speech enhancement , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Emanuel A. P. Habets,et al.  Temporal selective dereverberation of noisy speech using one microphone , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[25]  Takuya Yoshioka,et al.  Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[27]  James P. LeBlanc,et al.  Online adaptive blind deconvolution based on third-order moments , 2005, IEEE Signal Processing Letters.

[28]  P. Krishnamoorthy,et al.  Reverberant Speech Enhancement by Temporal and Spectral Processing , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  J.-M. Boucher,et al.  A New Method Based on Spectral Subtraction for Speech Dereverberation , 2001 .

[30]  Emanuel A. P. Habets,et al.  Late Reverberant Spectral Variance Estimation Based on a Statistical Model , 2009, IEEE Signal Processing Letters.

[31]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[32]  Panos E. Papamichalis,et al.  Practical approaches to speech coding , 1987 .

[33]  Israel Cohen,et al.  Speech enhancement using a noncausal a priori SNR estimator , 2004, IEEE Signal Processing Letters.

[34]  Richard Heusdens,et al.  Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[35]  E.A.P. Habets,et al.  Speech dereverberation using backward estimation of the late reverberant spectral variance , 2008, 2008 IEEE 25th Convention of Electrical and Electronics Engineers in Israel.

[36]  Saeed Mosayyebpour,et al.  Single Channel Inverse Filtering of Room Impulse Response by Maximizing Skewness of LP Residual , 2010, 2010 International Conference on Signal Acquisition and Processing.

[37]  Ken'ichi Furuya,et al.  Robust Speech Dereverberation Using Multichannel Blind Deconvolution With Spectral Subtraction , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[38]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .