Noise PSD estimation by logarithmic baseline tracing

A novel noise power spectral density (PSD) estimator for disturbed speech signals which operates in the short-time Fourier domain is presented. A noise PSD estimate is provided by constrained tracing with time of the noisy observation separately for each frequency bin. The constraint is a limitation of the logarithmic magnitude change between successive time frames. Since speech onset is assumed as sudden rises in the noisy observation, a fixed and adaptive tracing parameter β has been derived to track the contained noise while preventing speech leakage to the noise PSD estimate. The experimental evaluation and comparison with state-of-the-art algorithms, SPP and Minimum Statistics, confirms a lower logarithmic noise estimation error and superior speech enhancement rated in a standard noise reduction system. The proposed concept has extremely low computational complexity and memory usage. Thus, it is well suited for applications where processing power and memory is limited.

[1]  Rainer Martin,et al.  Bias compensation methods for minimum statistics noise power spectral density estimation , 2006, Signal Process..

[2]  Jerry D. Gibson,et al.  Digital coding of waveforms: Principles and applications to speech and video , 1985, Proceedings of the IEEE.

[3]  Gerhard Schmidt,et al.  Low-complexity noise power spectral density estimation for harsh automobile environments , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[4]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[5]  Thomas Esch,et al.  Model-based speech enhancement using SNR dependent MMSE estimation , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[7]  Sungjin Park,et al.  Speech Intelligibility Enhancement using Tunable Equalization Filter , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[8]  Peter Vary,et al.  Recursive Closed-Form Optimization of Spectral Audio Power Allocation for Near End Listening Enhancement , 2010, Sprachkommunikation.

[9]  Thomas Esch,et al.  Noise Reduction for Wideband Speech Exploiting Spectral Dependencies Based on Conditional Estimation , 2010, Sprachkommunikation.

[10]  Jesper Jensen,et al.  MMSE based noise PSD tracking with low complexity , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[12]  Richard C. Hendriks,et al.  Noise power estimation based on the probability of speech presence , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[13]  Gerhard Doblinger,et al.  Computationally efficient speech enhancement by spectral minima tracking in subbands , 1995, EUROSPEECH.

[14]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[15]  Peter Vary,et al.  Selflearning Codebook Speech Enhancement , 2014, ITG Symposium on Speech Communication.