Noise Estimation for Speech Enhancement Using Minimum-Spectral-Average and Vowel-Presence Detection Approach

The accuracy of noise estimation is important for the performance of a speech enhancement system. This study proposes using variable segment length for noise tracking and variable thresholds for the determination of speech-presence probability. Initially, the fundamental frequency is estimated to determine whether a frame is a vowel. In the case of a vowel frame, the segment length increases; meanwhile the threshold for speech-presence is decreased. So the noise magnitude is adequately underestimated. The speech distortion is accordingly reduced in enhanced speech. Conversely, the segment length is rapidly decreased during noise-dominant regions. This enables the noise estimate to be updated quickly and the noise variation to be well tracked, yielding background noise being efficiently removed by the process of speech enhancement. Experimental results show that the proposed method can efficiently track the variation of background noise, enabling the performance of speech enhancement to be improved.

[1]  Timo Gerkmann,et al.  Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Justinian P. Rosca,et al.  Speech Noise Estimation using Enhanced Minima Controlled Recursive Averaging , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[3]  Wei-Ping Zhu,et al.  Noise spectrum estimation with improved minimum controlled recursive averaging based on speech enhancement residue , 2012, 2012 IEEE 55th International Midwest Symposium on Circuits and Systems (MWSCAS).

[4]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[5]  Abdeldjalil Aïssa-El-Bey,et al.  Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[7]  Jong-Mo Kum,et al.  Speech Enhancement Based on Minima Controlled Recursive Averaging Incorporating Second-Order Conditional MAP Criterion , 2009, IEEE Signal Processing Letters.

[8]  Sven Nordholm,et al.  Noise estimation with lowcomplexity for speech enhancement , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[9]  Yeou-Jiunn Chen,et al.  Forward-backward minima controlled recursive averaging to speech enhancement , 2013, 2013 IEEE Symposium on Computational Intelligence for Multimedia, Signal and Vision Processing (CIMSIVP).

[10]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[11]  Hamid Reza Abutalebi,et al.  Improved speech enhancement method based on auditory filterbank and fast noise estimation , 2014, 7'th International Symposium on Telecommunications (IST'2014).