Speech enhancement based on minima controlled recursive averaging incorporating conditional maximum a posteriori criterion

In this paper, we propose a novel approach to improve the performance of minima controlled recursive averaging (MCRA) based on a conditional maximum a posteriori (MAP) criterion. An investigation of the MCRA scheme shows that the method cannot fully account for the inter-frame correlation of voice activity, because the noise power estimate is adjusted by a speech presence probability that depends only on the observation in the current frame. To overcome this limitation, the proposed MCRA approach incorporates the conditional MAP criterion, in which the noise power estimate is obtained using the speech presence probability conditioned on both the current observation and the speech activity decision in the previous frame. Experimental results show that the proposed MCRA technique based on conditional MAP outperforms the conventional MCRA method.
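To make the idea concrete, the following is a minimal per-bin sketch, not the implementation evaluated in this paper. The noise update follows the standard MCRA recursion, in which the smoothing factor is driven by the speech presence probability. The conditioning on the previous decision is illustrated, as an assumption, by switching the decision threshold according to the previous frame's speech activity decision. The function names, the parameters `delta_after_speech` and `delta_after_noise`, and all numeric values are hypothetical and chosen only for illustration.

```python
def update_noise_power(noise_prev, noisy_power, p_speech, alpha_d=0.95):
    """One MCRA-style recursive noise update for a single frequency bin."""
    # Time-varying smoothing factor: it approaches 1 as speech becomes
    # more likely, which slows the noise update during speech.
    alpha_tilde = alpha_d + (1.0 - alpha_d) * p_speech
    return alpha_tilde * noise_prev + (1.0 - alpha_tilde) * noisy_power


def speech_presence_prob(p_prev, ratio, prev_decision, alpha_p=0.2,
                         delta_after_speech=3.0, delta_after_noise=5.0):
    """Smoothed speech presence probability with a decision threshold
    that depends on the previous frame's decision (illustrative assumption).

    ratio is the local energy divided by its tracked minimum.
    """
    # Hypothetical conditioning: use a lower threshold when the previous
    # frame was judged to contain speech, reflecting inter-frame correlation.
    delta = delta_after_speech if prev_decision else delta_after_noise
    decision = ratio > delta
    indicator = 1.0 if decision else 0.0
    p = alpha_p * p_prev + (1.0 - alpha_p) * indicator
    return p, decision


if __name__ == "__main__":
    # Same observation, different previous decisions.
    p_a, d_a = speech_presence_prob(0.0, 4.0, prev_decision=True)
    p_b, d_b = speech_presence_prob(0.0, 4.0, prev_decision=False)
    print(d_a, d_b)
    print(update_noise_power(1.0, 2.0, p_a))
    print(update_noise_power(1.0, 2.0, p_b))
```

In this sketch, an identical observation leads to a speech decision when the previous frame was speech and to a noise decision otherwise, so the noise estimate is updated more cautiously in the first case.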
