Speech enhancement based on novel two-step a priori SNR estimators

A widely used method to determine the a priori SNR from noisy speech is the decision directed (DD) approach, but the a priori SNR follows the a posteriori SNR with a delay of one frame in speech frames. As a consequence, the performance of the noise reduction system degrades. In order to overcome this artifact, we propose three computationally simple and efficient two-step methods based on the minimum mean square error (MMSE), the maximum a posteriori (MAP) and the joint MAP criteria for the estimation of the a priori SNR for speech enhancement. The proposed methods avoid the delay problem while keeping the advantages of the DD method. The performance of the proposed a priori SNR estimation methods are evaluated and compared with the conventional DD method by extensive objective quality measures and yield better results than the DD approach-based speech enhancement system.

[1]  Richard V. Cox,et al.  A modular approach to speech enhancement with an application to speech coding , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[2]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Peter Vary,et al.  Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model , 2005, EURASIP J. Adv. Signal Process..

[4]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[5]  David Pearce,et al.  The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[6]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[7]  Akihiko Sugiyama,et al.  Noise Suppression with High Speech Quality Based on Weighted Noise Estimation and MMSE STSA , 2002, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[8]  Aruna Bayya,et al.  Objective measures for speech quality assessment in wireless communications , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.