An improved iterative wiener filtering algorithm for speech enhancement

Speech enhancement is a critical technique for various applications like mobile communication and automatic speech recognition (ASR). This paper studies an improved iterative Wiener filtering (IWF) algorithm, which can help enhance the recognition rate of the ASR system. By using the voice activity detection (VAD) technology in the traditional IWF algorithm, the noise power spectrum estimation in the silent periods can be improved. Besides, the mini-tracking algorithm is also employed to estimate the signal-to-noise ratio (SNR) of the present speech signal, which is further used to control the number of iterations for the IWF algorithm. Therefore, the speech-like properties which are important in speech reconstruction and recognition are better retained. Computer simulations are conducted to verify the proposed algorithm has improved performance in ASR system over the conventional approach.

[1]  Simon King,et al.  Speech and Audio Signal Processing , 2011 .

[2]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[3]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[5]  Daniel P. W. Ellis,et al.  Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[6]  Alan V. Oppenheim,et al.  All-pole modeling of degraded speech , 1978 .

[7]  D. A. van Leeuwen,et al.  Speech and Audio Signal Processing , 2011 .

[8]  John H. L. Hansen,et al.  Constrained iterative speech enhancement with application to speech recognition , 1991, IEEE Trans. Signal Process..

[9]  Pavel Sovka,et al.  Noise suppression system for a car , 1993, EUROSPEECH.

[10]  Rainer Martin,et al.  An efficient algorithm to estimate the instantaneous SNR of speech signals , 1993, EUROSPEECH.

[11]  Masafumi Nishimura,et al.  Long-Term Spectro-Temporal and Static Harmonic Features for Voice Activity Detection , 2010, IEEE Journal of Selected Topics in Signal Processing.

[12]  Thippur V. Sreenivas,et al.  Codebook constrained Wiener filtering for speech enhancement , 1996, IEEE Trans. Speech Audio Process..

[13]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[14]  M. Asunción Moreno Bilbao,et al.  Speech enhancement by adaptive wiener filtering based on cumulant ar modelling , 1992 .

[15]  Akihiko Sugiyama,et al.  Noise Suppression with High Speech Quality Based on Weighted Noise Estimation and MMSE STSA , 2002, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..