A New Algorithm for Speech Enhancement Using Wavelet Packet Transform Based on Auditory Model

Human auditory has non-linear characteristics, while wavelet packet transform (WPT) has flexible analysis ability to time-frequency property so that it is more compatible to simulate the human auditory model. In this paper, human auditory model is analyzed, after which a new algorithm for speech enhancement using node-threshold wavelet packet transform based on bark-scaled decomposition is established, multi-resolution singular spectral entropy method is applied to estimate the node noise, and uses soft threshold to deal wavelet transform coefficient. The experiments show that this algorithm is valid on various noise conditions, especially for color noise and non-stationary noise conditions.

[1]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[2]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[3]  Y. Hu,et al.  A subspace approach for enhancing speech corrupted by colored noise , 2002, IEEE Signal Process. Lett..

[4]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[5]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  S. Mallat A wavelet tour of signal processing , 1998 .