Warped discrete cosine transform-based noisy speech enhancement

In this paper, a warped discrete cosine transform (WDCT)-based approach to enhance the degraded speech under background noise environments is proposed. For developing an effective expression of the frequency characteristics of the input speech, the variable frequency warping filter is applied to the conventional discrete cosine transform (DCT). The frequency warping control parameter is adjusted according to the analysis of spectral distribution in each frame. For a more accurate analysis of spectral characteristics, the split-band approach in which the global soft decision for speech presence is performed in each band separately is employed. A number of subjective and objective tests show that the WDCT-based enhancement method yields better performance than the conventional DCT-based algorithm.

[1]  Soo Ngee Koh,et al.  Noisy speech enhancement using discrete cosine transform , 1998, Speech Commun..

[2]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[3]  Joon-Hyuk Chang,et al.  Speech enhancement: new approaches to soft decision , 2000, INTERSPEECH.

[4]  Joon-Hyuk Chang,et al.  Speech enhancement using warped discrete cosine transform , 2002, Speech Coding, 2002, IEEE Workshop Proceedings..

[5]  Li Lee,et al.  Speaker normalization using efficient frequency warping procedures , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6]  Sanjit K. Mitra,et al.  Warped discrete-Fourier transform: Theory and applications , 2001 .

[7]  Sanjit K. Mitra,et al.  Warped discrete cosine transform and its application in image compression , 2000, IEEE Trans. Circuits Syst. Video Technol..

[8]  William J. Byrne,et al.  Speaker normalization with all-pass transforms , 1998, ICSLP.

[9]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[10]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[11]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[12]  Nam Soo Kim,et al.  Spectral enhancement based on global soft decision , 2000, IEEE Signal Processing Letters.

[13]  William M. Hartmann,et al.  Psychoacoustics: Facts and Models , 2001 .

[14]  Sanjit K. Mitra,et al.  The Nonuniform Discrete Fourier Transform and Its Applications in Signal Processing , 1998 .