Paper information - Auditory modelling for speech processing in the perceptual domain

The human auditory system remains the most robust speech processor, even in noisy environments. This work presents a new computational model of the auditory system, developed by exploring its psychoacoustic masking properties. The model is then applied to speech coding in the perceptual domain. The resulting coding algorithm produces high-quality coded speech and audio that preserve temporal as well as spectral detail. The proposed filterbank is also applied to speech denoising in the perceptual domain, and the enhanced speech is of good perceptual quality.

W. H. Holmes | E. Ambikairajah | L. Lin

[1] Eliathamby Ambikairajah, et al. Auditory filter bank design using masking curves, 2001, INTERSPEECH.

[2] Eliathamby Ambikairajah, et al. Wideband speech and audio coding using gammatone filter banks, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3] Eliathamby Ambikairajah, et al. Auditory masking and MPEG-1 audio compression, 1997.

[4] Gernot Kubin, et al. On speech coding in a perceptual domain, 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5] William M. Hartmann, et al. Psychoacoustics: Facts and Models, 2001.

[6] R. Patterson, et al. Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform, 1995, The Journal of the Acoustical Society of America.

[7] E. Zwicker, et al. Analytical expressions for critical-band rate and critical bandwidth as a function of frequency, 1980.

[8] A. V. Oppenheim, et al. Enhancement and bandwidth compression of noisy speech, 1979, Proceedings of the IEEE.

[9] E. Zwicker, et al. Audio engineering and psychoacoustics: matching signals to the final receiver, the human auditory system, 1991.

[10] Hugo Fastl, et al. Psychoacoustics: Facts and Models, 1990.

[11] Richard F. Lyon, et al. A computational model of filtering, detection, and compression in the cochlea, 1982, ICASSP.