Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band.
暂无分享,去创建一个
The perceptual relevance of adopting the temporal envelope to model the frequency band of 4-7 kHz (highband) in wideband speech signal is described in this letter. Based on theoretical work in psychoacoustics, we find out that the temporal envelope can indeed be a perceptual cue for the high-band signal, i.e., a noiseless sound can be obtained if the temporal envelope is roughly preserved. Subjective listening tests verify that transparent quality can be obtained if the model is used for the 4.5-7 kHz band. The proposed model has the benefits of offering flexible scalability and reducing the cost for quantization in coding applications.
[1] N. Viemeister. Temporal modulation transfer functions based upon modulation thresholds. , 1979, The Journal of the Acoustical Society of America.
[2] O Ghitza,et al. On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. , 2001, The Journal of the Acoustical Society of America.
[3] Doh-Suk Kim. A cue for objective speech quality estimation in temporal envelope representations , 2004, IEEE Signal Processing Letters.