Qualitative Analysis and Enhancement of Sine Transform Coding
暂无分享,去创建一个
In the past several years, significant progress has been made in the STC (sine transform coder) approach in terms of compression gain and implementation complexity [1]–[3]. As a result, it now appears to be a viable alternative to the CELP in the range of 2.4 kbps to 9.6 kbps. In contrast to the CELP, which is a timedomain, search-oriented technique, the STC compresses speech directly in the frequency domain using a harmonic model without decomposing it into excitation and time-varying filter parts. Because of its simplicity in speech modelling, at least in principle, it provides a tractable means of gaining insight into the process of speech compression.
[1] R. McAulay,et al. "Multirate sinusoidal transform coding at rates from 2.4 kbps to 8 kbps" , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[2] Jae S. Lim,et al. Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.
[3] R. J. McAulay,et al. Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.