Magnitude-only reconstruction using a sinusoidal speech modelMagnitude-only reconstruction using a sinusoidal speech model
暂无分享,去创建一个
In this paper a sinusoidal model for the speech waveform is used to develop a new synthesis technique that requires specification of only the amplitudes and frequencies of the component sine waves. These parameters are estimated from the short-time spectral magnitude. The resulting synthetic waveform preserves the short-time spectral magnitude during rapid movements of spectral energy such as voiced/unvoiced transitions, and yields speech of very high quality and intelligibility. The approach is sufficiently flexible to also allow for high-quality time-scale modification with the option of time-varying scaling. Finally, results are given for some initial experiments that explore the possibility of magnitude-only waveform coding at 8 kbps.
[1] R. Crochiere,et al. Speech Coding , 1979, IEEE Transactions on Communications.
[2] Jae Lim,et al. Signal reconstruction from short-time Fourier transform magnitude , 1983 .
[3] M. Portnoff,et al. Time-scale modification of speech based on short-time Fourier analysis , 1981 .