Paper information - Magnitude-only reconstruction using a sinusoidal speech model

In this paper a sinusoidal model for the speech waveform is used to develop a new synthesis technique that requires specification of only the amplitudes and frequencies of the component sine waves. These parameters are estimated from the short-time spectral magnitude. The resulting synthetic waveform preserves the short-time spectral magnitude during rapid movements of spectral energy such as voiced/unvoiced transitions, and yields speech of very high quality and intelligibility. The approach is sufficiently flexible to also allow for high-quality time-scale modification with the option of time-varying scaling. Finally, results are given for some initial experiments that explore the possibility of magnitude-only waveform coding at 8 kbps.
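The core idea in the abstract, that sine-wave amplitudes and frequencies can be read off the short-time spectral magnitude and then used to resynthesize a waveform, can be illustrated with a minimal single-frame sketch. This is not the authors' implementation: the function names `pick_peaks` and `synthesize`, the Hann window, the FFT size, and the choice of local maxima of the magnitude as the sine-wave parameters are all assumptions made for illustration. The sketch starts every sine wave at zero phase and does not show frame-to-frame parameter matching, time-scale modification, or coding.

```python
import numpy as np


def pick_peaks(frame, sample_rate, n_fft=1024, max_peaks=20):
    """Estimate sine-wave amplitudes and frequencies from one frame.

    Only the spectral magnitude is used; the phase of the FFT is discarded.
    (Hypothetical helper, assumed for illustration.)
    """
    window = np.hanning(len(frame))
    magnitude = np.abs(np.fft.rfft(frame * window, n_fft))

    # A bin is a peak if it exceeds its left neighbour and is at least
    # as large as its right neighbour.
    interior = magnitude[1:-1]
    is_peak = (interior > magnitude[:-2]) & (interior >= magnitude[2:])
    bins = np.flatnonzero(is_peak) + 1

    # Keep the strongest peaks, then restore ascending frequency order.
    strongest = np.argsort(magnitude[bins])[::-1][:max_peaks]
    bins = np.sort(bins[strongest])

    freqs = bins * sample_rate / n_fft
    # Undo the window gain so a pure tone of amplitude A reports roughly A.
    amps = 2.0 * magnitude[bins] / window.sum()
    return amps, freqs


def synthesize(amps, freqs, n_samples, sample_rate):
    """Sum of sinusoids with the given amplitudes and frequencies.

    Phases start at zero here, which is a simplification assumed
    for this sketch.
    """
    t = np.arange(n_samples) / sample_rate
    out = np.zeros(n_samples)
    for a, f in zip(amps, freqs):
        out += a * np.cos(2.0 * np.pi * f * t)
    return out


if __name__ == "__main__":
    sample_rate = 8000
    t = np.arange(512) / sample_rate
    frame = 0.5 * np.cos(2.0 * np.pi * 500.0 * t)
    amps, freqs = pick_peaks(frame, sample_rate, max_peaks=1)
    rebuilt = synthesize(amps, freqs, len(frame), sample_rate)
    print(freqs, amps, rebuilt.shape)
```

For a real speech signal the same two steps would be applied frame by frame, and the frequency resolution would be limited by the FFT bin spacing unless the peak locations are interpolated.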

Thomas F. Quatieri | Robert J. McAulay

[1] R. Crochiere et al. Speech Coding, 1979, IEEE Transactions on Communications.

[2] Jae Lim et al. Signal reconstruction from short-time Fourier transform magnitude, 1983.

[3] M. Portnoff et al. Time-scale modification of speech based on short-time Fourier analysis, 1981.