Magnitude-only reconstruction using a sinusoidal speech modelMagnitude-only reconstruction using a sinusoidal speech model

In this paper a sinusoidal model for the speech waveform is used to develop a new synthesis technique that requires specification of only the amplitudes and frequencies of the component sine waves. These parameters are estimated from the short-time spectral magnitude. The resulting synthetic waveform preserves the short-time spectral magnitude during rapid movements of spectral energy such as voiced/unvoiced transitions, and yields speech of very high quality and intelligibility. The approach is sufficiently flexible to also allow for high-quality time-scale modification with the option of time-varying scaling. Finally, results are given for some initial experiments that explore the possibility of magnitude-only waveform coding at 8 kbps.